mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-08-30 02:53:41 +08:00

Author	SHA1	Message	Date
Chime Ogbuji	791727fa1c	Merge remote-tracking branch 'origin/completion_only' into completion_only	2024-11-10 09:54:49 -05:00
Chime Ogbuji	01e330d6bb	Add input masking for fine-tuning in documentation. Renamed the batch iteration function (iterate_delineated_batches -> iterate_completion_batches).	2024-11-10 09:54:32 -05:00
Chime Ogbuji	3080102b80	Merge branch 'ml-explore:main' into completion_only	2024-11-09 12:52:30 -05:00
madroid	1e07660184	FLUX: save train config (#1049)	2024-11-08 17:15:19 -08:00
Chime Ogbuji	7f89ace55e	Merge branch 'ml-explore:main' into completion_only	2024-11-08 18:46:38 -05:00
Awni Hannun	657b4cc0aa	[MLX LM] Sampler refactor + a few improvements (#1094) * starting * refactor sampler/processor and a few improvements * fix stream * fix stream generate * fix eos handling in stream generate	2024-11-07 16:15:24 -08:00
Chime Ogbuji	bfa6c2932e	Fix	2024-11-06 20:29:12 -05:00
Chime Ogbuji	960ed79a6b	Update sublist search and calculation of input id length	2024-11-06 13:57:59 -05:00
Chime Ogbuji	3c76a253db	Fix variable reference	2024-11-06 12:33:49 -05:00
Chime Ogbuji	4b88c33a26	Updates CL lora tuner with input masking that uses default_loss (and iterate_batches) by default.	2024-11-05 19:10:01 -05:00
Chime Ogbuji	e0d66f5479	Merge remote-tracking branch 'origin/completion_only' into completion_only	2024-11-05 15:26:08 -05:00
Chime Ogbuji	5579b48974	Minor documentation update	2024-11-05 15:25:45 -05:00
Chime Ogbuji	603dab57be	Merge branch 'ml-explore:main' into completion_only	2024-11-05 15:18:05 -05:00
Chime Ogbuji	b7b3332dc5	Replace iterate_input_masked_batches with iterate_delineated_batches, an updated attempt to better sync with iterate_batches logic	2024-11-05 15:17:23 -05:00
Angelos Katharopoulos	ed9e81dd58	Fix rotating kv cache size (#1093)	2024-11-05 10:24:24 -08:00
Awni Hannun	6fd1f70f73	fix spm decoder multi-byte (#1092)	2024-11-05 06:06:26 -08:00
Chime Ogbuji	a1fbc52cf2	Merge branch 'ml-explore:main' into completion_only	2024-11-04 22:00:55 -05:00
Anthony Wu	4394633ce0	mlx_whisper: add support for audio input from stdin (#1012) * add support for audio and input name from stdin * refactored to stdin - arg, and output-name template * fix bugs, add test coverage * fix doc to match arg rename * some nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-11-04 14:02:13 -08:00
ilyasch2	3b526f0aa1	Add support for falcon-mamba (#1074) * Add support for falcon-mamba * nits * nit --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-11-04 12:23:30 -08:00
Anchen	82e3338987	chore(mlx-lm): add max token arg for mlx_lm.chat (#1089) * chore(mlx-lm): add max token arg for mlx_lm.chat * chore: update the default max token value	2024-11-04 06:06:34 -08:00
Chime Ogbuji	95fb22449b	Merge branch 'ml-explore:main' into completion_only	2024-11-04 08:48:26 -05:00
Angelos Katharopoulos	331148d8ec	Enable distributed LoRA training (#821)	2024-11-02 18:02:31 -07:00
Awni Hannun	29c954f4cb	fix (#1082)	2024-11-02 13:51:38 -07:00
Awni Hannun	0f799947d0	fix (#1079)	2024-11-01 16:30:32 -07:00
Awni Hannun	e510987870	Clear cache every now and then (#1081) * clear cache every now and then * don't need user arg anymore	2024-11-01 14:15:32 -07:00
Awni Hannun	8160e0c4e5	Whisper improvements (#1080) * use safetensors in whisper * speed up decoder * version	2024-11-01 10:52:28 -07:00
Alex Barron	85ffd2c96a	Quantized KV Cache (#1075) * add QuantizedKVCache * simplify * add tests * single sdpa function * fix sed * in place * fix tests * support different k and v head dims	2024-10-31 16:59:52 -07:00
Awni Hannun	9f34fdbda4	Wire models in MLX LM (#1069) * wired in MLX LM * fix synch * comment + nit * version * mlx lm version * bump to 0.19.2	2024-10-31 08:17:14 -07:00
Awni Hannun	8fe9539af7	Fix detokenizer space match for quote (#1072) * fix + test * remove transformer flax/torch warning * format	2024-10-27 15:06:07 -07:00
hschaeufler	ab4bf05c6e	Update lora_config.yaml with new param: num_layers (#1068)	2024-10-26 09:34:46 -07:00
Saurav Maheshkar	4971462bf0	feat(clip): add linear probe evaluation script (#960)	2024-10-24 21:56:17 -07:00
Awni Hannun	9000e280ae	fix mamba models conversion (#1065)	2024-10-22 15:44:08 -07:00
madroid	d1d480867b	LoRA: update tools datasets docs (#1063) * LoRA: update tools datasets docs * nits * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-10-22 12:19:11 -07:00
Awni Hannun	66e7bcb886	override dtype with quant (#1062)	2024-10-22 09:56:45 -07:00
aronson	743763bc2e	Handle empty string case in maybe_trim_space (#1055) * Handle empty string case in maybe_trim_space * nit --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-10-20 20:46:43 -07:00
madroid	f491d473a3	FLUX: Optimize dataset loading logic (#1038)	2024-10-15 10:37:45 -07:00
Zak B. Elep	3d62b058a4	fix: typo on flux model preloading (#1050)	2024-10-15 09:13:01 -07:00
madroid	bbd2003047	FLUX: update README.md (#1036)	2024-10-14 11:21:41 -07:00
Awni Hannun	605c4854f1	Prompt caching in `mlx_lm.server` (#1026) * caching in server * nits * fix tests * don't throw if no metal * comments	2024-10-14 10:57:22 -07:00
Awni Hannun	8dca1a2f60	Tokenizer updates + tests (#1024) * tokenizer updates + tests * nit * add can_trim_prompt_cache * nits	2024-10-14 10:48:46 -07:00
Awni Hannun	6c368f2124	bump mac tests to use py39 (#1047)	2024-10-14 10:40:36 -07:00
Awni Hannun	c799133998	Make llm async eval less brittle (#1040) * Make llm async eval less brittle * nit	2024-10-14 10:25:24 -07:00
Seitaro Sugawara	1e0cda68c6	Update README.md (#1045) * Update README.md A small typo was fixed in the musicgen README.md. * Update musicgen/README.md --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-10-14 06:21:25 -07:00
Shunta Saito	7612c646f3	Fix PLaMo model to support Grouped Query Attention (#1037)	2024-10-12 15:26:50 -07:00
Ivan Fioravanti	d8611dd69f	Small typo fixed in flux README.md (#1035)	2024-10-12 06:14:01 -07:00
Angelos Katharopoulos	a5f2bab070	Add FLUX finetuning (#1028)	2024-10-11 21:17:41 -07:00
Alex Barron	d72fdeb4ee	MusicGen (#1020) * Add MusicGen model * add benchmarks * change to from_pretrained * symlinks * add readme and requirements * fix readme * readme	2024-10-11 10:16:20 -07:00
Awni Hannun	4360e7ccec	clear cache during prompt processing (#1027)	2024-10-09 16:48:32 -07:00
Awni Hannun	b7373cb44f	fix long prompt generations (#1023)	2024-10-09 11:09:36 -07:00
Awni Hannun	fca087be49	More cache improvements (#1015) * fix rotating kv cache for chat use case * reorg + fixes to caching, unify prompt caching across types and use cases for e.g. caching during a chat * nit in chat * fix tests * fix tests * fix tests * docs * chat command * comments + docs * Define meta_state on all Cache implementations * fixes + trim_prompt_cache api * fix default model --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-10-07 20:45:51 -07:00

576 Commits