mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-08-30 02:53:41 +08:00

Author	SHA1	Message	Date
Chime Ogbuji	f9936f77da	Merge branch 'ml-explore:main' into sets_of_hf_datasets	2024-11-08 18:47:27 -05:00
Awni Hannun	657b4cc0aa	[MLX LM] Sampler refactor + a few improvements (#1094 ) * starting * refactor sampler/processor and a few improvements * fix stream * fix stream generate * fix eos handling in stream generate	2024-11-07 16:15:24 -08:00
Chime Ogbuji	90e2da881c	Minor fix	2024-11-06 12:58:00 -05:00
Chime Ogbuji	e45ce38f86	Add ability to fetch raw prompt and completion text from completion datasets	2024-11-06 12:53:54 -05:00
Angelos Katharopoulos	ed9e81dd58	Fix rotating kv cache size (#1093 )	2024-11-05 10:24:24 -08:00
Awni Hannun	6fd1f70f73	fix spm decoder multi-byte (#1092 )	2024-11-05 06:06:26 -08:00
Anthony Wu	4394633ce0	mlx_whisper: add support for audio input from stdin (#1012 ) * add support for audio and input name from stdin * refactored to stdin - arg, and output-name template * fix bugs, add test coverage * fix doc to match arg rename * some nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-11-04 14:02:13 -08:00
ilyasch2	3b526f0aa1	Add support for falcon-mamba (#1074 ) * Add support for falcon-mamba * nits * nit --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-11-04 12:23:30 -08:00
Anchen	82e3338987	chore(mlx-lm): add max token arg for mlx_lm.chat (#1089 ) * chore(mlx-lm): add max token arg for mlx_lm.chat * chore: update the default max token value	2024-11-04 06:06:34 -08:00
Chime Ogbuji	78b24a2375	Fix index calculation	2024-11-03 20:36:55 -05:00
Chime Ogbuji	24f40c3b8d	Fix iteration over HF dataset collection	2024-11-03 20:30:47 -05:00
Chime Ogbuji	e477060a00	Fix keyword argument invokation	2024-11-03 20:26:15 -05:00
Chime Ogbuji	04cf93df55	Fixes to references to hf_datasets	2024-11-03 20:04:15 -05:00
Chime Ogbuji	c72122064a	Fixes to config format in documentattion	2024-11-03 20:00:35 -05:00
Chime Ogbuji	1f6c370690	Updates to LoRA documentation	2024-11-03 19:41:09 -05:00
Chime Ogbuji	9df7bbbe3a	Generalize HF datasets to a collection of HF dataasets via `datasets`, adds support for custom chat HF datasets (#1088 ), and fixes (#1087 )	2024-11-03 19:11:54 -05:00
Angelos Katharopoulos	331148d8ec	Enable distributed LoRA training (#821 )	2024-11-02 18:02:31 -07:00
Awni Hannun	29c954f4cb	fix (#1082 )	2024-11-02 13:51:38 -07:00
Awni Hannun	0f799947d0	fix (#1079 )	2024-11-01 16:30:32 -07:00
Awni Hannun	e510987870	Clear cache every now and then (#1081 ) * clear cache every now and then * don't need user arg anymore	2024-11-01 14:15:32 -07:00
Awni Hannun	8160e0c4e5	Whisper improvements (#1080 ) * use safetensors in whisper * speed up decoder * version	2024-11-01 10:52:28 -07:00
Alex Barron	85ffd2c96a	Quantized KV Cache (#1075 ) * add QuantizedKVCache * simplify * add tests * single sdpa function * fix sed * in place * fix tests * support different k and v head dims	2024-10-31 16:59:52 -07:00
Awni Hannun	9f34fdbda4	Wire models in MLX LM (#1069 ) * wired in MLX LM * fix synch * comment + nit * version * mlx lm version * bump to 0.19.2	2024-10-31 08:17:14 -07:00
Awni Hannun	8fe9539af7	Fix detokenizer space match for quote (#1072 ) * fix + test * remove transformer flax/torch warning * format	2024-10-27 15:06:07 -07:00
hschaeufler	ab4bf05c6e	Update lora_config.yaml with new param: num_layers (#1068 )	2024-10-26 09:34:46 -07:00
Saurav Maheshkar	4971462bf0	feat(clip): add linear probe evaluation script (#960 )	2024-10-24 21:56:17 -07:00
Awni Hannun	9000e280ae	fix mamba models conversion (#1065 )	2024-10-22 15:44:08 -07:00
madroid	d1d480867b	LoRA: update tools datasets docs (#1063 ) * LoRA: update tools datasets docs * nits * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-10-22 12:19:11 -07:00
Awni Hannun	66e7bcb886	override dtype with quant (#1062 )	2024-10-22 09:56:45 -07:00
aronson	743763bc2e	Handle empty string case in maybe_trim_space (#1055 ) * Handle empty string case in maybe_trim_space * nit --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-10-20 20:46:43 -07:00
madroid	f491d473a3	FLUX: Optimize dataset loading logic (#1038 )	2024-10-15 10:37:45 -07:00
Zak B. Elep	3d62b058a4	fix: typo on flux model preloading (#1050 )	2024-10-15 09:13:01 -07:00
madroid	bbd2003047	FLUX: update README.md (#1036 )	2024-10-14 11:21:41 -07:00
Awni Hannun	605c4854f1	Prompt caching in `mlx_lm.server` (#1026 ) * caching in server * nits * fix tests * don't throw if no metal * comments	2024-10-14 10:57:22 -07:00
Awni Hannun	8dca1a2f60	Tokenizer updates + tests (#1024 ) * tokenizer updates + tests * nit * add can_trim_prompt_cache * nits	2024-10-14 10:48:46 -07:00
Awni Hannun	6c368f2124	bump mac tests to use py39 (#1047 )	2024-10-14 10:40:36 -07:00
Awni Hannun	c799133998	Make llm async eval less brittle (#1040 ) * Make llm async eval less brittle * nit	2024-10-14 10:25:24 -07:00
Seitaro Sugawara	1e0cda68c6	Update README.md (#1045 ) * Update README.md A small typo was fixed in the musicgen README.md. * Update musicgen/README.md --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-10-14 06:21:25 -07:00
Shunta Saito	7612c646f3	Fix PLaMo model to support Grouped Query Attention (#1037 )	2024-10-12 15:26:50 -07:00
Ivan Fioravanti	d8611dd69f	Small typo fixed in flux README.md (#1035 )	2024-10-12 06:14:01 -07:00
Angelos Katharopoulos	a5f2bab070	Add FLUX finetuning (#1028 )	2024-10-11 21:17:41 -07:00
Alex Barron	d72fdeb4ee	MusicGen (#1020 ) * Add MusicGen model * add benchmarks * change to from_pretrained * symlinks * add readme and requirements * fix readme * readme	2024-10-11 10:16:20 -07:00
Awni Hannun	4360e7ccec	clear cache during prompt processing (#1027 )	2024-10-09 16:48:32 -07:00
Awni Hannun	b7373cb44f	fix long prompt generations (#1023 )	2024-10-09 11:09:36 -07:00
Awni Hannun	fca087be49	More cache improvements (#1015 ) * fix rotating kv cache for chat use case * reorg + fixes to caching, unify prompt caching across types and use cases for e.g. caching during a chat * nit in chat * fix tests * fix tests * fix tests * docs * chat command * comments + docs * Define meta_state on all Cache implementations * fixes + trim_prompt_cache api * fix default model --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-10-07 20:45:51 -07:00
Awni Hannun	9bc53fc210	convert (#1006 )	2024-10-02 13:13:33 -07:00
madroid	36c1d8e8dc	Server: support function calling (#1003 )	2024-10-02 12:36:07 -07:00
nathan	0866e23a67	repetiton_penalty and logits_bias just using logits_processors (#1004 ) * refactor of repetition_penalty and logits_bias to use logits_processor * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-30 08:49:03 -07:00
Zai Thottakath	418d9a5511	Feature: QDoRA (#891 ) * feat: QDoRA with tests and a small bug fix for recalculation of self.m * some simplifications and fixes --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-30 08:01:11 -07:00
madroid	aa1c8abdc6	LoRA: Support HuggingFace dataset via data parameter (#996 ) * LoRA: support huggingface dataset via `data` argument * LoRA: Extract the load_custom_hf_dataset function * LoRA: split small functions * fix spelling errors * handle load hf dataset error * fix pre-commit lint * update data argument help * nits and doc --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-30 07:36:21 -07:00

1 2 3 4 5 ...

567 Commits