Anchen 
							
						 
					 
					
						
						
							
						
						758f05c09a 
					 
					
						
						
							
							refactor: merge deepseek coder example into hf_llm example ( #234 )  
						
						... 
						
						
						
						* refactor: merge deepseek coder example into hf_llm example
* remove deepseek example
* chore: fix format in readme
* chore: remove default rope_scaling dict and use get to access type and factor to avoid key error
* Update llms/hf_llm/models.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* chore: fix lint
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com > 
						
						
					 
					
						2024-01-06 07:53:46 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						cf0ad26a89 
					 
					
						
						
							
							force fp16 for quantized models ( #240 )  
						
						
						
						
					 
					
						2024-01-05 21:29:15 -08:00 
						 
				 
			
				
					
						
							
							
								Lawrence Wu 
							
						 
					 
					
						
						
							
						
						37856f70a8 
					 
					
						
						
							
							add numpy as a requirement to run lora.py ( #238 )  
						
						... 
						
						
						
						* add numpy as a requirement to run lora.py
* removed unused imports 
						
						
					 
					
						2024-01-05 16:16:28 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						37b41cec60 
					 
					
						
						
							
							Qlora ( #219 )  
						
						... 
						
						
						
						qlora 
						
						
					 
					
						2024-01-04 21:05:59 -08:00 
						 
				 
			
				
					
						
							
							
								Christian Bieniak 
							
						 
					 
					
						
						
							
						
						4fa659acbd 
					 
					
						
						
							
							Handle receiving 0 tokens gracefully ( #231 )  
						
						... 
						
						
						
						* handle 0 tokens gracefully
* Formatting
* Move no token check to statistics section 
						
						
					 
					
						2024-01-04 19:14:13 -08:00 
						 
				 
			
				
					
						
							
							
								Andy Peatling 
							
						 
					 
					
						
						
							
						
						12c9bafbf5 
					 
					
						
						
							
							Update README.md to fix --hf-model param call. ( #229 )  
						
						... 
						
						
						
						Update `--hf-model` to `--hf-path` since the `--hf-model` param does not exist in convert.py. 
						
						
					 
					
						2024-01-04 11:53:51 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e14afb3e77 
					 
					
						
						
							
							fix to use actual prompt ( #227 )  
						
						
						
						
					 
					
						2024-01-04 11:12:05 -08:00 
						 
				 
			
				
					
						
							
							
								Vaibhav Srivastav 
							
						 
					 
					
						
						
							
						
						f95cf30a31 
					 
					
						
						
							
							Fix upload to hub for HF LLMs conversion script. ( #221 )  
						
						... 
						
						
						
						* Fix upload to hub snippet.
* Weights -> model.
* reverting last commit. 
						
						
					 
					
						2024-01-04 06:06:05 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						a5d6d0436c 
					 
					
						
						
							
							Support Hugging Face models ( #215 )  
						
						... 
						
						
						
						* support hf direct models 
						
						
					 
					
						2024-01-03 15:13:26 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						1d09c4fecd 
					 
					
						
						
							
							keep dtype on model conversion ( #186 )  
						
						
						
						
					 
					
						2024-01-02 11:20:29 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						85258b2be7 
					 
					
						
						
							
							make parameter naming consistent with other examples. ( #214 )  
						
						
						
						
					 
					
						2024-01-02 08:18:12 -08:00 
						 
				 
			
				
					
						
							
							
								Anchen 
							
						 
					 
					
						
						
							
						
						e632d7aaaa 
					 
					
						
						
							
							fix: deepseek coder tokenizer error ( #211 )  
						
						
						
						
					 
					
						2024-01-01 06:10:37 -08:00 
						 
				 
			
				
					
						
							
							
								Anchen 
							
						 
					 
					
						
						
							
						
						ee3c44d231 
					 
					
						
						
							
							chore: make the Deepseek example compatible with Yi models. ( #205 )  
						
						... 
						
						
						
						* Update convert.py
* Update convert.py
* Update deepseek_coder.py 
						
						
					 
					
						2023-12-30 06:11:33 -08:00 
						 
				 
			
				
					
						
							
							
								bofeng huang 
							
						 
					 
					
						
						
							
						
						581a5733a1 
					 
					
						
						
							
							[Whisper] Load customized MLX model & Quantization ( #191 )  
						
						... 
						
						
						
						* Add option to load customized mlx model
* Add quantization
* Apply reviews
* Separate model conversion and loading
* Update test
* Fix benchmark
* Add notes about conversion
* Improve doc 
						
						
					 
					
						2023-12-29 10:22:15 -08:00 
						 
				 
			
				
					
						
							
							
								Anchen 
							
						 
					 
					
						
						
							
						
						1cdbf9e886 
					 
					
						
						
							
							chore: fix the load quantization model for deepseek coder ( #203 )  
						
						... 
						
						
						
						* chore: fix the load quantization model
* change to explicitly check for quantization config 
						
						
					 
					
						2023-12-29 05:25:38 -08:00 
						 
				 
			
				
					
						
							
							
								Anchen 
							
						 
					 
					
						
						
							
						
						31ddbd7806 
					 
					
						
						
							
							add deepseek coder example ( #172 )  
						
						... 
						
						
						
						* feat: add example for deepseek coder
* chore: remove hardcoded rope_scaling_factor
* feat: add quantization support
* chore: update readme
* chore: clean up the rope scalling factor param in create cos sin theta
* feat: add repetition_penalty
* style /consistency changes to ease future integration
* nits in README
* one more typo
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-28 21:42:22 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						37fd2464dc 
					 
					
						
						
							
							Add an image2image example in the stable diffusion ( #198 )  
						
						
						
						
					 
					
						2023-12-28 18:31:45 -08:00 
						 
				 
			
				
					
						
							
							
								Benjamin Anderson 
							
						 
					 
					
						
						
							
						
						09566c7257 
					 
					
						
						
							
							add speculative decoding example for llama ( #149 )  
						
						... 
						
						
						
						* speculative decoding
* add sample 0
* spec decode gives same results as regular decode
* rebase
* use accept reject criteria
* switch to t5
* update readme
* readme nit
* nits
* nits
* nits
---------
Co-authored-by: Benjamin Anderson <benjamin@Benjamins-MBP.lan >
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-28 15:20:43 -08:00 
						 
				 
			
				
					
						
							
							
								Dimo 
							
						 
					 
					
						
						
							
						
						07c163d9d9 
					 
					
						
						
							
							[Whisper] Large-v3 requires 128 Mel frequency bins ( #193 )  
						
						... 
						
						
						
						* Large-v3 requires 128 Mel frequency bins
* extract correct model dimensions and use argparse
* format
* format
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-28 13:50:35 -08:00 
						 
				 
			
				
					
						
							
							
								bofeng huang 
							
						 
					 
					
						
						
							
						
						e1e56a625b 
					 
					
						
						
							
							Fix benchmark ( #200 )  
						
						
						
						
					 
					
						2023-12-28 11:29:39 -08:00 
						 
				 
			
				
					
						
							
							
								Sunbir Gill 
							
						 
					 
					
						
						
							
						
						78d207fe27 
					 
					
						
						
							
							Fix generate example in README ( #197 )  
						
						
						
						
					 
					
						2023-12-27 13:11:10 -08:00 
						 
				 
			
				
					
						
							
							
								Jiří Moravčík 
							
						 
					 
					
						
						
							
						
						50fceb1a28 
					 
					
						
						
							
							fix: Add numpy to CIFAR's requirements.txt ( #192 )  
						
						
						
						
					 
					
						2023-12-26 15:18:59 -08:00 
						 
				 
			
				
					
						
							
							
								Sushant 
							
						 
					 
					
						
						
							
						
						a516f4635d 
					 
					
						
						
							
							Fixed the return type for the __call__ method in Attention ( #190 )  
						
						
						
						
					 
					
						2023-12-26 09:32:43 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						2bd20ef0e0 
					 
					
						
						
							
							shard llama model after conversion and unshard on loading ( #174 )  
						
						
						
						
					 
					
						2023-12-25 11:19:43 -08:00 
						 
				 
			
				
					
						
							
							
								Yifan 
							
						 
					 
					
						
						
							
						
						738448c2d4 
					 
					
						
						
							
							QWEN: Fix unsupported ScalarType BFloat16 ( #187 )  
						
						... 
						
						
						
						Fix unsupported ScalarType BFloat16. 
						
						
					 
					
						2023-12-25 06:10:01 -08:00 
						 
				 
			
				
					
						
							
							
								Vidyasagar Bhargava 
							
						 
					 
					
						
						
							
						
						647e48870a 
					 
					
						
						
							
							updated README ( #184 )  
						
						
						
						
					 
					
						2023-12-24 06:19:53 -08:00 
						 
				 
			
				
					
						
							
							
								devonthomas35 
							
						 
					 
					
						
						
							
						
						939086e6a3 
					 
					
						
						
							
							Mixtral: Stop at EOS token ( #183 )  
						
						... 
						
						
						
						* Stop at EOS token
* Precommit format files
* Fix precommit hooks
* Fix precommit hooks 
						
						
					 
					
						2023-12-23 21:25:42 -08:00 
						 
				 
			
				
					
						
							
							
								Kashif Rasul 
							
						 
					 
					
						
						
							
						
						0371d90ccb 
					 
					
						
						
							
							fashion-mnist example ( #180 )  
						
						... 
						
						
						
						* fashion mnist example
* fix from review 
						
						
					 
					
						2023-12-23 07:34:45 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						848f118ac5 
					 
					
						
						
							
							use non-zero exit code on error ( #177 )  
						
						
						
						
					 
					
						2023-12-23 07:10:13 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						092e87211e 
					 
					
						
						
							
							fix bad convert parameter ( #178 )  
						
						
						
						
					 
					
						2023-12-23 07:09:49 -08:00 
						 
				 
			
				
					
						
							
							
								Alvaro Bartolome 
							
						 
					 
					
						
						
							
						
						f4709cb807 
					 
					
						
						
							
							Align CLI args and some smaller fixes ( #167 )  
						
						... 
						
						
						
						* Add `.DS_Store` files to `.gitignore`
* Fix variable naming of `config` in `mixtral/convert.py`
* Align CLI args and minor fixes
* standardize
* one more
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-22 14:34:32 -08:00 
						 
				 
			
				
					
						
							
							
								Vaibhav Srivastav 
							
						 
					 
					
						
						
							
						
						0eaa323c10 
					 
					
						
						
							
							Fix conversion + inference errors. - Mistral ( #176 )  
						
						... 
						
						
						
						* Fix conversion + inference errors.
* wire rope_theta throuugh to nn.RoPE
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-22 14:10:25 -08:00 
						 
				 
			
				
					
						
							
							
								Todsaporn Banjerdkit 
							
						 
					 
					
						
						
							
						
						7ae445f6c7 
					 
					
						
						
							
							feat: add mistral tps ( #173 )  
						
						... 
						
						
						
						* feat: add mistral tps
* eval params before timing + format
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-22 07:55:57 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						188a91074b 
					 
					
						
						
							
							fix typo ( #169 )  
						
						
						
						
					 
					
						2023-12-21 14:17:11 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3cf436b529 
					 
					
						
						
							
							Quantize example ( #162 )  
						
						... 
						
						
						
						* testing quantization
* conversion + quantization working
* one config processor
* quantization in mistral / nits in llama
* args for quantization
* llama / mistral conversion in good shape
* phi2 quantized
* mixtral
* qwen conversion 
						
						
					 
					
						2023-12-21 12:59:37 -08:00 
						 
				 
			
				
					
						
							
							
								Juarez Bochi 
							
						 
					 
					
						
						
							
						
						4c9db80ed2 
					 
					
						
						
							
							Add support for byt5 models ( #161 )  
						
						... 
						
						
						
						* Add support for byt5 models
* Remove unused import 
						
						
					 
					
						2023-12-21 08:46:36 -08:00 
						 
				 
			
				
					
						
							
							
								Deven Mistry 
							
						 
					 
					
						
						
							
						
						6c574dbecf 
					 
					
						
						
							
							update path to load weights ( #164 )  
						
						
						
						
					 
					
						2023-12-21 06:31:17 -08:00 
						 
				 
			
				
					
						
							
							
								Sarthak Yadav 
							
						 
					 
					
						
						
							
						
						4addd02988 
					 
					
						
						
							
							updated results ( #165 )  
						
						
						
						
					 
					
						2023-12-21 06:30:17 -08:00 
						 
				 
			
				
					
						
							
							
								wyanzhao 
							
						 
					 
					
						
						
							
						
						22620de3ee 
					 
					
						
						
							
							1. Add user warning for sequences over 2048 tokens in iterate_batches. ( #166 )  
						
						
						
						
					 
					
						2023-12-21 06:29:31 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						43b6522af2 
					 
					
						
						
							
							rename --model_path to --model-path ( #151 )  
						
						... 
						
						
						
						use same argument convention for mistral/mixtral as for llama convert. 
						
						
					 
					
						2023-12-21 06:28:57 -08:00 
						 
				 
			
				
					
						
							
							
								Deven Mistry 
							
						 
					 
					
						
						
							
						
						3efb1cc2cc 
					 
					
						
						
							
							fix typo in readme ( #163 )  
						
						
						
						
					 
					
						2023-12-20 19:47:41 -08:00 
						 
				 
			
				
					
						
							
							
								Pedro Cuenca 
							
						 
					 
					
						
						
							
						
						ce30cc3d8f 
					 
					
						
						
							
							Use config.json in llama ( #159 )  
						
						... 
						
						
						
						* Use config.json in llama
* Fix pop
* Fix convert
* Typo 
						
						
					 
					
						2023-12-20 10:34:44 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						27c0a8c002 
					 
					
						
						
							
							Add llms subdir + update README ( #145 )  
						
						... 
						
						
						
						* add llms subdir + update README
* nits
* use same pre-commit as mlx
* update readmes a bit
* format 
						
						
					 
					
						2023-12-20 10:22:25 -08:00 
						 
				 
			
				
					
						
							
							
								Vaibhav Srivastav 
							
						 
					 
					
						
						
							
						
						aed14618ca 
					 
					
						
						
							
							Add config.json to Mixtral. ( #158 )  
						
						... 
						
						
						
						* Add config.json to Mixtral.
* Update mixtral/mixtral.py
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
---------
Co-authored-by: Pedro Cuenca <pedro@huggingface.co > 
						
						
					 
					
						2023-12-20 09:47:23 -08:00 
						 
				 
			
				
					
						
							
							
								Pedro Cuenca 
							
						 
					 
					
						
						
							
						
						730c50d00a 
					 
					
						
						
							
							Use config.json, add model_type ( #157 )  
						
						... 
						
						
						
						* Use config.json, add model_type
* Update convert to generate config.json 
						
						
					 
					
						2023-12-20 08:39:37 -08:00 
						 
				 
			
				
					
						
							
							
								Vaibhav Srivastav 
							
						 
					 
					
						
						
							
						
						4b7e11bd31 
					 
					
						
						
							
							Add URLs to HF MLX-Community org. ( #153 )  
						
						... 
						
						
						
						* up
* Add ref to MLX org on the README.
* nit: language.
* Standardise org name. 
						
						
					 
					
						2023-12-20 06:57:13 -08:00 
						 
				 
			
				
					
						
							
							
								Pedro Cuenca 
							
						 
					 
					
						
						
							
						
						d8e14c858e 
					 
					
						
						
							
							Add --model_path to phi-2 example script ( #152 )  
						
						
						
						
					 
					
						2023-12-20 06:14:35 -08:00 
						 
				 
			
				
					
						
							
							
								Sarthak Yadav 
							
						 
					 
					
						
						
							
						
						b6e62caf2e 
					 
					
						
						
							
							Added Keyword Spotting Transformer + SpeechCommands example ( #123 )  
						
						... 
						
						
						
						* Added Keyword Transformer + SpeechCommands
* minor fixes in README
* some updates / simplifications
* nits
* fixed kwt skip connections
* readme + format
* updated acknowledgements
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-19 14:17:48 -08:00 
						 
				 
			
				
					
						
							
							
								Juarez Bochi 
							
						 
					 
					
						
						
							
						
						ebbb7083cc 
					 
					
						
						
							
							T5: Change default dtype to bfloat16 ( #147 )  
						
						... 
						
						
						
						* T5: Change default to bfloat16
* Add myself to contributors
* t5: Change convert.py default to float32 
						
						
					 
					
						2023-12-19 13:44:36 -08:00 
						 
				 
			
				
					
						
							
							
								Junyi Mei 
							
						 
					 
					
						
						
							
						
						62b455f801 
					 
					
						
						
							
							Add Qwen example ( #134 )  
						
						... 
						
						
						
						* Add qwen model draft
* Add readme and requirements for qwen example
* Add model and tokenizer options
* Fix convert and tokenizer
* some updates / style consistency
* move to llm subdir
* readme nit
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-19 13:06:19 -08:00