Anchen 
							
						 
					 
					
						
						
							
						
						527cea4027 
					 
					
						
						
							
							chore: fix the convert.py script for weights are not sanitized and support quant for non-32 dimensions ( #340 )  
						
						 
						
						... 
						
						
						
						* chore: fix convert script for weights not sanitized and suport quant for non 32 dim
* Update llms/mlx_lm/utils.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* chore: fix typo
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com > 
						
						
					 
					
						2024-01-19 21:07:21 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								bojanbabic 
							
						 
					 
					
						
						
							
						
						61297f547b 
					 
					
						
						
							
							Missing requirements needed for convert script ( #320 )  
						
						 
						
						... 
						
						
						
						* fix requirements and add eos parameter
* fix black
* address comment
* address comments - remove new arg 
						
						
					 
					
						2024-01-18 19:04:24 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						bcc9fc3581 
					 
					
						
						
							
							two minor fixes ( #335 )  
						
						 
						
						
						
						
					 
					
						2024-01-18 14:18:13 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Zheng Qu 
							
						 
					 
					
						
						
							
						
						d8680a89f9 
					 
					
						
						
							
							Add argument --save-every N to lora.py for saving model regularly ( #310 )  
						
						 
						
						
						
						
					 
					
						2024-01-16 20:03:33 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								LeonEricsson 
							
						 
					 
					
						
						
							
						
						b4c20cc7f7 
					 
					
						
						
							
							Stable Diffusion: Input image downsampling ( #276 )  
						
						 
						
						
						
						
					 
					
						2024-01-16 13:45:00 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								AtomicVar 
							
						 
					 
					
						
						
							
						
						2ba5d3db14 
					 
					
						
						
							
							Refactor activation function and loss calculation ( #325 )  
						
						 
						
						
						
						
					 
					
						2024-01-16 13:42:56 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								AtomicVar 
							
						 
					 
					
						
						
							
						
						ce7b65e8c4 
					 
					
						
						
							
							Fix import order of normalizing_flow ( #326 )  
						
						 
						
						
						
						
					 
					
						2024-01-16 08:45:55 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								someone 
							
						 
					 
					
						
						
							
						
						2287294723 
					 
					
						
						
							
							fix mlx_lm generator for chinese ( #321 )  
						
						 
						
						... 
						
						
						
						* fix generator for chinese
* add REPLACEMENT_CHAR
---------
Co-authored-by: cg <cg@qq.com > 
						
						
					 
					
						2024-01-16 07:13:33 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b0870ed679 
					 
					
						
						
							
							fix response + bump version ( #319 )  
						
						 
						
						
						
						
					 
					
						2024-01-15 11:51:21 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						195bec2fa3 
					 
					
						
						
							
							feat(mlx_lm): add mixtral support in mlx_lm ( #318 )  
						
						 
						
						... 
						
						
						
						* feat: add mixtral support in mlx_lm
* chore: update doc 
						
						
					 
					
						2024-01-15 07:18:14 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Siddharth Mishra-Sharma 
							
						 
					 
					
						
						
							
						
						19b6167d81 
					 
					
						
						
							
							Normalizing flow example ( #133 )  
						
						 
						
						... 
						
						
						
						* Implement normalizing flow Real NVP example
* Add requirements and basic usage to normalizing flow example
* Minor changes to README in normalizing flow example
* Remove trailing commas in function arguments for unified formatting in flows example
* Fix minor typos, add some annotations
* format + nits in README
* readme fix
* mov, minor changes in main, copywright
* remove debug
* fix
* Simplified class structure in distributions; better code re-use in bijectors
* Remove rogue space
* change name again
* nits
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-13 16:58:48 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Marcel Bischoff 
							
						 
					 
					
						
						
							
						
						cd3cff0858 
					 
					
						
						
							
							Phixtral ( #290 )  
						
						 
						
						... 
						
						
						
						* initial
* file
* remove debug
* Adding README
* typo
* simplify readme
* nits in readmes
---------
Co-authored-by: Marcel Bischoff <marcel.bischoff@awarehq.com >
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-13 08:35:03 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						a39b735c3b 
					 
					
						
						
							
							chore(mlx-lm): update phi2 model args to sync with hf config format. ( #311 )  
						
						 
						
						... 
						
						
						
						* chore(mlx-lm): update phi2 model args to sync with hf config format
* chore: fix type hint 
						
						
					 
					
						2024-01-13 07:51:45 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Yousif 
							
						 
					 
					
						
						
							
						
						7575125d5d 
					 
					
						
						
							
							Added lora support for Phi-2 ( #302 )  
						
						 
						
						... 
						
						
						
						* Added lora support for Phi-2
* Added Phi-2 support in fuse and convert
* format + readme
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-12 13:45:30 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Alexandre Boucaud 
							
						 
					 
					
						
						
							
						
						3ac731dd4f 
					 
					
						
						
							
							Fix TypeError in whisper benchmark script ( #306 )  
						
						 
						
						... 
						
						
						
						* Add missing keyword to the decoding options
* Reverting last commit
* Fixing transcribe keyword in benckmark.py
* Add argument name to load_model
This is intended to avoid confusion 
						
						
					 
					
						2024-01-12 13:07:15 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Pedro Cuenca 
							
						 
					 
					
						
						
							
						
						ef93979973 
					 
					
						
						
							
							Update model card uploaded with converted models ( #309 )  
						
						 
						
						
						
						
					 
					
						2024-01-12 13:03:52 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						1fa40067fe 
					 
					
						
						
							
							Change tuple type definitions to use Tuple ( #308 )  
						
						 
						
						
						
						
					 
					
						2024-01-12 11:15:09 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						c1342b8e89 
					 
					
						
						
							
							Use pip for mlx data with speech commands ( #307 )  
						
						 
						
						... 
						
						
						
						* update to use pypi mlx data
* nit in readme 
						
						
					 
					
						2024-01-12 11:06:33 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						c6440416a2 
					 
					
						
						
							
							Mlx llm package ( #301 )  
						
						 
						
						... 
						
						
						
						* fix converter
* add recursive files
* remove gitignore
* remove gitignore
* add packages properly
* read me update
* remove dup readme
* relative
* fix convert
* fix community name
* fix url
* version 
						
						
					 
					
						2024-01-12 10:25:56 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Markus Enzweiler 
							
						 
					 
					
						
						
							
						
						2b61d9deb6 
					 
					
						
						
							
							Updated CIFAR-10 ResNet example to use BatchNorm instead of LayerNorm ( #257 )  
						
						 
						
						... 
						
						
						
						* replaced nn.LayerNorm by nn.BatchNorm
* mlx>=0.0.8 required
* updated default to 30 epochs instead of 100
* updated README after adding BatchNorm
* requires mlx>=0.0.9
* updated README.md with results for mlx-0.0.9 
						
						
					 
					
						2024-01-12 05:43:11 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						6217d7acd0 
					 
					
						
						
							
							Delete llms/hf_llm/models/.gitignore ( #300 )  
						
						 
						
						
						
						
					 
					
						2024-01-11 16:56:50 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						a2402116ae 
					 
					
						
						
							
							refactor(hf_llm): moving phi2 example into hf_llm ( #293 )  
						
						 
						
						... 
						
						
						
						* refactor: moving phi2 example into hf_llm
* chore: clean up
* chore: update phi2 model args so it can load args from config
* fix phi2 + nits + readme
* allow any HF repo, update README
* fix bug in llama
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-11 12:29:12 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anjor Kanekar 
							
						 
					 
					
						
						
							
						
						e74889d0fa 
					 
					
						
						
							
							prompt parameter ( #291 )  
						
						 
						
						
						
						
					 
					
						2024-01-11 06:04:57 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						7380ebfb0d 
					 
					
						
						
							
							fix: undefined hf_path ( #292 )  
						
						 
						
						
						
						
					 
					
						2024-01-11 05:53:52 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Konstantin Kerekovski 
							
						 
					 
					
						
						
							
						
						047d4650c4 
					 
					
						
						
							
							Add -local flag to llms/hf_llm/convert.py for reading source HF models from filesystem. ( #260 )  
						
						 
						
						... 
						
						
						
						* * Add --local flag for reading models from filesystem and related code for doing so
* Disable uploading to huggingface if --local flag is set
* Remove code related to .bin files and merge fetch_from_local and fetch_from_hub into one function.
* Update llms/hf_llm/convert.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* format / nits
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-10 19:53:01 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						80d18671ad 
					 
					
						
						
							
							[Lora] Fix generate ( #282 )  
						
						 
						
						... 
						
						
						
						* fix generate
* update readme, fix test, better default
* nits
* typo 
						
						
					 
					
						2024-01-10 16:13:06 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Rishi Narang 
							
						 
					 
					
						
						
							
						
						a2bc8426f2 
					 
					
						
						
							
							Update txt2image.py ( #285 )  
						
						 
						
						... 
						
						
						
						added np alias 
						
						
					 
					
						2024-01-10 09:31:59 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Alwin Arrasyid 
							
						 
					 
					
						
						
							
						
						2bbe9d3bd8 
					 
					
						
						
							
							fix use of args in generate function ( #284 )  
						
						 
						
						
						
						
					 
					
						2024-01-10 08:09:21 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Vaibhav Srivastav 
							
						 
					 
					
						
						
							
						
						44f86092ea 
					 
					
						
						
							
							Fix Tokenizer save error. ( #278 )  
						
						 
						
						
						
						
					 
					
						2024-01-10 05:49:32 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						841c8f7b30 
					 
					
						
						
							
							fix max tokens ( #275 )  
						
						 
						
						
						
						
					 
					
						2024-01-09 21:41:12 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						7cfda327fd 
					 
					
						
						
							
							fix(lora): tokenizer return incompatible mx array ( #271 )  
						
						 
						
						... 
						
						
						
						* fix(lora): tokenizer return incompatible encodeing mx array
* add readme nit
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-09 19:46:38 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						7b258f33ac 
					 
					
						
						
							
							Move lora example to use the same model format / conversion as hf_llm ( #252 )  
						
						 
						
						... 
						
						
						
						* huffing face the lora example to allow more models
* fixes
* comments
* more readme nits
* fusion + works better for qlora
* nits'
* comments 
						
						
					 
					
						2024-01-09 11:14:52 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						bbd7172eef 
					 
					
						
						
							
							Some fixes / cleanup for BERT example ( #269 )  
						
						 
						
						... 
						
						
						
						* some fixes/cleaning for bert + test
* nit 
						
						
					 
					
						2024-01-09 08:44:51 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						6759dfddf1 
					 
					
						
						
							
							Fix SD image conversion ( #266 )  
						
						 
						
						
						
						
					 
					
						2024-01-09 08:41:31 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Alwin Arrasyid 
							
						 
					 
					
						
						
							
						
						6e6eff326e 
					 
					
						
						
							
							fix: use of undefined args in generate function in phi-2 example ( #265 )  
						
						 
						
						
						
						
					 
					
						2024-01-09 06:43:59 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Vaibhav Srivastav 
							
						 
					 
					
						
						
							
						
						bb35e878cb 
					 
					
						
						
							
							[Whisper] Add load from Hub. ( #255 )  
						
						 
						
						... 
						
						
						
						* Add load from Hub.
* Up. 
						
						
					 
					
						2024-01-08 06:20:00 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Vaibhav Srivastav 
							
						 
					 
					
						
						
							
						
						d4c3a9cb54 
					 
					
						
						
							
							[Whisper] Add HF Hub upload option. ( #254 )  
						
						 
						
						... 
						
						
						
						* Add HF Hub upload option.
* up.
* Add missing requirements. 
						
						
					 
					
						2024-01-08 06:18:24 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						6e5b0de4d3 
					 
					
						
						
							
							refactor: make the phi2 example can be directly load the model from hf without convert needed ( #253 )  
						
						 
						
						... 
						
						
						
						* refactor: make the phi2 example can be directly load the model from hf without convert needed
* chore: add super().__init__() for all module, otherwise will cause error in lora 
						
						
					 
					
						2024-01-08 06:01:23 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Nino Risteski 
							
						 
					 
					
						
						
							
						
						9742ad0f51 
					 
					
						
						
							
							Update README.md ( #248 )  
						
						 
						
						... 
						
						
						
						fixed a few typos 
						
						
					 
					
						2024-01-07 20:13:58 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						485fb9ac0f 
					 
					
						
						
							
							quantize linear ( #250 )  
						
						 
						
						
						
						
					 
					
						2024-01-07 18:48:59 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Ikko Eltociear Ashimine 
							
						 
					 
					
						
						
							
						
						737b4c81a3 
					 
					
						
						
							
							Update README.md ( #251 )  
						
						 
						
						... 
						
						
						
						minor fix 
						
						
					 
					
						2024-01-07 11:35:39 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								bofeng huang 
							
						 
					 
					
						
						
							
						
						bf9926489e 
					 
					
						
						
							
							[Whisper] Add word timestamps and confidence scores ( #201 )  
						
						 
						
						... 
						
						
						
						* Add word timestamps and confidence scores
* Create a separate forward_with_cross_qk function
* Move multiple ops from np to mlx, clean comments
* Save alignment_heads
* Cast qk to fp32
* Add test for word-level timestamps and confidence scores
* format + readme
* nit
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-07 10:01:29 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								mc0ps 
							
						 
					 
					
						
						
							
						
						25ebd36112 
					 
					
						
						
							
							Fix typo in lora convert.py ( #245 )  
						
						 
						
						
						
						
					 
					
						2024-01-07 03:30:30 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Nino Risteski 
							
						 
					 
					
						
						
							
						
						b152d12d7b 
					 
					
						
						
							
							Update README.md ( #243 )  
						
						 
						
						... 
						
						
						
						a few typos 
						
						
					 
					
						2024-01-06 11:44:49 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						758f05c09a 
					 
					
						
						
							
							refactor: merge deepseek coder example into hf_llm example ( #234 )  
						
						 
						
						... 
						
						
						
						* refactor: merge deepseek coder example into hf_llm example
* remove deepseek example
* chore: fix format in readme
* chore: remove default rope_scaling dict and use get to access type and factor to avoid key error
* Update llms/hf_llm/models.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* chore: fix lint
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com > 
						
						
					 
					
						2024-01-06 07:53:46 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						cf0ad26a89 
					 
					
						
						
							
							force fp16 for quantized models ( #240 )  
						
						 
						
						
						
						
					 
					
						2024-01-05 21:29:15 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Lawrence Wu 
							
						 
					 
					
						
						
							
						
						37856f70a8 
					 
					
						
						
							
							add numpy as a requirement to run lora.py ( #238 )  
						
						 
						
						... 
						
						
						
						* add numpy as a requirement to run lora.py
* removed unused imports 
						
						
					 
					
						2024-01-05 16:16:28 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						37b41cec60 
					 
					
						
						
							
							Qlora ( #219 )  
						
						 
						
						... 
						
						
						
						qlora 
						
						
					 
					
						2024-01-04 21:05:59 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Christian Bieniak 
							
						 
					 
					
						
						
							
						
						4fa659acbd 
					 
					
						
						
							
							Handle receiving 0 tokens gracefully ( #231 )  
						
						 
						
						... 
						
						
						
						* handle 0 tokens gracefully
* Formatting
* Move no token check to statistics section 
						
						
					 
					
						2024-01-04 19:14:13 -08:00  
					
					
						 
						
						
							
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Andy Peatling 
							
						 
					 
					
						
						
							
						
						12c9bafbf5 
					 
					
						
						
							
							Update README.md to fix --hf-model param call. ( #229 )  
						
						 
						
						... 
						
						
						
						Update `--hf-model` to `--hf-path` since the `--hf-model` param does not exist in convert.py. 
						
						
					 
					
						2024-01-04 11:53:51 -08:00