Kevin Wang 
							
						 
					 
					
						
						
							
						
						8a265f0d54 
					 
					
						
						
							
							Fix incorrect type annotation ( #720 )  
						
						... 
						
						
						
						A `Tuple` is missing in this type annotation. 
						
						
					 
					
						2024-04-24 15:52:43 -07:00 
						 
				 
			
				
					
						
							
							
								dmdaksh 
							
						 
					 
					
						
						
							
						
						7d7e236061 
					 
					
						
						
							
							- Removed unused Python imports ( #683 )  
						
						... 
						
						
						
						- bert/model.py:10: tree_unflatten
  - bert/model.py:2: dataclass
  - bert/model.py:8: numpy
  - cifar/resnet.py:6: Any
  - clip/model.py:15: tree_flatten
  - clip/model.py:9: Union
  - gcn/main.py:8: download_cora
  - gcn/main.py:9: cross_entropy
  - llms/gguf_llm/models.py:12: tree_flatten, tree_unflatten
  - llms/gguf_llm/models.py:9: numpy
  - llms/mixtral/mixtral.py:12: tree_map
  - llms/mlx_lm/models/dbrx.py:2: Dict, Union
  - llms/mlx_lm/tuner/trainer.py:5: partial
  - llms/speculative_decoding/decoder.py:1: dataclass, field
  - llms/speculative_decoding/decoder.py:2: Optional
  - llms/speculative_decoding/decoder.py:5: mlx.nn
  - llms/speculative_decoding/decoder.py:6: numpy
  - llms/speculative_decoding/main.py:2: glob
  - llms/speculative_decoding/main.py:3: json
  - llms/speculative_decoding/main.py:5: Path
  - llms/speculative_decoding/main.py:8: mlx.nn
  - llms/speculative_decoding/model.py:6: tree_unflatten
  - llms/speculative_decoding/model.py:7: AutoTokenizer
  - llms/tests/test_lora.py:13: yaml_loader
  - lora/lora.py:14: tree_unflatten
  - lora/models.py:11: numpy
  - lora/models.py:3: glob
  - speechcommands/kwt.py:1: Any
  - speechcommands/main.py:7: mlx.data
  - stable_diffusion/stable_diffusion/model_io.py:4: partial
  - whisper/benchmark.py:5: sys
  - whisper/test.py:5: subprocess
  - whisper/whisper/audio.py:6: Optional
  - whisper/whisper/decoding.py:8: mlx.nn 
						
						
					 
					
						2024-04-16 07:50:32 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b8a348c1b8 
					 
					
						
						
							
							Switch to fast RMS/LN Norm ( #603 )  
						
						... 
						
						
						
						* use nn.RMSNorm, use sdpa, cleanup
* bump mlx versions
* minor update
* use fast layer norm
* version bump
* update requirement for whisper
* update requirement for gguf 
						
						
					 
					
						2024-03-23 07:13:51 -07:00 
						 
				 
			
				
					
						
							
							
								Nino Risteski 
							
						 
					 
					
						
						
							
						
						9742ad0f51 
					 
					
						
						
							
							Update README.md ( #248 )  
						
						... 
						
						
						
						fixed a few typos 
						
						
					 
					
						2024-01-07 20:13:58 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						a5d6d0436c 
					 
					
						
						
							
							Support Hugging Face models ( #215 )  
						
						... 
						
						
						
						* support hf direct models 
						
						
					 
					
						2024-01-03 15:13:26 -08:00 
						 
				 
			
				
					
						
							
							
								Benjamin Anderson 
							
						 
					 
					
						
						
							
						
						09566c7257 
					 
					
						
						
							
							add speculative decoding example for llama ( #149 )  
						
						... 
						
						
						
						* speculative decoding
* add sample 0
* spec decode gives same results as regular decode
* rebase
* use accept reject criteria
* switch to t5
* update readme
* readme nit
* nits
* nits
* nits
---------
Co-authored-by: Benjamin Anderson <benjamin@Benjamins-MBP.lan >
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-28 15:20:43 -08:00