dmdaksh 
							
						 
					 
					
						
						
							
						
						7d7e236061 
					 
					
						
						
							
							- Removed unused Python imports ( #683 )  
						
						... 
						
						
						
						- bert/model.py:10: tree_unflatten
  - bert/model.py:2: dataclass
  - bert/model.py:8: numpy
  - cifar/resnet.py:6: Any
  - clip/model.py:15: tree_flatten
  - clip/model.py:9: Union
  - gcn/main.py:8: download_cora
  - gcn/main.py:9: cross_entropy
  - llms/gguf_llm/models.py:12: tree_flatten, tree_unflatten
  - llms/gguf_llm/models.py:9: numpy
  - llms/mixtral/mixtral.py:12: tree_map
  - llms/mlx_lm/models/dbrx.py:2: Dict, Union
  - llms/mlx_lm/tuner/trainer.py:5: partial
  - llms/speculative_decoding/decoder.py:1: dataclass, field
  - llms/speculative_decoding/decoder.py:2: Optional
  - llms/speculative_decoding/decoder.py:5: mlx.nn
  - llms/speculative_decoding/decoder.py:6: numpy
  - llms/speculative_decoding/main.py:2: glob
  - llms/speculative_decoding/main.py:3: json
  - llms/speculative_decoding/main.py:5: Path
  - llms/speculative_decoding/main.py:8: mlx.nn
  - llms/speculative_decoding/model.py:6: tree_unflatten
  - llms/speculative_decoding/model.py:7: AutoTokenizer
  - llms/tests/test_lora.py:13: yaml_loader
  - lora/lora.py:14: tree_unflatten
  - lora/models.py:11: numpy
  - lora/models.py:3: glob
  - speechcommands/kwt.py:1: Any
  - speechcommands/main.py:7: mlx.data
  - stable_diffusion/stable_diffusion/model_io.py:4: partial
  - whisper/benchmark.py:5: sys
  - whisper/test.py:5: subprocess
  - whisper/whisper/audio.py:6: Optional
  - whisper/whisper/decoding.py:8: mlx.nn 
						
						
					 
					
						2024-04-16 07:50:32 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b8a348c1b8 
					 
					
						
						
							
							Switch to fast RMS/LN Norm ( #603 )  
						
						... 
						
						
						
						* use nn.RMSNorm, use sdpa, cleanup
* bump mlx versions
* minor update
* use fast layer norm
* version bump
* update requirement for whisper
* update requirement for gguf 
						
						
					 
					
						2024-03-23 07:13:51 -07:00 
						 
				 
			
				
					
						
							
							
								Sergey Shumov 
							
						 
					 
					
						
						
							
						
						15ecf692b9 
					 
					
						
						
							
							Bug fix in lora.py ( #468 )  
						
						
						
						
					 
					
						2024-02-20 12:53:30 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						8fd953ee2b 
					 
					
						
						
							
							Support for slerp merging models ( #455 )  
						
						... 
						
						
						
						* support for slerp merging models
* docs
* update docs
* format' 
						
						
					 
					
						2024-02-19 20:37:15 -08:00 
						 
				 
			
				
					
						
							
							
								Ovid 
							
						 
					 
					
						
						
							
						
						8c9148a8fd 
					 
					
						
						
							
							Make it easier to know in which file we have bad JSON data ( #458 )  
						
						... 
						
						
						
						* Make it easier to know which file we have bad JSON data in.
* Use a loop rather than repeat code sections.
I previously had these as separate cut-n-drooled sections of code. This change makes it a clean loop.
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* Small fix to previous code suggestion to restore a missing variable.
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com > 
						
						
					 
					
						2024-02-19 20:11:45 -08:00 
						 
				 
			
				
					
						
							
							
								ZHAOKAI WANG 
							
						 
					 
					
						
						
							
						
						0340113e02 
					 
					
						
						
							
							BUG FIX: Decoding results in garbled text when multiple tokens represent a single character (e.g., Chinese). ( #398 )  
						
						... 
						
						
						
						* Decoding results in garbled text when multiple tokens represent a single character (e.g., Chinese).
* Decoding results in garbled text when multiple tokens represent a single character (e.g., Chinese). 
						
						
					 
					
						2024-01-31 19:27:29 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						5aa652d3c2 
					 
					
						
						
							
							remove simplify ( #379 )  
						
						
						
						
					 
					
						2024-01-26 13:54:49 -08:00 
						 
				 
			
				
					
						
							
							
								Anchen 
							
						 
					 
					
						
						
							
						
						8022083979 
					 
					
						
						
							
							feat(lora): add de-quantized support for fuse.py ( #351 )  
						
						... 
						
						
						
						* feat(lora): add de-quantized support for fuse.py
* address comments 
						
						
					 
					
						2024-01-22 17:32:24 -08:00 
						 
				 
			
				
					
						
							
							
								Anchen 
							
						 
					 
					
						
						
							
						
						1415595409 
					 
					
						
						
							
							chore(lora): support mixtral in lora example ( #343 )  
						
						
						
						
					 
					
						2024-01-20 06:07:45 -08:00 
						 
				 
			
				
					
						
							
							
								Zheng Qu 
							
						 
					 
					
						
						
							
						
						d8680a89f9 
					 
					
						
						
							
							Add argument --save-every N to lora.py for saving model regularly ( #310 )  
						
						
						
						
					 
					
						2024-01-16 20:03:33 -08:00 
						 
				 
			
				
					
						
							
							
								Marcel Bischoff 
							
						 
					 
					
						
						
							
						
						cd3cff0858 
					 
					
						
						
							
							Phixtral ( #290 )  
						
						... 
						
						
						
						* initial
* file
* remove debug
* Adding README
* typo
* simplify readme
* nits in readmes
---------
Co-authored-by: Marcel Bischoff <marcel.bischoff@awarehq.com >
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-13 08:35:03 -08:00 
						 
				 
			
				
					
						
							
							
								Yousif 
							
						 
					 
					
						
						
							
						
						7575125d5d 
					 
					
						
						
							
							Added lora support for Phi-2 ( #302 )  
						
						... 
						
						
						
						* Added lora support for Phi-2
* Added Phi-2 support in fuse and convert
* format + readme
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-12 13:45:30 -08:00 
						 
				 
			
				
					
						
							
							
								Anjor Kanekar 
							
						 
					 
					
						
						
							
						
						e74889d0fa 
					 
					
						
						
							
							prompt parameter ( #291 )  
						
						
						
						
					 
					
						2024-01-11 06:04:57 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						80d18671ad 
					 
					
						
						
							
							[Lora] Fix generate ( #282 )  
						
						... 
						
						
						
						* fix generate
* update readme, fix test, better default
* nits
* typo 
						
						
					 
					
						2024-01-10 16:13:06 -08:00 
						 
				 
			
				
					
						
							
							
								Vaibhav Srivastav 
							
						 
					 
					
						
						
							
						
						44f86092ea 
					 
					
						
						
							
							Fix Tokenizer save error. ( #278 )  
						
						
						
						
					 
					
						2024-01-10 05:49:32 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						841c8f7b30 
					 
					
						
						
							
							fix max tokens ( #275 )  
						
						
						
						
					 
					
						2024-01-09 21:41:12 -08:00 
						 
				 
			
				
					
						
							
							
								Anchen 
							
						 
					 
					
						
						
							
						
						7cfda327fd 
					 
					
						
						
							
							fix(lora): tokenizer return incompatible mx array ( #271 )  
						
						... 
						
						
						
						* fix(lora): tokenizer return incompatible encodeing mx array
* add readme nit
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-09 19:46:38 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						7b258f33ac 
					 
					
						
						
							
							Move lora example to use the same model format / conversion as hf_llm ( #252 )  
						
						... 
						
						
						
						* huffing face the lora example to allow more models
* fixes
* comments
* more readme nits
* fusion + works better for qlora
* nits'
* comments 
						
						
					 
					
						2024-01-09 11:14:52 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						485fb9ac0f 
					 
					
						
						
							
							quantize linear ( #250 )  
						
						
						
						
					 
					
						2024-01-07 18:48:59 -08:00 
						 
				 
			
				
					
						
							
							
								mc0ps 
							
						 
					 
					
						
						
							
						
						25ebd36112 
					 
					
						
						
							
							Fix typo in lora convert.py ( #245 )  
						
						
						
						
					 
					
						2024-01-07 03:30:30 -08:00 
						 
				 
			
				
					
						
							
							
								Lawrence Wu 
							
						 
					 
					
						
						
							
						
						37856f70a8 
					 
					
						
						
							
							add numpy as a requirement to run lora.py ( #238 )  
						
						... 
						
						
						
						* add numpy as a requirement to run lora.py
* removed unused imports 
						
						
					 
					
						2024-01-05 16:16:28 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						37b41cec60 
					 
					
						
						
							
							Qlora ( #219 )  
						
						... 
						
						
						
						qlora 
						
						
					 
					
						2024-01-04 21:05:59 -08:00 
						 
				 
			
				
					
						
							
							
								Todsaporn Banjerdkit 
							
						 
					 
					
						
						
							
						
						7ae445f6c7 
					 
					
						
						
							
							feat: add mistral tps ( #173 )  
						
						... 
						
						
						
						* feat: add mistral tps
* eval params before timing + format
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-22 07:55:57 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						188a91074b 
					 
					
						
						
							
							fix typo ( #169 )  
						
						
						
						
					 
					
						2023-12-21 14:17:11 -08:00 
						 
				 
			
				
					
						
							
							
								wyanzhao 
							
						 
					 
					
						
						
							
						
						22620de3ee 
					 
					
						
						
							
							1. Add user warning for sequences over 2048 tokens in iterate_batches. ( #166 )  
						
						
						
						
					 
					
						2023-12-21 06:29:31 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						27c0a8c002 
					 
					
						
						
							
							Add llms subdir + update README ( #145 )  
						
						... 
						
						
						
						* add llms subdir + update README
* nits
* use same pre-commit as mlx
* update readmes a bit
* format 
						
						
					 
					
						2023-12-20 10:22:25 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1e7f4a5921 
					 
					
						
						
							
							fix use for llama 2 from meta ( #144 )  
						
						
						
						
					 
					
						2023-12-18 19:33:17 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						ff0f172363 
					 
					
						
						
							
							32 GB example  
						
						
						
						
					 
					
						2023-12-15 12:20:15 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						ee2ee0f8e5 
					 
					
						
						
							
							32 GB example  
						
						
						
						
					 
					
						2023-12-15 12:18:29 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						8c8f9d6440 
					 
					
						
						
							
							keep base weights in fp16  
						
						
						
						
					 
					
						2023-12-15 10:42:18 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						84f02ef58b 
					 
					
						
						
							
							use lower precision base weights  
						
						
						
						
					 
					
						2023-12-15 10:29:42 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d108c558fc 
					 
					
						
						
							
							more nits  
						
						
						
						
					 
					
						2023-12-15 10:06:14 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						fa51553f09 
					 
					
						
						
							
							fix readme  
						
						
						
						
					 
					
						2023-12-15 09:59:07 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						985f413f99 
					 
					
						
						
							
							custom data with lora  
						
						
						
						
					 
					
						2023-12-15 09:56:10 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						5515c2a75b 
					 
					
						
						
							
							fix "request access" form url for Llama models  
						
						
						
						
					 
					
						2023-12-13 10:19:29 +01:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						a4d932bf26 
					 
					
						
						
							
							fix conversion  
						
						
						
						
					 
					
						2023-12-10 16:56:41 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						036090f508 
					 
					
						
						
							
							few more nits  
						
						
						
						
					 
					
						2023-12-09 14:20:19 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						98f4346c81 
					 
					
						
						
							
							black format  
						
						
						
						
					 
					
						2023-12-09 14:15:25 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b8332a1e66 
					 
					
						
						
							
							generalize lora finetuning for llama and mistral  
						
						
						
						
					 
					
						2023-12-09 14:13:55 -08:00 
						 
				 
			
				
					
						
							
							
								张嘉豪 
							
						 
					 
					
						
						
							
						
						4018aed335 
					 
					
						
						
							
							fix: Unsupported BFloat16 Data Type Issue with MPS Backend  
						
						
						
						
					 
					
						2023-12-08 16:19:35 +08:00 
						 
				 
			
				
					
						
							
							
								waterstone 
							
						 
					 
					
						
						
							
						
						ec97c7531b 
					 
					
						
						
							
							Update README.md  
						
						
						
						
					 
					
						2023-12-07 16:44:29 +08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						31bc57c4ff 
					 
					
						
						
							
							add copyright in source  
						
						
						
						
					 
					
						2023-11-30 11:08:53 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						5d6353aab7 
					 
					
						
						
							
							lora  
						
						
						
						
					 
					
						2023-11-29 14:14:11 -08:00