Rifur13 
							
						 
					 
					
						
						
							
						
						2463496471 
					 
					
						
						
							
							[Fix] mx.allclose bug with infinite values ( #539 )  
						
						 
						
						
						
						
						* Added isclose op and fixed comparison with inf values
* Added 'equal_nan' to match numpy
* format
* Add test
* Update python/src/ops.cpp
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* Update python/src/ops.cpp
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* Addressed CR comments
* Update python/src/ops.cpp
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* nits
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-25 20:47:06 -08:00  
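Note on #539: the commit body mentions an isclose op, a fix for comparisons with inf values, and an 'equal_nan' option matching numpy. The scalar sketch below illustrates those NumPy-style semantics in plain Python. It is an assumption-laden illustration and not the MLX implementation: the function names, the default tolerances (taken from NumPy), and the scalar-only handling are all choices made here for clarity.

```python
import math


def isclose(a, b, rtol=1e-5, atol=1e-8, equal_nan=False):
    """Scalar sketch of NumPy-style closeness (illustrative only)."""
    # NaN never compares close unless equal_nan is set and both are NaN.
    if math.isnan(a) or math.isnan(b):
        return equal_nan and math.isnan(a) and math.isnan(b)
    # inf - inf is NaN, so the tolerance formula cannot be used directly;
    # infinities are close only when they are equal (same sign).
    if math.isinf(a) or math.isinf(b):
        return a == b
    return abs(a - b) <= atol + rtol * abs(b)


def allclose(xs, ys, **kwargs):
    """True when every pair of elements is close."""
    return all(isclose(x, y, **kwargs) for x, y in zip(xs, ys))


inf, nan = float("inf"), float("nan")
print(allclose([1.0, inf], [1.0, inf]))          # same-signed infinities match
print(isclose(inf, -inf))                        # opposite signs do not
print(isclose(nan, nan), isclose(nan, nan, equal_nan=True))
```

A naive `abs(a - b) <= tol` check fails for two equal infinities because their difference is NaN, which is presumably the kind of case the fix addresses.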
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f27ec5e097 
					 
					
						
						
							
							More helpful error message in vjp transform + concatenate bug ( #543 )  
						
						 
						
						
						
						
						* more helpful message in vjp transform
* fix concatenate on mismatch dims
* typo
* typo 
						
						
					 
					
						2024-01-24 09:58:33 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f30e63353a 
					 
					
						
						
							
							Minor updates to address a few issues ( #537 )  
						
						 
						
						
						
						
						* docs on arg indices return type
* arange with nan
* undo isort 
						
						
					 
					
						2024-01-23 22:24:41 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Hazem Essam 
							
						 
					 
					
						
						
							
						
						37fc9db82c 
					 
					
						
						
							
							Added Adafactor ( #415 )  
						
						 
						
						
						
						
						* Added adafactor
* Added Adafactor and ran pre-commit
* modified operations
* Added docstrings
* Switched two ops to fix a bug
* added underscore for internal functions and removed the plus sign in the last return statement
* Removed parameter rms from the optimizer state because it's not needed
* Added simple MNIST test for Adafactor and temporary training log
* remove test files
* nits in docs
* comment nit
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-23 15:11:27 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								AtomicVar 
							
						 
					 
					
						
						
							
						
						755dcf6137 
					 
					
						
						
							
							Enable cross_entropy loss to handle dense targets ( #517 )  
						
						 
						
						
						
						
						* Enable cross_entropy loss to handle dense targets
Dense targets are probabilities or one-hot encodings.
* better shape check of weights
* nits in docstring
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-23 12:17:22 -08:00  
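Note on #517: the commit describes dense targets as probabilities or one-hot encodings, as opposed to class indices. The plain-Python sketch below shows how one loss function could accept both forms for a single example. It is not the code from the PR; the function names and the type-based dispatch are assumptions made for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(v - m) for v in logits))
    return [v - lse for v in logits]


def cross_entropy(logits, target):
    """Cross entropy for one example.

    target is either an int class index (sparse target) or a list of
    per-class probabilities / a one-hot vector (dense target).
    """
    logp = log_softmax(logits)
    if isinstance(target, int):
        return -logp[target]
    return -sum(t * lp for t, lp in zip(target, logp))


logits = [2.0, 0.5, -1.0]
print(cross_entropy(logits, 0))                  # class-index target
print(cross_entropy(logits, [1.0, 0.0, 0.0]))    # equivalent one-hot target
print(cross_entropy(logits, [0.7, 0.2, 0.1]))    # soft (probability) target
```

With a one-hot vector the dense form reduces to the class-index form, so the first two calls give the same loss.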
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								LeonEricsson 
							
						 
					 
					
						
						
							
						
						6b4b30e3fc 
					 
					
						
						
							
							Common neural network initializers nn.initializers ( #456 )  
						
						 
						
						
						
						
						* initial commit: constant, normal, uniform
* identity, glorot and he initializers
* docstrings
* rm file
* nits
* nits
* nits
* testing suite
* docs
* nits in docs
* more docs
* remove unused template
* rename package to nn.init
* docs, receptive field
* more docs
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-23 06:47:20 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						98c37d3a22 
					 
					
						
						
							
							use axes in tensordot ( #525 )  
						
						 
						
						
						
						
					 
					
						2024-01-22 21:17:00 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						7a34e46677 
					 
					
						
						
							
							Quantize with groups of 32 ( #511 )  
						
						 
						
						
						
						
						* allow quantize with group sizes of 32
* missing cpu dispatch
* remove print
* Fix qvm for group_size 32
---------
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>
						
						
					 
					
						2024-01-21 06:19:05 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d52383367a 
					 
					
						
						
							
							format ( #510 )  
						
						 
						
						
						
						
					 
					
						2024-01-20 10:33:46 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Arda Orçun 
							
						 
					 
					
						
						
							
						
						363d3add6d 
					 
					
						
						
							
							Add ValueError message for Adamax ( #508 )  
						
						 
						
						
						
						
						* ValueError message added
* beta errors added
* some corrections and testing
* Learning rate limitation deleted 
						
						
					 
					
						2024-01-20 07:56:15 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b207c2c86b 
					 
					
						
						
							
							Power VJP fix for 0 ( #505 )  
						
						 
						
						
						
						
					 
					
						2024-01-20 01:17:40 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						6bf779e72b 
					 
					
						
						
							
							fix array from list for > 32 bit types ( #501 )  
						
						 
						
						
						
						
					 
					
						2024-01-19 15:49:25 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Juarez Bochi 
							
						 
					 
					
						
						
							
						
						ddf50113c5 
					 
					
						
						
							
							GGUF: Load and save metadata ( #446 )  
						
						 
						
						
						
						
						* gguf metadata
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-19 14:06:05 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						f6feb61f92 
					 
					
						
						
							
							feat: add support for saving safetensors in the save_weights ( #497 )  
						
						 
						
						
						
						
						* feat: add save safetensors support in module save_weights
* chore: checking missing changes
* Update python/mlx/nn/layers/base.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* chore: update docstring for load_weights
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
						
						
					 
					
						2024-01-19 06:19:33 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						c4ec836523 
					 
					
						
						
							
							fix isinf for integer types ( #494 )  
						
						 
						
						
						
						
					 
					
						2024-01-19 05:31:10 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								AtomicVar 
							
						 
					 
					
						
						
							
						
						550d4bf7c0 
					 
					
						
						
							
							Update binary_cross_entropy function to handle both logits and probabilities ( #492 )  
						
						 
						
						
						
						
					 
					
						2024-01-18 19:22:23 -08:00  
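Note on #492: the title says binary_cross_entropy now handles both logits and probabilities. The scalar sketch below shows one common way to support both input kinds and checks that they agree. It is not taken from the PR; the `with_logits` flag name and the scalar-only signature are illustrative assumptions.

```python
import math


def binary_cross_entropy(inputs, target, with_logits=True):
    """Binary cross entropy for a single scalar prediction.

    with_logits=True treats `inputs` as an unnormalized logit;
    with_logits=False treats it as a probability strictly between 0 and 1.
    """
    if with_logits:
        x = inputs
        # Stable form of -[t*log(sigmoid(x)) + (1-t)*log(1-sigmoid(x))].
        return max(x, 0.0) - x * target + math.log1p(math.exp(-abs(x)))
    p = inputs
    return -(target * math.log(p) + (1.0 - target) * math.log(1.0 - p))


logit = 0.3
prob = 1.0 / (1.0 + math.exp(-logit))  # sigmoid of the logit
print(binary_cross_entropy(logit, 1.0, with_logits=True))
print(binary_cross_entropy(prob, 1.0, with_logits=False))
```

Both calls describe the same prediction, so they print the same loss up to floating-point error. The logit path avoids computing `log(sigmoid(x))` directly, which can overflow or lose precision for large-magnitude logits.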
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Ethan 
							
						 
					 
					
						
						
							
						
						a749a91c75 
					 
					
						
						
							
							Support disabling the Metal buffer cache to prevent performance degradation caused by large memory caching ( #390 )  
						
						 
						
						
						
						
						* support disabling the Metal buffer cache, since a large amount of unused memory stays buffered when an LLM generates long-context tokens
* Run format and add "cache_enabled" feature tests 
						
						
					 
					
						2024-01-18 08:33:34 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								toji 
							
						 
					 
					
						
						
							
						
						49a52610b7 
					 
					
						
						
							
							Added formatter structure and a boolean value formatter ( #354 )  
						
						 
						
						
						
						
						* added formatter structure and a boolean value formatter
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-18 07:49:41 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								AtomicVar 
							
						 
					 
					
						
						
							
						
						d1fef34138 
					 
					
						
						
							
							Add Gaussian NLL loss function ( #477 )  
						
						 
						
						
						
						
						* Add Gaussian NLL loss function
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-18 06:44:44 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						9c111f176d 
					 
					
						
						
							
							Fix split optimization for array iterator ( #484 )  
						
						 
						
						
						
						
					 
					
						2024-01-18 05:50:25 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						90c234b7ac 
					 
					
						
						
							
							Fix round to round half-cases to even ( #482 )  
						
						 
						
						
						
						
					 
					
						2024-01-17 15:27:23 -08:00  
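Note on #482: "round half-cases to even" means that a value exactly halfway between two integers goes to the even neighbour, rather than always up or away from zero. The sketch below spells that rule out in plain Python and compares it with Python's built-in `round`, which follows the same rule. The helper name is made up for illustration and this is not the kernel code from the commit.

```python
import math


def round_half_even(x):
    """Round a float to the nearest integer, sending exact ties to even."""
    lower = math.floor(x)
    diff = x - lower
    if diff < 0.5:
        return lower
    if diff > 0.5:
        return lower + 1
    # Exact tie: pick whichever neighbour is even.
    return lower if lower % 2 == 0 else lower + 1


for value in (0.5, 1.5, 2.5, -0.5, -1.5, 2.4, 2.6):
    print(value, round_half_even(value), round(value))
```

The values used here are exactly representable in binary floating point, so the tie cases are genuine ties.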
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Jagrit Digani 
							
						 
					 
					
						
						
							
						
						78102a47ad 
					 
					
						
						
							
							Update GEMM ( #424 )  
						
						 
						
						
						
						
						* Organize and collect metal subroutine templates and elements in `metal/kernels/steel/`
* Update gemm elements for better performance 
* Add split-K specialization for gemm
* Add `addmm` primitive, op and bindings for fused matmul and bias addition 
* Update tests and benchmarks as needed 
						
						
					 
					
						2024-01-17 12:42:39 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						a2bf7693dd 
					 
					
						
						
							
							Primitive's VJP takes outputs as input ( #475 )  
						
						 
						
						
						
						
						Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>
						
						
					 
					
						2024-01-16 19:03:53 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						d8fabaa12b 
					 
					
						
						
							
							Split multi output ( #461 )  
						
						 
						
						
						
						
						* Multi-output split primitive
* Add the multi-output split to the ArrayIterator
* Add some grad tests for split 
						
						
					 
					
						2024-01-16 13:33:55 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Yashraj Singh 
							
						 
					 
					
						
						
							
						
						e72458a3fa 
					 
					
						
						
							
							implemented isposinf and isneginf in one PR ( #470 )  
						
						 
						
						
						
						
						* ran precommit
* updated docs 
						
						
					 
					
						2024-01-16 06:48:07 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						c15fe3e61b 
					 
					
						
						
							
							Allow arbitrary first dimension in quantization kernels. ( #458 )  
						
						 
						
						
						
						
						* Allow arbitrary first dim on qmm_t and qmv
* Allow arbitrary first dim on qmm and qvm
* Specialized aligned vs unaligned case
* Add more checks for valid quantizations 
						
						
					 
					
						2024-01-16 00:46:21 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Tristan Bilot 
							
						 
					 
					
						
						
							
						
						f44c132f4a 
					 
					
						
						
							
							Add scatter_min VJP ( #462 )  
						
						 
						
						
						
						
					 
					
						2024-01-16 00:37:40 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Matthew Ernst 
							
						 
					 
					
						
						
							
						
						92a2fdd577 
					 
					
						
						
							
							Adds isinf ( #445 )  
						
						 
						
						
						
						
						* adds isinf
Signed-off-by: matthewfernst <matthew.f.ernst@gmail.com>
* use stream + nits
* typo
---------
Signed-off-by: matthewfernst <matthew.f.ernst@gmail.com>
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-15 19:50:44 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Tristan Bilot 
							
						 
					 
					
						
						
							
						
						6022d4129e 
					 
					
						
						
							
							scatter_max vjp + bindings + tests ( #431 )  
						
						 
						
						
						
						
						Co-authored-by: DjamelMesbah <djamel.mesbah@adservio.fr>
						
						
					 
					
						2024-01-14 14:12:15 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						41cc7bdfdb 
					 
					
						
						
							
							Fix stub generation, change graph exporting for arrows to go to outputs ( #455 )  
						
						 
						
						
						
						
					 
					
						2024-01-14 14:06:16 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Diogo 
							
						 
					 
					
						
						
							
						
						2e29d0815b 
					 
					
						
						
							
							Add tile op ( #438 )  
						
						 
						
						
						
						
					 
					
						2024-01-12 23:03:16 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1b71487e1f 
					 
					
						
						
							
							docs ( #444 )  
						
						 
						
						
						
						
					 
					
						2024-01-12 13:34:16 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Ayush Shridhar 
							
						 
					 
					
						
						
							
						
						1416e7b664 
					 
					
						
						
							
							Add isnan ( #423 )  
						
						 
						
						
						
						
					 
					
						2024-01-12 11:16:48 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								davidkoski 
							
						 
					 
					
						
						
							
						
						29081204d1 
					 
					
						
						
							
							array.swapaxes should point to swapaxes free function ( #441 )  
						
						 
						
						
						
						
					 
					
						2024-01-12 11:06:16 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Avikant Srivastava 
							
						 
					 
					
						
						
							
						
						975e265f74 
					 
					
						
						
							
							feat: Add numpy constants ( #428 )  
						
						 
						
						... 
						
						
						
						* add numpy constants
* feat: add unittests
* add newaxis
* add test for newaxis transformation
* refactor 
						
						
					 
					
						2024-01-11 06:47:29 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3b4f066dac 
					 
					
						
						
							
							Correct types for vjp + tests ( #418 )  
						
						 
						
						
						
						
						* correct types for vjp + tests
* fix build + comment 
						
						
					 
					
						2024-01-10 13:32:37 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Juarez Bochi 
							
						 
					 
					
						
						
							
						
						b7f905787e 
					 
					
						
						
							
							GGUF support ( #350 )  
						
						 
						
						
						
						
						* Initial GGUF support for tensor fields.
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-10 13:22:48 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Chunyang Wen 
							
						 
					 
					
						
						
							
						
						e3e933c6bc 
					 
					
						
						
							
							Add type hint for Module ( #412 )  
						
						 
						
						
						
						
					 
					
						2024-01-10 11:23:42 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1d90a76d63 
					 
					
						
						
							
							in place ops behave in place, fix some overloads ( #411 )  
						
						 
						
						
						
						
					 
					
						2024-01-09 16:05:38 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						961435a243 
					 
					
						
						
							
							Scatter vjp ( #394 )  
						
						 
						
						
						
						
						* Add a first scatter vjp
* Implement the scatter_add vjp
* Add array.at to implement user friendly scatters 
						
						
					 
					
						2024-01-09 13:36:51 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e9ca65c939 
					 
					
						
						
							
							Fix BN stats to not expand shape ( #409 )  
						
						 
						
						
						
						
						* fix BN stats to not expand shape
* nit 
						
						
					 
					
						2024-01-09 11:54:51 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f099ebe535 
					 
					
						
						
							
							Multi output primitives ( #330 )  
						
						 
						
						
						
						
						* Multi-output primitives
---------
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>
						
						
					 
					
						2024-01-08 16:39:08 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								YUN, Junwoo 
							
						 
					 
					
						
						
							
						
						0b8aeddac6 
					 
					
						
						
							
							Additional losses ( #336 )  
						
						 
						
						
						
						
						* cosine similarity loss
---------
Co-authored-by: Awni Hannun <awni@apple.com>
* Docstring nits 
						
						
					 
					
						2024-01-08 14:01:13 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Nripesh Niketan 
							
						 
					 
					
						
						
							
						
						73321b8097 
					 
					
						
						
							
							feat: add logicalAnd and logicalOR ( #386 )  
						
						 
						
						
						
						
						* feat: add logicalAnd and logicalOR
* run pre-commit
* Refactor logical_and and logical_or functions
* Add acknowledgement
* Add logical AND and logical OR operators
* Refactor logical_and and logical_or functions
* Add support for logical operators on bool arrays
* Update mlx/ops.cpp
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* Update mlx/ops.cpp
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* Add logical AND and OR operators for arrays and scalars
* Refactor vjp and jvp methods in primitives.cpp
* Add overloaded operators for logical AND and OR
* format
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-08 07:00:05 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Hazem Essam 
							
						 
					 
					
						
						
							
						
						022a944367 
					 
					
						
						
							
							Added GLU activation function and Gated activation function ( #329 )  
						
						 
						
						
						
						
						* Added GLU activation function and gated activation function
* Ran pre-commit
* Ran pre commit
* Removed old sigmoid implementation to match with main
* Removed gated activation from __init__.py
* Removed unused test cases
* Removed unused imports
* format / docstring
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-01-08 06:13:16 -08:00  
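Note on #329: a gated linear unit (GLU) is conventionally defined by splitting the input into two halves, a and b, and returning a * sigmoid(b). The sketch below shows that standard definition for a flat list of floats. It is not the layer added in the commit; the function name, the list-based input, and the split along the only available axis are assumptions for illustration.

```python
import math


def glu(x):
    """Gated linear unit over a flat list: first half gated by the second."""
    if len(x) % 2 != 0:
        raise ValueError("GLU needs an even number of elements to split in half")
    half = len(x) // 2
    a, b = x[:half], x[half:]
    # a * sigmoid(b), written as a / (1 + exp(-b))
    return [ai / (1.0 + math.exp(-bi)) for ai, bi in zip(a, b)]


# With zero gates, sigmoid(0) = 0.5, so each value is halved.
print(glu([2.0, 4.0, 0.0, 0.0]))
```

The output has half as many elements as the input, because one half of the input is consumed as the gate.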
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						a611b0bc82 
					 
					
						
						
							
							Removes the retain_graph flag ( #385 )  
						
						 
						
						
						
						
						* Adds global tracing flag
* Removes retain_graph in favor of is_tracer 
						
						
					 
					
						2024-01-07 15:16:51 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Diogo 
							
						 
					 
					
						
						
							
						
						449b43762e 
					 
					
						
						
							
							Add inner / outer op ( #348 )  
						
						 
						
						
						
						
						* inner / outer impl
* python tests
* ops list and ack
* updated descriptions
* use test helper
* removed dtype check and flatten outer to 1-D
* updated docs
* just use the reshape to flatten 
						
						
					 
					
						2024-01-07 09:01:09 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						6ea6b4258d 
					 
					
						
						
							
							Fix style check ( #395 )  
						
						 
						
						
						
						
					 
					
						2024-01-07 05:54:58 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Anchen 
							
						 
					 
					
						
						
							
						
						48f6ca8c3a 
					 
					
						
						
							
							Add theta cache for Rope and mask cache for ALiBi ( #375 )  
						
						 
						
						
						
						
					 
					
						2024-01-07 00:22:58 -08:00  
					
					
						 
						
						
							
							
							 
							
							
							
							
							 
						
					 
				 
			
				
					
						
							
							
								 
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b34bf5d52b 
					 
					
						
						
							
							fix saving for non-contiguous arrays ( #389 )  
						
						 
						
						
						
						
					 
					
						2024-01-06 12:44:02 -08:00