Awni Hannun 
							
						 
					 
					
						
						
							
						
						9111999af3 
					 
					
						
						
							
							Fix small sort with metal validation ( #1695 )  
						
						
						
						
					 
					
						2024-12-12 09:21:45 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						6bd28d246e 
					 
					
						
						
							
							Allow no copy negative strides in as_strided and slice ( #1688 )  
						
						
						
						
						* allow no copy negative strides in as_strided and slice
* fix jit
* fix jit 
						
						
					 
					
						2024-12-12 08:59:45 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3a21f61772 
					 
					
						
						
							
							Fix build ( #1693 )  
						
						
						
						
					 
					
						2024-12-11 23:56:25 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						4e1e9520e1 
					 
					
						
						
							
							Flatten and unflatten ( #1692 )  
						
						
						
						
						* flatten and unflatten
* fix grad
* fix shape infer
* use squeeze + unsqueeze in get_item 
						
						
					 
					
						2024-12-11 21:51:37 -08:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						0bf19037ca 
					 
					
						
						
							
							Remove "using namespace mlx::core" in python/src ( #1689 )  
						
						
						
						
					 
					
						2024-12-11 15:45:39 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f76a49e555 
					 
					
						
						
							
							ExpandDims primitive ( #1687 )  
						
						
						
						
						* add squeeze primitive
* simplify squeeze, use in gather
* fix
* fix
* fix
* fix
* fix no cpu
* use squeeze in matmul and friends
* expand dims primitive
* comment 
						
						
					 
					
						2024-12-10 16:39:07 -08:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						92ab6bdeb8 
					 
					
						
						
							
							Fix shared library not exporting symbols on Windows ( #1684 )  
						
						
						
						
						* Fix shared library not exporting symbols on Windows
* Function name style 
						
						
					 
					
						2024-12-10 13:59:14 -08:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						a59fae040f 
					 
					
						
						
							
							Fix library output directory for MSVC ( #1681 )  
						
						
						
						
					 
					
						2024-12-09 19:07:50 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						29a620cab2 
					 
					
						
						
							
							No reshapes in quantized embedding ( #1682 )  
						
						
						
						
						* no reshapes in quantized embedding
* fix inadvertent cast
* add tol 
						
						
					 
					
						2024-12-09 18:57:38 -08:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						87d7a2520e 
					 
					
						
						
							
							Use Py_ssize_t in python bindings ( #1678 )  
						
						
						
						
						* Use Py_ssize_t in python bindings
* Args passed to std::max must be same type 
						
						
					 
					
						2024-12-09 12:59:19 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						40c62c1321 
					 
					
						
						
							
							Use int64 stride everywhere ( #1671 )  
						
						
						
						
						* use int64 stride everywhere
* fix ext
* fix ext
* more shape + cleanup
* one more
* few more 
						
						
					 
					
						2024-12-09 11:09:02 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						35b412c099 
					 
					
						
						
							
							Fix compile hasher for string constants. ( #1677 )  
						
						
						
						
						* fix hash
* add test
* nit 
						
						
					 
					
						2024-12-09 09:26:18 -08:00 
						 
				 
			
				
					
						
							
							
								mt_caret 
							
						 
					 
					
						
						
							
						
						fd3377dd1f 
					 
					
						
						
							
							Support bias correction in Adam and AdamW optimizers ( #1640 )  
						
						
						
						
					 
					
						2024-12-06 12:13:34 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						bc2a29f033 
					 
					
						
						
							
							fix ( #1654 )  
						
						
						
						
					 
					
						2024-12-06 10:48:58 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e047fd977d 
					 
					
						
						
							
							compile changes if stream changes ( #1644 )  
						
						
						
						
					 
					
						2024-12-03 14:37:44 -08:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						1445dcaa60 
					 
					
						
						
							
							let class predicate specify quantization parameters ( #1638 )  
						
						
						
						
					 
					
						2024-12-02 14:09:28 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						aa86876813 
					 
					
						
						
							
							fix transformer decoder post norm LN ( #1637 )  
						
						
						
						
					 
					
						2024-12-02 07:02:17 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						7cbb4aef17 
					 
					
						
						
							
							Doc fix ( #1615 )  
						
						
						
						
					 
					
						2024-11-22 11:12:25 -08:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						c79f6a4a8c 
					 
					
						
						
							
							3 and 6 bit quantization ( #1613 )  
						
						
						
						
						* Support 3 and 6 bit quantization 
						
						
					 
					
						2024-11-22 10:22:13 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						0c5eea226b 
					 
					
						
						
							
							Reduce specializations ( #1607 )  
						
						
						
						
						* start of reduce specializations
* fix all reduce
* fix many dims
* fix
* non-jit tests clear
* cleanup instantiations
* cpu merges
* change dim specializations
* optimize
* fix jit
* fix jit
* use higher precision for integer sum+prod
* fixes 
						
						
					 
					
						2024-11-21 19:53:00 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						d8c824c594 
					 
					
						
						
							
							Formatting fixes ( #1606 )  
						
						
						
						
					 
					
						2024-11-20 15:30:36 -08:00 
						 
				 
			
				
					
						
							
							
								Saanidhya 
							
						 
					 
					
						
						
							
						
						cb431dfc9f 
					 
					
						
						
							
							Adds 3D pooling ( #1526 )  
						
						
						
						
					 
					
						2024-11-19 16:45:24 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						61d787726a 
					 
					
						
						
							
							Fix view scalar bug segfault ( #1603 )  
						
						
						
						
						* fix view scalar bug
* fix view scalar bug
* one more fix 
						
						
					 
					
						2024-11-19 10:54:05 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						5e89aace9b 
					 
					
						
						
							
							Fix concatenate vmap ( #1600 )  
						
						
						
						
					 
					
						2024-11-19 10:44:04 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						bf481e8e5d 
					 
					
						
						
							
							Fix sibling leak ( #1590 )  
						
						
						
						
						* add test
* fix + test
* fix fix 
						
						
					 
					
						2024-11-18 19:17:01 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						9bd03dd9b4 
					 
					
						
						
							
							More buffer donation with no-ops ( #1591 )  
						
						
						
						
						* more donation
* fix test
* fix build 
						
						
					 
					
						2024-11-18 08:35:41 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						8c34c9dac4 
					 
					
						
						
							
							throw for invalid case and remove test ( #1575 )  
						
						
						
						
					 
					
						2024-11-08 12:04:03 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						91c0277356 
					 
					
						
						
							
							fix per-example mask + docs in sdpa ( #1574 )  
						
						
						
						
					 
					
						2024-11-08 11:51:15 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						59247c2b62 
					 
					
						
						
							
							add groups in conv2d ( #1569 )  
						
						
						
						
					 
					
						2024-11-07 13:57:53 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						54f05e7195 
					 
					
						
						
							
							Fix gather vmap ( #1563 )  
						
						
						
						
						* fix gather
* fix 
						
						
					 
					
						2024-11-05 11:29:20 -08:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						26be608470 
					 
					
						
						
							
							Add split_k qvm for long context ( #1564 )  
						
						
						
						
						* Add splitk qvm
* configurable splitk
* tuning
* remove extra instantiation
* remove refactor
* separate test
* cpu tolerance 
						
						
					 
					
						2024-11-05 11:25:19 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						76f275b4df 
					 
					
						
						
							
							error in rms for wrong size ( #1562 )  
						
						
						
						
					 
					
						2024-11-04 13:24:02 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						62f297b51d 
					 
					
						
						
							
							Sdpa fix ( #1558 )  
						
						
						
						
					 
					
						2024-11-02 21:25:46 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						09bc32f62f 
					 
					
						
						
							
							No extra reshape ( #1557 )  
						
						
						
						
						* no extra reshape
* lint 
						
						
					 
					
						2024-11-02 19:07:20 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						9e516b71ea 
					 
					
						
						
							
							Add dispatchThreads to custom kernel doc ( #1551 )  
						
						
						
						
						* add dispatchThreads info
* update
* add link 
						
						
					 
					
						2024-11-01 13:07:48 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						57c6aa7188 
					 
					
						
						
							
							fix multi output leak ( #1548 )  
						
						
						
						
					 
					
						2024-10-31 09:32:01 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						4f72c66911 
					 
					
						
						
							
							improvements to scatter / gather ( #1541 )  
						
						
						
						
					 
					
						2024-10-30 19:30:54 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						048fabdabd 
					 
					
						
						
							
							Fix vmap constant output size ( #1524 )  
						
						
						
						
						* use inputs to determine output size
* remove noop vmap tests 
						
						
					 
					
						2024-10-30 16:16:53 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d2ff04a4f2 
					 
					
						
						
							
							fix format ( #1539 )  
						
						
						
						
					 
					
						2024-10-28 18:29:14 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						0eb56d5be0 
					 
					
						
						
							
							Wired ( #1510 )  
						
						
						
						
						* expose residency sets as wire/unwire
* returns wired size
* fix
* runtime support check
* fix os check
* fix test
* fix no metal build
* docs
* nit
* nits in docs
* nits 
						
						
					 
					
						2024-10-25 09:35:33 -07:00 
						 
				 
			
				
					
						
							
							
								Venkata Naga Aditya Datta Chivukula 
							
						 
					 
					
						
						
							
						
						430ffef58a 
					 
					
						
						
							
							[Feature] Added Sparse Initialization ( #1498 )  
						
						
						
						
						Co-authored-by: Saanidhyavats <saanidhyavats@gmail.com>
						
						
					 
					
						2024-10-24 12:31:24 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						3d17077187 
					 
					
						
						
							
							Add mx.array.__format__ ( #1521 )  
						
						
						
						
						* add __format__
* actually test something
* fix 
						
						
					 
					
						2024-10-24 11:11:39 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						c9b41d460f 
					 
					
						
						
							
							Working 64-bit scans ( #1506 )  
						
						
						
						
					 
					
						2024-10-24 11:05:46 -07:00 
						 
				 
			
				
					
						
							
							
								Kashif Rasul 
							
						 
					 
					
						
						
							
						
						3ddc07e936 
					 
					
						
						
							
							Eigenvalues and eigenvectors ( #1334 )  
						
						
						
						
						* initial eigvalsh
* add compute_vectors
* add compute_vectors_
* return a pair
* add eigh to return only eigenvectors
* fixed typo
* merge Eighvalsh and Eigh into a single primitive
* use the same primitive with the flag
* fix primitives
* use MULTI
* fix eval_gpu
* fix declaration
* rename EighPrimitive to Eigh
* tests
* tests
* fix rebase and format
* cleanup lapack
* format
* add cblas.h
---------
Co-authored-by: Awni Hannun <awni@apple.com>
						
						
					 
					
						2024-10-22 12:18:48 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						c26208f67d 
					 
					
						
						
							
							Remove Hazard tracking with Fences ( #1509 )  
						
						
						
						
						* remove hazard tracking
* with fence map
* no hazard tracking with fences
* nits
* fix fence retain
* cleanup
* fix quantized rebase 
						
						
					 
					
						2024-10-21 19:33:32 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						d15fa13daf 
					 
					
						
						
							
							Batched Quantized Matmul + Fast Small QMV ( #1503 )  
						
						
						
						
						* add fast qmv for small dims
* fix test
* batched cpu
* add batched template param
* refactor metal quantized.cpp 
						
						
					 
					
						2024-10-21 16:23:17 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						92d7cb71f8 
					 
					
						
						
							
							Fix compile ( #1501 )  
						
						
						
						
						* fix compile
* fix space 
						
						
					 
					
						2024-10-18 11:06:40 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						9dd72cd421 
					 
					
						
						
							
							fix gumbel ( #1495 )  
						
						
						
						
					 
					
						2024-10-17 13:52:39 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f9f8c167d4 
					 
					
						
						
							
							fix submodule stubs ( #1492 )  
						
						
						
						
					 
					
						2024-10-15 16:23:37 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3f86399922 
					 
					
						
						
							
							Real and Imag ( #1490 )  
						
						
						
						
						* real and imag
* fix
* fix 
						
						
					 
					
						2024-10-15 16:23:15 -07:00