Cheng 
							
						 
					 
					
						
						
							
						
						0cae0bdac8 
					 
					
						
						
							
							CUDA backend: backbone ( #2075 )  
						
						
						
						
							
						
					 
					
						2025-05-06 21:26:46 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						5a1a5d5ed1 
					 
					
						
						
							
							fix input coherent kernel launch ( #2153 )  
						
						
						
						
							
						
					 
					
						2025-05-05 17:30:50 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						1683975acf 
					 
					
						
						
							
							Move common gpu primitives to backend/gpu ( #2145 )  
						
						
						
						
							
						
					 
					
						2025-05-05 13:45:29 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						af705590ac 
					 
					
						
						
							
							fix batched vector sdpa ( #2152 )  
						
						
						
						
							
						
					 
					
						2025-05-05 13:13:03 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						825124af8f 
					 
					
						
						
							
							fix bw for elementwise ops ( #2151 )  
						
						... 
						
						
						
						* fix bw for elementwise ops
* add compile
* fix
* fix
* fix
* fix 
						
						
							
						
					 
					
						2025-05-05 06:15:04 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						9c5e7da507 
					 
					
						
						
							
							fix compile merging ( #2150 )  
						
						
						
						
							
						
					 
					
						2025-05-02 15:08:50 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						481349495b 
					 
					
						
						
							
							GPU Hadamard for large N ( #1879 )  
						
						
						
						
							
						
					 
					
						2025-05-01 17:19:17 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						9daa6b003f 
					 
					
						
						
							
							fix shapeless export ( #2148 )  
						
						
						
						
							
						
					 
					
						2025-05-01 15:02:02 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						a3a632d567 
					 
					
						
						
							
							Fix the launcher when ran locally ( #2147 )  
						
						
						
						
							
						
					 
					
						2025-05-01 12:56:09 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e496c5a4b4 
					 
					
						
						
							
							fix integer overflow in qmm ( #2143 )  
						
						
						
						
							
						
					 
					
						2025-04-30 09:28:56 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						ea890d8710 
					 
					
						
						
							
							Remove metal-only tests ( #2139 )  
						
						
						
						
							
						
					 
					
						2025-04-30 09:08:39 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						aa5d84f102 
					 
					
						
						
							
							Allow quant layer to be unfrozen ( #2142 )  
						
						
						
						
							
						
					 
					
						2025-04-30 09:08:29 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f1606486d2 
					 
					
						
						
							
							Generalize gpu backend ( #2138 )  
						
						... 
						
						
						
						* generalize gpu backend
* fix no_gpu build
* fix no_gpu build
* generalize gpu backend 
						
						
							
						
					 
					
						2025-04-30 09:08:17 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						87720a8908 
					 
					
						
						
							
							Fix building with uv ( #2141 )  
						
						
						
						
							
						
					 
					
						2025-04-30 06:04:07 -07:00 
						 
				 
			
				
					
						
							
							
								Aashiq Dheeraj 
							
						 
					 
					
						
						
							
						
						bb6565ef14 
					 
					
						
						
							
							add fftshift and ifftshift fft helpers ( #2135 )  
						
						... 
						
						
						
						* add fftshift and ifftshift fft helpers
* address comments
* axes have to be iterable
* fix fp error in roll + add test
---------
Co-authored-by: Aashiq Dheeraj <aashiq@aashiq-mbp-m4.local > 
						
						
							
						
					 
					
						2025-04-29 22:13:45 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						7bb063bcb3 
					 
					
						
						
							
							Enable vjp for quantized scale and bias ( #2129 )  
						
						... 
						
						
						
						* Enable vjp for quantized scale and bias
* higher tol 
						
						
							
						
					 
					
						2025-04-29 13:03:09 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Chi Z. 
							
						 
					 
					
						
						
							
						
						b36dd472bb 
					 
					
						
						
							
							return library if it is successfully loaded ( #2131 )  
						
						
						
						
							
						
					 
					
						2025-04-29 07:30:36 -07:00 
						 
				 
			
				
					
						
							
							
								hdeng-apple 
							
						 
					 
					
						
						
							
						
						167b759a38 
					 
					
						
						
							
							Fix typos ( #2136 )  
						
						
						
						
							
						
					 
					
						2025-04-29 07:26:05 -07:00 
						 
				 
			
				
					
						
							
							
								charan-003 
							
						 
					 
					
						
						
							
						
						99b9868859 
					 
					
						
						
							
							Clarify dimension notation in conv1d, conv2d, and conv3d docstrings ( #2123 )  
						
						... 
						
						
						
						* Clarify dimension notation in conv1d, conv2d, and conv3d docstrings
* Updating transposed convs in conv1d, conv2d, and conv3d
---------
Co-authored-by: Sai Charan Arvapally <saicharan@Sais-MacBook-Pro.local > 
						
						
							
						
					 
					
						2025-04-25 12:18:30 -07:00 
						 
				 
			
				
					
						
							
							
								1ndig0 
							
						 
					 
					
						
						
							
						
						6b2d5448f2 
					 
					
						
						
							
							Fix the error message in mx.right_shift and mx.left_shift ( #2121 )  
						
						... 
						
						
						
						* update right_shift and lef_shift
* simplify
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2025-04-25 09:14:28 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						eaf709b83e 
					 
					
						
						
							
							patch ( #2119 )  
						
						
						
						
							
 
						
					 
					
						2025-04-24 16:11:07 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						f0e70afff0 
					 
					
						
						
							
							Fix swift pm load ( #2117 )  
						
						
						
						
							
						
					 
					
						2025-04-24 10:58:29 -07:00 
						 
				 
			
				
					
						
							
							
								hdeng-apple 
							
						 
					 
					
						
						
							
						
						86984cad68 
					 
					
						
						
							
							Remove static initializers ( #2059 )  
						
						... 
						
						
						
						* Remove static initializers in device.cpp, load.cpp, pocketfft.h
* Remove static initializer InTracing::trace_stack
* Remove static initializer of CompilerCache cache
* Revert changes in pocketfft.h
* Remove duplicate private section of thread_pool() 
						
						
							
						
					 
					
						2025-04-24 06:14:49 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						fbc89e3ced 
					 
					
						
						
							
							fix pinv ( #2110 )  
						
						
						
						
							
						
					 
					
						2025-04-23 13:08:28 -07:00 
						 
				 
			
				
					
						
							
							
								hdeng-apple 
							
						 
					 
					
						
						
							
						
						38c1e720c2 
					 
					
						
						
							
							Search mlx.metallib in macOS framework "Resources" dir ( #2061 )  
						
						... 
						
						
						
						---------
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com > 
						
						
							
						
					 
					
						2025-04-23 09:53:13 -07:00 
						 
				 
			
				
					
						
							
							
								Param Thakkar 
							
						 
					 
					
						
						
							
						
						600e87e03c 
					 
					
						
						
							
							Added output_padding parameters in conv_transpose ( #2092 )  
						
						
						
						
							
						
					 
					
						2025-04-23 09:26:33 -07:00 
						 
				 
			
				
					
						
							
							
								Hyunsung Lee 
							
						 
					 
					
						
						
							
						
						3836445241 
					 
					
						
						
							
							Add broadcast_shapes in python API ( #2091 )  
						
						
						
						
							
						
					 
					
						2025-04-22 18:57:39 -07:00 
						 
				 
			
				
					
						
							
							
								Yury Popov 
							
						 
					 
					
						
						
							
						
						1d2c9d6a07 
					 
					
						
						
							
							Complex scan ( #2094 )  
						
						
						
						
							
						
					 
					
						2025-04-22 18:56:28 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e8ac6bd2f5 
					 
					
						
						
							
							irfft throws instead of segfaults on scalars ( #2109 )  
						
						
						
						
							
						
					 
					
						2025-04-22 10:25:55 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						fdadc4f22c 
					 
					
						
						
							
							Add more complex unary ops ( #2101 )  
						
						
						
						
							
						
					 
					
						2025-04-21 13:04:54 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						79b527f45f 
					 
					
						
						
							
							conv vmap ( #2102 )  
						
						
						
						
							
						
					 
					
						2025-04-21 13:04:39 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						dc4eada7f0 
					 
					
						
						
							
							Use unordered map for kwargs in export/import ( #2087 )  
						
						... 
						
						
						
						* use unordered map for kwargs in export/import
* comment 
						
						
							
						
					 
					
						2025-04-21 07:17:22 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						70ebc3b598 
					 
					
						
						
							
							Return const ref in array::data_shared_ptr ( #2100 )  
						
						
						
						
							
						
					 
					
						2025-04-21 07:17:09 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						b13f2aed16 
					 
					
						
						
							
							Introduce macros for dispatching dynamic dtypes as static types ( #2073 )  
						
						
						
						
							
						
					 
					
						2025-04-19 06:16:30 -07:00 
						 
				 
			
				
					
						
							
							
								Param Thakkar 
							
						 
					 
					
						
						
							
						
						5f04c0f818 
					 
					
						
						
							
							Fixed shift operations issue ( #2080 )  
						
						... 
						
						
						
						* Fixed shift operations issue
* Added tests and fixes
* Fixed loop syntax error
* Added tests for bool
* Fixed typo 
						
						
							
						
					 
					
						2025-04-18 14:28:33 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						55935ccae7 
					 
					
						
						
							
							fix py gc edge case ( #2079 )  
						
						
						
						
							
						
					 
					
						2025-04-18 12:46:53 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b529515eb1 
					 
					
						
						
							
							minor bump ( #2081 )  
						
						
						
						
							
 
						
					 
					
						2025-04-17 14:57:11 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						3cde719eb7 
					 
					
						
						
							
							Route to gather qmm only for many tokens per expert ( #2082 )  
						
						
						
						
							
						
					 
					
						2025-04-17 14:53:08 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						5de6d94a90 
					 
					
						
						
							
							Gather qmm batched kernel and refactoring of quantized ( #2078 )  
						
						
						
						
							
						
					 
					
						2025-04-17 13:53:11 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						99eefd2ec0 
					 
					
						
						
							
							Gather mm new kernel and small refactoring ( #2040 )  
						
						
						
						
							
						
					 
					
						2025-04-14 16:37:36 -07:00 
						 
				 
			
				
					
						
							
							
								Yury Popov 
							
						 
					 
					
						
						
							
						
						e9e268336b 
					 
					
						
						
							
							LogCumSumExp ( #2069 )  
						
						
						
						
							
						
					 
					
						2025-04-13 01:27:29 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						7275ac7523 
					 
					
						
						
							
							Fix release build ( #2072 )  
						
						
						
						
							
						
					 
					
						2025-04-12 20:41:58 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						c4189a38e4 
					 
					
						
						
							
							Add float mask to sdpa vector ( #2068 )  
						
						
						
						
							
						
					 
					
						2025-04-11 17:29:40 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						68d1b3256b 
					 
					
						
						
							
							nit: fix exception handling ( #2066 )  
						
						
						
						
							
						
					 
					
						2025-04-11 14:12:08 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						9c6953bda7 
					 
					
						
						
							
							Fix stubgen ( #2065 )  
						
						... 
						
						
						
						* Fix stubgen
* add multi optim to docs 
						
						
							
						
					 
					
						2025-04-11 12:02:54 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						ef7ece9851 
					 
					
						
						
							
							fix fft bug ( #2062 )  
						
						
						
						
							
						
					 
					
						2025-04-10 19:41:27 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						ddaa4b7dcb 
					 
					
						
						
							
							Fix the test and add custom min/max reductions for uncommon MPI types ( #2060 )  
						
						
						
						
							
						
					 
					
						2025-04-10 17:01:17 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						dfae2c6989 
					 
					
						
						
							
							Fix MSVC build due to use of M_LN2 ( #2058 )  
						
						
						
						
							
						
					 
					
						2025-04-10 07:41:41 -07:00 
						 
				 
			
				
					
						
							
							
								Anastasiia Filippova 
							
						 
					 
					
						
						
							
						
						515f104926 
					 
					
						
						
							
							Min / max reductions ( #2041 )  
						
						
						
						
							
						
					 
					
						2025-04-09 23:22:20 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						9ecefd56db 
					 
					
						
						
							
							Do not load the default lib if another is requested ( #2055 )  
						
						
						
						
							
						
					 
					
						2025-04-09 13:31:38 -07:00