Awni Hannun 
							
						 
					 
					
						
						
							
						
						e9e53856d2 
					 
					
						
						
							
							patch bump ( #1287 )  
						
						
						
						
							
 
						
					 
					
						2024-07-25 11:42:09 -07:00 
						 
				 
			
				
					
						
							
							
								Anton Belov 
							
						 
					 
					
						
						
							
						
						5029894662 
					 
					
						
						
							
							[Issue  #1187 ] Add nan_to_num function initial attempt ( #1247 )  
						
						... 
						
						
						
						* initial attempt, working with wrong types
* not compiling; mx.float16 and mx.bfloat16 tests added
* fix nan to num
* nit
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-07-25 09:57:37 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						baf9fa5f42 
					 
					
						
						
							
							Einsum ( #1269 )  
						
						... 
						
						
						
						* einsum initial
* fix comma break
* sum axis was wrong
* small cleanups
* python binding
* changed bindings to resemble numpy
* remove todo comment
* comment changes
* add count of operands/inputs
* fail fast if operands list is empty
* ignore comma if no output
* einsum path matching numpy
* getting somewhere with path
* remove print
* it passes the first test
* moved einsum tests to seperate file
* seperated einsum path
* moved einsum naive
* remove space from equation
* fast fail if no operands passed
* update tests and remove printf
* small cleanup
* some more cleanups
* removed python helper file
* ack
* utilize std for finding min in vector
* duplicate def
* remove the tuple as it was unreadable
* moved einsum_naive back to ops
* remaining isn't needed
* avoid creating another set
* cleanup
* greedy path, start of naive einsum
* more einsum
* fix some bugs
* some more fixes, tests pass
* benchmark
* some simplify
* fix einsum and test
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com >
* add a bunch more tests and fix a bunch more bugs
* some docs nits
---------
Co-authored-by: dc-dc-dc <dgcruz983@gmail.com >
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com > 
						
						
							
						
					 
					
						2024-07-25 09:36:44 -07:00 
						 
				 
			
				
					
						
							
							
								Jagrit Digani 
							
						 
					 
					
						
						
							
						
						7f914365fd 
					 
					
						
						
							
							Fix GPU sort for large arrays ( #1285 )  
						
						... 
						
						
						
						* Fix GPU sort for large arrays 
						
						
							
						
					 
					
						2024-07-24 14:37:10 -07:00 
						 
				 
			
				
					
						
							
							
								Paul Paczuski 
							
						 
					 
					
						
						
							
						
						ebd7135b50 
					 
					
						
						
							
							Improve stability of BCE loss calculation for input probabilities close to or exactly 0 or 1 ( #1280 )  
						
						... 
						
						
						
						* Improve stability of BCE loss calculation
* Standardize comment
* Apply formatting with black via pre-commit
* Add usage recommendation to docstring
* Update python/mlx/nn/losses.py
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com > 
						
						
							
						
					 
					
						2024-07-24 08:38:22 -07:00 
						 
				 
			
				
					
						
							
							
								fgranqvist 
							
						 
					 
					
						
						
							
						
						50eff6a10a 
					 
					
						
						
							
							Implement sampling from laplace distribution. ( #1279 )  
						
						
						
						
							
						
					 
					
						2024-07-24 15:15:37 +02:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						c34a5ae7f7 
					 
					
						
						
							
							Fix bfloat16 Hadamard ( #1283 )  
						
						... 
						
						
						
						* fix bfloat16 hadamard
* add scale
* review comments
---------
Co-authored-by: Alex Barron <abarron22@apple.com > 
						
						
							
						
					 
					
						2024-07-23 14:54:43 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e2aa6ec8ae 
					 
					
						
						
							
							some fixes ( #1281 )  
						
						
						
						
							
						
					 
					
						2024-07-23 11:49:05 -07:00 
						 
				 
			
				
					
						
							
							
								toji 
							
						 
					 
					
						
						
							
						
						6768c6a54a 
					 
					
						
						
							
							Adding missing type hints  ( #1243 )  
						
						... 
						
						
						
						* added type hints for `run`, `tree_map` and `tree_map_with_path`
* fix lint
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-07-23 07:29:38 -07:00 
						 
				 
			
				
					
						
							
							
								Tim Gymnich 
							
						 
					 
					
						
						
							
						
						6307d166eb 
					 
					
						
						
							
							Fix overflow / underflow handling for expm1f ( #1278 )  
						
						... 
						
						
						
						* Fix overflow / underflow handling for expm1f
* update tests 
						
						
							
						
					 
					
						2024-07-23 07:29:06 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1fba87b0df 
					 
					
						
						
							
							Fix leak with multi-output primitives ( #1274 )  
						
						... 
						
						
						
						* fix leak with multi-output primitives
* hopefully an actual fix 
						
						
							
						
					 
					
						2024-07-23 06:34:18 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						df124e018a 
					 
					
						
						
							
							fix gguf ( #1273 )  
						
						... 
						
						
						
						* fix gguf
* comment 
						
						
							
						
					 
					
						2024-07-18 07:35:35 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						2f83d6e4b7 
					 
					
						
						
							
							Do not release buffers on exit ( #1142 )  
						
						
						
						
							
						
					 
					
						2024-07-15 15:12:24 -07:00 
						 
				 
			
				
					
						
							
							
								Feng Shijie 
							
						 
					 
					
						
						
							
						
						987785d8d7 
					 
					
						
						
							
							Fix typo and missing header ( #1266 )  
						
						
						
						
							
						
					 
					
						2024-07-15 08:20:24 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						8c01a7893b 
					 
					
						
						
							
							minor fix in optimizer + docs ( #1264 )  
						
						
						
						
							
						
					 
					
						2024-07-12 12:18:02 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						218047c75a 
					 
					
						
						
							
							docs fixes ( #1263 )  
						
						
						
						
							
						
					 
					
						2024-07-11 15:59:07 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						d0da74209b 
					 
					
						
						
							
							version bump ( #1260 )  
						
						
						
						
							
 
						
					 
					
						2024-07-11 11:17:55 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						5c1fa64fb0 
					 
					
						
						
							
							Custom transforms ( #1246 )  
						
						
						
						
							
						
					 
					
						2024-07-10 18:00:01 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						a3c287354f 
					 
					
						
						
							
							Fast Hadamard Transform ( #1249 )  
						
						... 
						
						
						
						* Working hadamard for powers of 2
* working for m*2^k
* add scale and check contiguity
* add size check
* clean up
* fix test
* add grads + vmap
* gpu only
* skip on linux
* test typo
* add cpu impl
* remove gpu only tests
* fix linux build + add is_equivalent 
						
						
							
						
					 
					
						2024-07-09 20:39:01 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						03cf033f82 
					 
					
						
						
							
							Fix reshape copy bug ( #1253 )  
						
						
						
						
							
						
					 
					
						2024-07-07 21:37:00 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						bdb36c9a63 
					 
					
						
						
							
							add zero vjps for bitwise ops and gather w.r.t. index ( #1256 )  
						
						
						
						
							
						
					 
					
						2024-07-07 21:34:59 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						20bb301195 
					 
					
						
						
							
							CPU binary reduction + Nits ( #1242 )  
						
						... 
						
						
						
						* very minor nits
* reduce binary
* fix test 
						
						
							
						
					 
					
						2024-06-28 13:50:42 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d6383a1c6a 
					 
					
						
						
							
							version bump ( #1239 )  
						
						
						
						
							
 
						
					 
					
						2024-06-27 10:43:13 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						b05bcfd27f 
					 
					
						
						
							
							Fixes segfault when compiling checkpointed functions ( #1235 )  
						
						
						
						
							
						
					 
					
						2024-06-26 16:14:45 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						2615660e62 
					 
					
						
						
							
							Fix strided sort bug ( #1236 )  
						
						... 
						
						
						
						* Use output strides in sort kernel
* fix zero strides bug 
						
						
							
						
					 
					
						2024-06-26 14:32:11 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						5b0af4cdb1 
					 
					
						
						
							
							fix donation condition for compilation ( #1237 )  
						
						
						
						
							
						
					 
					
						2024-06-26 09:04:05 -07:00 
						 
				 
			
				
					
						
							
							
								Jagrit Digani 
							
						 
					 
					
						
						
							
						
						8c2e15e6c8 
					 
					
						
						
							
							Accelerate import updates for iOS ( #1227 )  
						
						... 
						
						
						
						* Update veclib and bnns includes to #include <Accelerate/Accelerate.h> for compatibility with ios
* Mark float literals in softmax.cpp to be float16_t for errors in ios
* Add arm neon vector operation guards
* Redirect to common backend for consistency 
						
						
							
						
					 
					
						2024-06-26 09:01:50 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						56c8a33439 
					 
					
						
						
							
							Get metal version from xcode ( #1228 )  
						
						... 
						
						
						
						* get metal version from xcode
* typo
* fix 
						
						
							
						
					 
					
						2024-06-26 07:02:11 -07:00 
						 
				 
			
				
					
						
							
							
								David Koski 
							
						 
					 
					
						
						
							
						
						4eef1e8a3e 
					 
					
						
						
							
							fix typo ( #1215 )  
						
						
						
						
							
						
					 
					
						2024-06-24 13:36:35 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						95d11bda06 
					 
					
						
						
							
							Fix NumPy 2.0 pickle test ( #1221 )  
						
						... 
						
						
						
						* fix numpy version <2 temporarily
* typo
* better fix
* Fix just for bfloat16
---------
Co-authored-by: Alex Barron <abarron22@apple.com > 
						
						
							
						
					 
					
						2024-06-23 05:47:22 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						af9079cc1f 
					 
					
						
						
							
							version bump ( #1212 )  
						
						
						
						
							
 
						
					 
					
						2024-06-14 11:28:51 -07:00 
						 
				 
			
				
					
						
							
							
								Jagrit Digani 
							
						 
					 
					
						
						
							
						
						2d6cd47713 
					 
					
						
						
							
							Masked gemv ( #1211 )  
						
						
						
						
							
						
					 
					
						2024-06-14 09:52:26 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						fe3167d7ea 
					 
					
						
						
							
							smaller CPU binary ( #1203 )  
						
						... 
						
						
						
						* smaller CPU binary
* fix no cpu build 
						
						
							
						
					 
					
						2024-06-14 09:46:55 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						31e134be35 
					 
					
						
						
							
							Build for macOS 15 ( #1208 )  
						
						... 
						
						
						
						* Build for macos 15
* metal32 as well
* comment
---------
Co-authored-by: Awni Hannun <Awni Hannun> 
						
						
							
						
					 
					
						2024-06-13 13:31:44 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e84ba8056d 
					 
					
						
						
							
							only allow openmpi ( #1209 )  
						
						
						
						
							
						
					 
					
						2024-06-13 12:14:44 -07:00 
						 
				 
			
				
					
						
							
							
								Fangjun Kuang 
							
						 
					 
					
						
						
							
						
						f20e97b092 
					 
					
						
						
							
							minor fixes ( #1194 )  
						
						... 
						
						
						
						* minor fixes
* fix build errors 
						
						
							
						
					 
					
						2024-06-12 22:06:49 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						934683088e 
					 
					
						
						
							
							Refactor JIT for unary/binary/ternary ops ( #1206 )  
						
						... 
						
						
						
						* refactor unary/binary/ternary ops
* get_primitive_string util
--------- 
						
						
							
						
					 
					
						2024-06-12 14:22:12 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						de2b9e7d0a 
					 
					
						
						
							
							Fix kernel deps to reduce build times ( #1205 )  
						
						
						
						
							
						
					 
					
						2024-06-12 11:17:39 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						dd7d8e5e29 
					 
					
						
						
							
							Add Quantized Ops to the JIT ( #1204 )  
						
						... 
						
						
						
						* JIT for quantized ops
* remove unused imports
* address comments
* fix imports
* second attempt to fix imports
---------
Co-authored-by: Alex Barron <abarron22@apple.com > 
						
						
							
						
					 
					
						2024-06-12 09:47:12 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						df964132fb 
					 
					
						
						
							
							fix scatter + test ( #1202 )  
						
						... 
						
						
						
						* fix scatter + test
* fix test warnings
* fix metal validation 
						
						
							
						
					 
					
						2024-06-11 14:35:12 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						709ccc6800 
					 
					
						
						
							
							install mpi for release build ( #1199 )  
						
						
						
						
							
						
					 
					
						2024-06-10 10:09:32 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						cf236fc390 
					 
					
						
						
							
							version ( #1191 )  
						
						
						
						
							
 
						
					 
					
						2024-06-06 17:16:40 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						27d70c7d9d 
					 
					
						
						
							
							Feature complete Metal FFT ( #1102 )  
						
						... 
						
						
						
						* feature complete metal fft
* fix contiguity bug
* jit fft
* simplify rader/bluestein constant computation
* remove kernel/utils.h dep
* remove bf16.h dep
* format
---------
Co-authored-by: Alex Barron <abarron22@apple.com > 
						
						
							
						
					 
					
						2024-06-06 12:57:25 -07:00 
						 
				 
			
				
					
						
							
							
								nicolov 
							
						 
					 
					
						
						
							
						
						0e585b4409 
					 
					
						
						
							
							Add docstring for scatter ( #1189 )  
						
						... 
						
						
						
						* Add docstring for scatter
* docs nits
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-06-06 11:51:25 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						0163a8e57a 
					 
					
						
						
							
							Add docs for the distributed namespace ( #1184 )  
						
						
						
						
							
						
					 
					
						2024-06-06 11:37:00 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						578842954c 
					 
					
						
						
							
							fix jit scan when output doesn't have primitive ( #1190 )  
						
						
						
						
							
						
					 
					
						2024-06-06 07:24:58 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						496315fe1d 
					 
					
						
						
							
							Fix scan ( #1188 )  
						
						... 
						
						
						
						* fix scan
* improve grid size
* fix cpu cummax 
						
						
							
						
					 
					
						2024-06-05 14:21:58 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						0fe6895893 
					 
					
						
						
							
							Fix the hard-shrink test ( #1185 )  
						
						
						
						
							
						
					 
					
						2024-06-04 16:22:56 -07:00 
						 
				 
			
				
					
						
							
							
								Nikhil Mehta 
							
						 
					 
					
						
						
							
						
						0b7d71fd2f 
					 
					
						
						
							
							Add softmin, hardshrink, hardtanh ( #1180 )  
						
						... 
						
						
						
						---------
Co-authored-by: Nikhil Mehta <nikmehta@tesla.com > 
						
						
							
						
					 
					
						2024-06-04 15:48:18 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						83b11bc58d 
					 
					
						
						
							
							Fix Metal API validation for empty concat ( #1183 )  
						
						
						
						
							
						
					 
					
						2024-06-04 13:17:08 -07:00