Awni Hannun 
							
						 
					 
					
						
						
							
						
						f374b6ca4d 
					 
					
						
						
							
							Bump nanobind to 2.2 ( #1461 )  
						
						... 
						
						
						
						* bump nanobind
* extension version for tests 
						
						
							
						
					 
					
						2024-10-07 16:52:40 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						0070e1db40 
					 
					
						
						
							
							Fix deep recursion with siblings ( #1462 )  
						
						... 
						
						
						
						* fix recursion with siblings
* fix
* add test
* increase tol 
						
						
							
						
					 
					
						2024-10-07 06:15:33 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						95d04805b3 
					 
					
						
						
							
							Fix complex power on Metal ( #1460 )  
						
						
						
						
							
						
					 
					
						2024-10-06 19:58:30 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e4534dac17 
					 
					
						
						
							
							Conv grad with groups + bugfix ( #1449 )  
						
						... 
						
						
						
						* fix bug in flipped conv with groups, start of grad for groups
* fix
* fix
* fix + test 
						
						
							
						
					 
					
						2024-10-06 07:08:53 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						fef3c4ec1d 
					 
					
						
						
							
							Fix mpi test in CI ( #1456 )  
						
						... 
						
						
						
						* Fix mpi test in CI
* Set bind to none 
						
						
							
						
					 
					
						2024-10-06 06:09:17 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1bdc038bf9 
					 
					
						
						
							
							fix argpartition + faster {arg} sorts / partitions ( #1453 )  
						
						
						
						
							
						
					 
					
						2024-10-03 14:21:25 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						5523d9c426 
					 
					
						
						
							
							faster cpu indexing ( #1450 )  
						
						
						
						
							
						
					 
					
						2024-10-03 13:53:47 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						d878015228 
					 
					
						
						
							
							Fix normalization check_input ( #1452 )  
						
						
						
						
							
						
					 
					
						2024-10-03 13:26:56 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						5900e3249f 
					 
					
						
						
							
							Fix building on Linux ( #1446 )  
						
						
						
						
							
						
					 
					
						2024-09-30 07:00:39 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						bacced53d3 
					 
					
						
						
							
							Fix row reduce with very few rows ( #1447 )  
						
						
						
						
							
						
					 
					
						2024-09-29 20:00:35 -07:00 
						 
				 
			
				
					
						
							
							
								Lucas Newman 
							
						 
					 
					
						
						
							
						
						4a64d4bff1 
					 
					
						
						
							
							Add support for grouped 1D convolutions to the nn API ( #1444 )  
						
						... 
						
						
						
						* Fix the weight shape for grouped convolutions from the nn API.
* Add tests.
* Pre-commit formatting.
* Add input validation.
* Use integer division instead of casting.
* docs
* nit
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-09-28 06:41:07 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b1e2b53c2d 
					 
					
						
						
							
							bump ( #1445 )  
						
						
						
						
							
 
						
					 
					
						2024-09-27 13:53:02 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						11354d5bff 
					 
					
						
						
							
							Avoid io timeout for large arrays ( #1442 )  
						
						
						
						
							
						
					 
					
						2024-09-27 13:32:14 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						718aea3f1d 
					 
					
						
						
							
							allow take to work with integer index ( #1440 )  
						
						
						
						
							
						
					 
					
						2024-09-26 15:58:03 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						5b6f38df2b 
					 
					
						
						
							
							Faster cpu ops ( #1434 )  
						
						... 
						
						
						
						* faster binary and cleaner copy
* use recursive template for other ops
* more cleanup
* fix from cleanup
* more clean
* fix binary
* use contiguous iterator
* add 3d
* nits
* fix
* fix?
* fix
* fix rebase 
						
						
							
						
					 
					
						2024-09-26 09:19:13 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						0b4a58699e 
					 
					
						
						
							
							Some overhead reductions in mx.fast.metal_kernel ( #1437 )  
						
						... 
						
						
						
						* some overhead reductions
* fix
* use +=
* use more += 
						
						
							
						
					 
					
						2024-09-25 17:25:21 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						4f9f9ebb6f 
					 
					
						
						
							
							Faster Metal unary and binary for general case  ( #1431 )  
						
						... 
						
						
						
						* faster unary and binary for general case
* update ternary + jit fix
* fix jit
* unary work per thread 
						
						
							
						
					 
					
						2024-09-25 12:07:43 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						afc9c0ec1b 
					 
					
						
						
							
							dtype is copy assignable ( #1436 )  
						
						
						
						
							
						
					 
					
						2024-09-25 12:07:13 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						195b429d99 
					 
					
						
						
							
							Put along axis + fixe for partition grad ( #1430 )  
						
						... 
						
						
						
						* put along axis, fixes for partition grad
* zeros for arg reduce 
						
						
							
						
					 
					
						2024-09-23 10:03:38 -07:00 
						 
				 
			
				
					
						
							
							
								Luke Carlson 
							
						 
					 
					
						
						
							
						
						2b878e9dd7 
					 
					
						
						
							
							Create CITATION.cff ( #1425 )  
						
						
						
						
							
						
					 
					
						2024-09-20 11:39:46 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						67b6bf530d 
					 
					
						
						
							
							Optimization for general ND copies ( #1421 )  
						
						
						
						
							
						
					 
					
						2024-09-17 17:59:51 -07:00 
						 
				 
			
				
					
						
							
							
								Nripesh Niketan 
							
						 
					 
					
						
						
							
						
						6af5ca35b2 
					 
					
						
						
							
							feat: add cross_product ( #1252 )  
						
						... 
						
						
						
						* feat: add cross_product
* lint
* python binding
* refactor: Improve error message for cross_product function
* refactor: more close to numpy cross product
* refactor: improve error message for cross_product function
* finish
* fix acks
* allow old numpy
* doc
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-09-17 13:12:43 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						4f46e9c997 
					 
					
						
						
							
							More fixes for arrays with large sizes ( #1405 )  
						
						... 
						
						
						
						* compile works for big arrays when contiguous
* style
* nits in docs
* a bunch more stuff
* update jit
* update jit
* use constant for shapes and strides and remove elem_to_loc overload
* use kernel instantiation
* docs nits
* update binary and ternary
* comments 
						
						
							
						
					 
					
						2024-09-17 12:46:31 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						c6739ba7f3 
					 
					
						
						
							
							Faster RNN layers ( #1419 )  
						
						... 
						
						
						
						* faster rnn
* use admm 
						
						
							
						
					 
					
						2024-09-17 06:04:19 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						914409fef9 
					 
					
						
						
							
							Data parallel helper ( #1407 )  
						
						
						
						
							
						
					 
					
						2024-09-16 18:17:21 -07:00 
						 
				 
			
				
					
						
							
							
								jjuang-apple 
							
						 
					 
					
						
						
							
						
						8d68a3e805 
					 
					
						
						
							
							remove fmt dependencies from MLX install ( #1417 )  
						
						
						
						
							
						
					 
					
						2024-09-16 13:32:28 -07:00 
						 
				 
			
				
					
						
							
							
								jjuang-apple 
							
						 
					 
					
						
						
							
						
						6bbcc453ef 
					 
					
						
						
							
							avoid using find_library to make install truly portable ( #1416 )  
						
						
						
						
							
						
					 
					
						2024-09-16 13:21:32 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d5ed4d7a71 
					 
					
						
						
							
							override class function ( #1418 )  
						
						
						
						
							
						
					 
					
						2024-09-16 13:21:04 -07:00 
						 
				 
			
				
					
						
							
							
								Nripesh Niketan 
							
						 
					 
					
						
						
							
						
						669c27140d 
					 
					
						
						
							
							Chore: add pre-commit hook for cmake ( #1362 )  
						
						... 
						
						
						
						* reset and lint
* format
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-09-16 12:53:01 -07:00 
						 
				 
			
				
					
						
							
							
								Max-Heinrich Laves 
							
						 
					 
					
						
						
							
						
						adcc88e208 
					 
					
						
						
							
							Conv cpu improvements ( #1410 )  
						
						
						
						
							
						
					 
					
						2024-09-15 18:45:10 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d6492b0163 
					 
					
						
						
							
							fix clip ( #1415 )  
						
						
						
						
							
						
					 
					
						2024-09-14 16:09:09 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b3f52c9fbe 
					 
					
						
						
							
							ensure io/comm streams are active before eval ( #1412 )  
						
						
						
						
							
						
					 
					
						2024-09-14 06:17:36 -07:00 
						 
				 
			
				
					
						
							
							
								c0g 
							
						 
					 
					
						
						
							
						
						bd8396fad8 
					 
					
						
						
							
							Fix typo in transformer docs ( #1414 )  
						
						
						
						
							
						
					 
					
						2024-09-14 06:05:15 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						d0c58841d1 
					 
					
						
						
							
							Patch bump ( #1408 )  
						
						
						
						
							
 
						
					 
					
						2024-09-12 16:44:23 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						881f09b2e2 
					 
					
						
						
							
							Allow querying the allocator for the buffer size ( #1404 )  
						
						
						
						
							
						
					 
					
						2024-09-11 21:02:16 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						8b30acd7eb 
					 
					
						
						
							
							fix module attribute set, reset, set ( #1403 )  
						
						
						
						
							
						
					 
					
						2024-09-11 16:30:42 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						02efb310ca 
					 
					
						
						
							
							Xcode 160 ( #1384 )  
						
						... 
						
						
						
						* xcode 16.0 with debug tests
* limit nproc for builds
* vmap bug
* assert bug
* run python tests in debug mode
* fix view, bool copies preserve bits'
* actual view fix 
						
						
							
						
					 
					
						2024-09-10 15:15:17 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e7e59c6f05 
					 
					
						
						
							
							Fix copying scalars by adding fill_gpu ( #1402 )  
						
						... 
						
						
						
						* fix copying scalars by adding fill_gpu
* Another copy scalar changed to fill
---------
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com > 
						
						
							
						
					 
					
						2024-09-09 15:54:08 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3ae6aabe9f 
					 
					
						
						
							
							throw for certain cases of non captured inputs in compile ( #1401 )  
						
						
						
						
							
						
					 
					
						2024-09-09 14:54:31 -07:00 
						 
				 
			
				
					
						
							
							
								xnorai 
							
						 
					 
					
						
						
							
						
						dc627dcb5e 
					 
					
						
						
							
							Replace the use of result_of_t with invoke_result_t ( #1397 )  
						
						... 
						
						
						
						* Fix C++20 incompatibility
* Fix C++20 incompatibility 
						
						
							
						
					 
					
						2024-09-06 19:52:57 -07:00 
						 
				 
			
				
					
						
							
							
								Max-Heinrich Laves 
							
						 
					 
					
						
						
							
						
						efeb9c0f02 
					 
					
						
						
							
							Transposed Convolution ( #1245 )  
						
						... 
						
						
						
						* initial implementation for conv_transpose
ran pre-commit
implemented conv_transpose
updated conv_general docstring
updated conv_general docstring
updated code comments
removed commented run_conv_checks
updated acknowledgments
added missing entry to ops.rst
added op to nn.layers
resolved merge conflicts
* removed ConvolutionTranspose primitive as suggested by reviewer
removed ConvolutionTranspose primitive as suggested by reviewer
* remove transpose flag, add another test
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-09-06 19:52:38 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						ba3e913c7a 
					 
					
						
						
							
							Simplifications for MLX C ( #1396 )  
						
						... 
						
						
						
						* simplifications for MLX C
* use vectors instead of map
* update examples 
						
						
							
						
					 
					
						2024-09-06 19:16:50 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						7cca1727af 
					 
					
						
						
							
							Fix slice data size ( #1394 )  
						
						... 
						
						
						
						* fix slice data size and add tests
* fix contiguous flag
* simplify stride and perform copy for non-contiguous arrays
* fix cpu
* comment 
						
						
							
						
					 
					
						2024-09-04 19:10:43 -07:00 
						 
				 
			
				
					
						
							
							
								Bhargav Yagnik 
							
						 
					 
					
						
						
							
						
						11371fe251 
					 
					
						
						
							
							Test to prevent bugs like  #1386  ( #1391 )  
						
						... 
						
						
						
						* updated test_array for missing ops
* formatting changes 
						
						
							
						
					 
					
						2024-09-04 17:24:30 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						41c603d48a 
					 
					
						
						
							
							fix jit reduce ( #1395 )  
						
						
						
						
							
						
					 
					
						2024-09-04 14:03:10 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						969337345f 
					 
					
						
						
							
							Fix reduce edge case ( #1389 )  
						
						
						
						
							
						
					 
					
						2024-09-01 21:37:51 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						9592766939 
					 
					
						
						
							
							add std as method ( #1387 )  
						
						... 
						
						
						
						* add std as method
* add std as method 
						
						
							
						
					 
					
						2024-09-01 19:49:16 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						58dca7d846 
					 
					
						
						
							
							Fix copy in the sort primitive ( #1383 )  
						
						
						
						
							
						
					 
					
						2024-08-31 08:32:14 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						0d302cd25b 
					 
					
						
						
							
							Fix compiel with byte sized constants ( #1381 )  
						
						
						
						
							
						
					 
					
						2024-08-30 17:24:35 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						da691257ec 
					 
					
						
						
							
							Fix overflow in quantize/dequantize ( #1379 )  
						
						... 
						
						
						
						* add 2d indices to prevent overflow
* use nthreads not out size 
						
						
							
						
					 
					
						2024-08-30 13:32:41 -07:00