Awni Hannun 
							
						 
					 
					
						
						
							
						
						76f275b4df 
					 
					
						
						
							
							error in rms for wrong size ( #1562 )  
						
						
						
						
					 
					
						2024-11-04 13:24:02 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						62f297b51d 
					 
					
						
						
							
							Sdpa fix ( #1558 )  
						
						
						
						
					 
					
						2024-11-02 21:25:46 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						09bc32f62f 
					 
					
						
						
							
							No extra reshape ( #1557 )  
						
						... 
						
						
						
						* no extra reshape
* lint 
						
						
					 
					
						2024-11-02 19:07:20 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						9e516b71ea 
					 
					
						
						
							
							Add dispatchThreads to custom kernel doc ( #1551 )  
						
						... 
						
						
						
						* add dispatchThreads info
* update
* add link 
						
						
					 
					
						2024-11-01 13:07:48 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						57c6aa7188 
					 
					
						
						
							
							fix multi output leak ( #1548 )  
						
						
						
						
					 
					
						2024-10-31 09:32:01 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						4f72c66911 
					 
					
						
						
							
							improvements to scatter / gather ( #1541 )  
						
						
						
						
					 
					
						2024-10-30 19:30:54 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						048fabdabd 
					 
					
						
						
							
							Fix vmap constant output size ( #1524 )  
						
						... 
						
						
						
						* use inputs to determine output size
* remove noop vmap tests 
						
						
					 
					
						2024-10-30 16:16:53 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d2ff04a4f2 
					 
					
						
						
							
							fix format ( #1539 )  
						
						
						
						
					 
					
						2024-10-28 18:29:14 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						0eb56d5be0 
					 
					
						
						
							
							Wired ( #1510 )  
						
						... 
						
						
						
						* expose residency sets as wire/unwire
* returns wired size
* fix
* runtime support check
* fix os check
* fix test
* fix no metal build
* docs
* nit
* nits in docs
* nits 
						
						
					 
					
						2024-10-25 09:35:33 -07:00 
						 
				 
			
				
					
						
							
							
								Venkata Naga Aditya Datta Chivukula 
							
						 
					 
					
						
						
							
						
						430ffef58a 
					 
					
						
						
							
							[Feature] Added Sparse Initialization ( #1498 )  
						
						... 
						
						
						
						Co-authored-by: Saanidhyavats <saanidhyavats@gmail.com > 
						
						
					 
					
						2024-10-24 12:31:24 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						3d17077187 
					 
					
						
						
							
							Add mx.array.__format__ ( #1521 )  
						
						... 
						
						
						
						* add __format__
* actually test something
* fix 
						
						
					 
					
						2024-10-24 11:11:39 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						c9b41d460f 
					 
					
						
						
							
							Working 64-bit scans ( #1506 )  
						
						
						
						
					 
					
						2024-10-24 11:05:46 -07:00 
						 
				 
			
				
					
						
							
							
								Kashif Rasul 
							
						 
					 
					
						
						
							
						
						3ddc07e936 
					 
					
						
						
							
							Eigenvalues and eigenvectors ( #1334 )  
						
						... 
						
						
						
						* initial eigvalsh
* add compute_vectors
* add compute_vectors_
* return a pair
* add eigh to return only eigenvectors
* fixed typo
* merge merge Eighvalsh and Eigh into a single primitive
* use the same primate with the flag
* fix primatives
* use MULTI
* fix eval_gpu
* fix decleration
* rename EighPrimitive to Eigh
* tests
* tests
* fix rebase and format
* cleanup lapack
* format
* add cblas.h
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-10-22 12:18:48 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						c26208f67d 
					 
					
						
						
							
							Remove Hazard tracking with Fences ( #1509 )  
						
						... 
						
						
						
						* remove hazard tracking
* with fence map
* no hazard tracking with fences
* nits
* fix fence retain
* cleanup
* fix quantized rebase 
						
						
					 
					
						2024-10-21 19:33:32 -07:00 
						 
				 
			
				
					
						
							
							
								Alex Barron 
							
						 
					 
					
						
						
							
						
						d15fa13daf 
					 
					
						
						
							
							Batched Quantized Matmul + Fast Small QMV ( #1503 )  
						
						... 
						
						
						
						* add fast qmv for small dims
* fix test
* batched cpu
* add batched template param
* refactor metal quantized.cpp 
						
						
					 
					
						2024-10-21 16:23:17 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						92d7cb71f8 
					 
					
						
						
							
							Fix compile ( #1501 )  
						
						... 
						
						
						
						* fix compile
* fix space 
						
						
					 
					
						2024-10-18 11:06:40 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						9dd72cd421 
					 
					
						
						
							
							fix gumbel ( #1495 )  
						
						
						
						
					 
					
						2024-10-17 13:52:39 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f9f8c167d4 
					 
					
						
						
							
							fix submodule stubs ( #1492 )  
						
						
						
						
					 
					
						2024-10-15 16:23:37 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3f86399922 
					 
					
						
						
							
							Real and Imag ( #1490 )  
						
						... 
						
						
						
						* real and imag
* fix
* fix 
						
						
					 
					
						2024-10-15 16:23:15 -07:00 
						 
				 
			
				
					
						
							
							
								LastWhisper 
							
						 
					 
					
						
						
							
						
						2b8ace6a03 
					 
					
						
						
							
							Typing the dropout. ( #1479 )  
						
						
						
						
					 
					
						2024-10-15 06:45:46 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						0ab8e099e8 
					 
					
						
						
							
							Fix cpu segfault ( #1488 )  
						
						... 
						
						
						
						* fix cpu segfault
* nit in tests 
						
						
					 
					
						2024-10-14 16:17:03 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						020f048cd0 
					 
					
						
						
							
							A few updates for CPU ( #1482 )  
						
						... 
						
						
						
						* some updates
* format
* fix
* nit 
						
						
					 
					
						2024-10-14 12:45:49 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						881615b072 
					 
					
						
						
							
							Faster metal compiled kernels + some fixes ( #1486 )  
						
						... 
						
						
						
						* bump mac tests to use py39
* work per thread for compiled kernels
* fixe for large arrays
* fix 
						
						
					 
					
						2024-10-14 12:45:38 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						bf6ec92216 
					 
					
						
						
							
							Make the GPU device more thread safe ( #1478 )  
						
						... 
						
						
						
						* gpu stream safety
* comment
* fix 
						
						
					 
					
						2024-10-12 17:49:15 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e1c9600da3 
					 
					
						
						
							
							Add mx.random.permutation ( #1471 )  
						
						... 
						
						
						
						* random permutation
* comment 
						
						
					 
					
						2024-10-08 19:42:19 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1fa0d20a30 
					 
					
						
						
							
							consistently handle all -inf in softmax ( #1470 )  
						
						
						
						
					 
					
						2024-10-08 09:54:02 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3274c6a087 
					 
					
						
						
							
							Fix array is_available race cases ( #1468 )  
						
						
						
						
					 
					
						2024-10-07 19:13:50 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						9b12093739 
					 
					
						
						
							
							Add the roll op ( #1455 )  
						
						
						
						
					 
					
						2024-10-07 17:21:42 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f374b6ca4d 
					 
					
						
						
							
							Bump nanobind to 2.2 ( #1461 )  
						
						... 
						
						
						
						* bump nanobind
* extension version for tests 
						
						
					 
					
						2024-10-07 16:52:40 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						0070e1db40 
					 
					
						
						
							
							Fix deep recursion with siblings ( #1462 )  
						
						... 
						
						
						
						* fix recursion with siblings
* fix
* add test
* increase tol 
						
						
					 
					
						2024-10-07 06:15:33 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e4534dac17 
					 
					
						
						
							
							Conv grad with groups + bugfix ( #1449 )  
						
						... 
						
						
						
						* fix bug in flipped conv with groups, start of grad for groups
* fix
* fix
* fix + test 
						
						
					 
					
						2024-10-06 07:08:53 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1bdc038bf9 
					 
					
						
						
							
							fix argpartition + faster {arg} sorts / partitions ( #1453 )  
						
						
						
						
					 
					
						2024-10-03 14:21:25 -07:00 
						 
				 
			
				
					
						
							
							
								Lucas Newman 
							
						 
					 
					
						
						
							
						
						4a64d4bff1 
					 
					
						
						
							
							Add support for grouped 1D convolutions to the nn API ( #1444 )  
						
						... 
						
						
						
						* Fix the weight shape for grouped convolutions from the nn API.
* Add tests.
* Pre-commit formatting.
* Add input validation.
* Use integer division instead of casting.
* docs
* nit
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-09-28 06:41:07 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						718aea3f1d 
					 
					
						
						
							
							allow take to work with integer index ( #1440 )  
						
						
						
						
					 
					
						2024-09-26 15:58:03 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						afc9c0ec1b 
					 
					
						
						
							
							dtype is copy assignable ( #1436 )  
						
						
						
						
					 
					
						2024-09-25 12:07:13 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						195b429d99 
					 
					
						
						
							
							Put along axis + fixe for partition grad ( #1430 )  
						
						... 
						
						
						
						* put along axis, fixes for partition grad
* zeros for arg reduce 
						
						
					 
					
						2024-09-23 10:03:38 -07:00 
						 
				 
			
				
					
						
							
							
								Nripesh Niketan 
							
						 
					 
					
						
						
							
						
						6af5ca35b2 
					 
					
						
						
							
							feat: add cross_product ( #1252 )  
						
						... 
						
						
						
						* feat: add cross_product
* lint
* python binding
* refactor: Improve error message for cross_product function
* refactor: more close to numpy cross product
* refactor: improve error message for cross_product function
* finish
* fix acks
* allow old numpy
* doc
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-09-17 13:12:43 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						4f46e9c997 
					 
					
						
						
							
							More fixes for arrays with large sizes ( #1405 )  
						
						... 
						
						
						
						* compile works for big arrays when contiguous
* style
* nits in docs
* a bunch more stuff
* update jit
* update jit
* use constant for shapes and strides and remove elem_to_loc overload
* use kernel instantiation
* docs nits
* update binary and ternary
* comments 
						
						
					 
					
						2024-09-17 12:46:31 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						c6739ba7f3 
					 
					
						
						
							
							Faster RNN layers ( #1419 )  
						
						... 
						
						
						
						* faster rnn
* use admm 
						
						
					 
					
						2024-09-17 06:04:19 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						914409fef9 
					 
					
						
						
							
							Data parallel helper ( #1407 )  
						
						
						
						
					 
					
						2024-09-16 18:17:21 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d5ed4d7a71 
					 
					
						
						
							
							override class function ( #1418 )  
						
						
						
						
					 
					
						2024-09-16 13:21:04 -07:00 
						 
				 
			
				
					
						
							
							
								Nripesh Niketan 
							
						 
					 
					
						
						
							
						
						669c27140d 
					 
					
						
						
							
							Chore: add pre-commit hook for cmake ( #1362 )  
						
						... 
						
						
						
						* reset and lint
* format
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-09-16 12:53:01 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d6492b0163 
					 
					
						
						
							
							fix clip ( #1415 )  
						
						
						
						
					 
					
						2024-09-14 16:09:09 -07:00 
						 
				 
			
				
					
						
							
							
								c0g 
							
						 
					 
					
						
						
							
						
						bd8396fad8 
					 
					
						
						
							
							Fix typo in transformer docs ( #1414 )  
						
						
						
						
					 
					
						2024-09-14 06:05:15 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						8b30acd7eb 
					 
					
						
						
							
							fix module attribute set, reset, set ( #1403 )  
						
						
						
						
					 
					
						2024-09-11 16:30:42 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						02efb310ca 
					 
					
						
						
							
							Xcode 160 ( #1384 )  
						
						... 
						
						
						
						* xcode 16.0 with debug tests
* limit nproc for builds
* vmap bug
* assert bug
* run python tests in debug mode
* fix view, bool copies preserve bits'
* actual view fix 
						
						
					 
					
						2024-09-10 15:15:17 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3ae6aabe9f 
					 
					
						
						
							
							throw for certain cases of non captured inputs in compile ( #1401 )  
						
						
						
						
					 
					
						2024-09-09 14:54:31 -07:00 
						 
				 
			
				
					
						
							
							
								Max-Heinrich Laves 
							
						 
					 
					
						
						
							
						
						efeb9c0f02 
					 
					
						
						
							
							Transposed Convolution ( #1245 )  
						
						... 
						
						
						
						* initial implementation for conv_transpose
ran pre-commit
implemented conv_transpose
updated conv_general docstring
updated conv_general docstring
updated code comments
removed commented run_conv_checks
updated acknowledgments
added missing entry to ops.rst
added op to nn.layers
resolved merge conflicts
* removed ConvolutionTranspose primitive as suggested by reviewer
removed ConvolutionTranspose primitive as suggested by reviewer
* remove transpose flag, add another test
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-09-06 19:52:38 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						ba3e913c7a 
					 
					
						
						
							
							Simplifications for MLX C ( #1396 )  
						
						... 
						
						
						
						* simplifications for MLX C
* use vectors instead of map
* update examples 
						
						
					 
					
						2024-09-06 19:16:50 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						7cca1727af 
					 
					
						
						
							
							Fix slice data size ( #1394 )  
						
						... 
						
						
						
						* fix slice data size and add tests
* fix contiguous flag
* simplify stride and perform copy for non-contiguous arrays
* fix cpu
* comment 
						
						
					 
					
						2024-09-04 19:10:43 -07:00