Awni Hannun 
							
						 
					 
					
						
						
							
						
						b63ef10a7f 
					 
					
						
						
							
							Extensions ( #962 )  
						
						... 
						
						
						
						* start to fix extensions
* mostly fixed extensions
* fix extension build
* couple more nits 
						
						
							
						
					 
					
						2024-04-09 08:50:36 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						42afe27e12 
					 
					
						
						
							
							std and expm1 ( #973 )  
						
						... 
						
						
						
						* std and expm1
* actually add expm1
* fix linux
* fix vjp
* relax tol for linux test
* Add it to the compilable primitives
---------
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com > 
						
						
							
						
					 
					
						2024-04-08 14:26:01 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						76e63212ff 
					 
					
						
						
							
							Enable bfloat scan ( #974 )  
						
						... 
						
						
						
						* enable bfloat scan
* fix tests 
						
						
							
						
					 
					
						2024-04-08 12:29:19 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						aac2f9fb61 
					 
					
						
						
							
							Improve profiling with gpu tracing ( #969 )  
						
						... 
						
						
						
						* improve profiling with gpu tracing
* fix for linux
* nit
* doc fix
* fix example 
						
						
							
						
					 
					
						2024-04-07 21:47:43 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						bddf23f175 
					 
					
						
						
							
							patch bump ( #956 )  
						
						
						
						
							
 
						
					 
					
						2024-04-04 11:56:37 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						039da779d1 
					 
					
						
						
							
							No quant reshape ( #957 )  
						
						... 
						
						
						
						* precise option on cpu
* remove print
* remove reshape in quant matmul
* no quant reshape 
						
						
							
						
					 
					
						2024-04-04 11:52:12 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d88d2124b5 
					 
					
						
						
							
							segfaut layer norm grad ( #955 )  
						
						
						
						
							
						
					 
					
						2024-04-04 10:59:15 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e142aaf8a1 
					 
					
						
						
							
							Option for precise softmax ( #953 )  
						
						... 
						
						
						
						* precise softmax
* Add an equivalency check
* Make the threadgroup memory definition fixed
* precise cpu softmax
* precise option on cpu
* remove print
---------
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com > 
						
						
							
						
					 
					
						2024-04-04 08:32:35 -07:00 
						 
				 
			
				
					
						
							
							
								AmirHossein_Razlighi 
							
						 
					 
					
						
						
							
						
						0caf35f4b8 
					 
					
						
						
							
							Better exceptions in case of invalid operations on mlx.core.array ( #910 ) ( #926 )  
						
						... 
						
						
						
						* Nicer exceptions for ops on non-arrays 
						
						
							
						
					 
					
						2024-04-02 21:11:24 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						3fc993f82d 
					 
					
						
						
							
							Properly handle negative axes in python vmap ( #944 )  
						
						
						
						
							
						
					 
					
						2024-04-02 18:07:23 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						741eb28443 
					 
					
						
						
							
							fix a couple bugs ( #952 )  
						
						
						
						
							
						
					 
					
						2024-04-02 12:07:41 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						1a87dc5ea8 
					 
					
						
						
							
							Fix compile fusion for multi-output edge cases ( #950 )  
						
						... 
						
						
						
						* Fix compile fusion for multi-output edge cases
* Add a test for multi-output compile 
						
						
							
						
					 
					
						2024-04-02 08:42:31 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						2427fa171e 
					 
					
						
						
							
							Fix cpu compile ( #934 )  
						
						... 
						
						
						
						* fix one cpu bug, test for another
* format hooks
* simplify contiguity check for cpu compile
* fix
* add back donation
* comment 
						
						
							
						
					 
					
						2024-04-01 17:37:12 -07:00 
						 
				 
			
				
					
						
							
							
								Jagrit Digani 
							
						 
					 
					
						
						
							
						
						639e06e1f3 
					 
					
						
						
							
							Indexing bug fix ( #947 )  
						
						... 
						
						
						
						* Fix axes accounting
* Add tests 
						
						
							
						
					 
					
						2024-04-01 12:18:50 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						02fedbf1da 
					 
					
						
						
							
							Fix array initialization from list ( #942 )  
						
						... 
						
						
						
						* Fix array initialization from list
* Change the error message in the test 
						
						
							
						
					 
					
						2024-04-01 06:27:52 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						110d9b149d 
					 
					
						
						
							
							Layer norm grad fix donation bug ( #941 )  
						
						... 
						
						
						
						* add layer norm grad test
* Fix donation bug in layernorm vjp
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-04-01 06:15:50 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						9cbff5ec1d 
					 
					
						
						
							
							Fix typo in qmm check ( #940 )  
						
						
						
						
							
						
					 
					
						2024-03-31 19:15:44 -07:00 
						 
				 
			
				
					
						
							
							
								Suvan Kumar 
							
						 
					 
					
						
						
							
						
						433c0206b0 
					 
					
						
						
							
							Update saving_and_loading.rst ( #929 )  
						
						... 
						
						
						
						Update saving / load docs. 
						
						
							
						
					 
					
						2024-03-30 14:30:06 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						8915901966 
					 
					
						
						
							
							Donation bug ( #933 )  
						
						... 
						
						
						
						* donation
* buf
* fix bug in softmax
* comment
* remove print 
						
						
							
						
					 
					
						2024-03-30 10:08:54 -07:00 
						 
				 
			
				
					
						
							
							
								AmirHossein_Razlighi 
							
						 
					 
					
						
						
							
						
						f48bc496c7 
					 
					
						
						
							
							Comparing python objects (such as list/tuple) with mlx.core.array ( #920 )  
						
						... 
						
						
						
						* add implicit conversion of list to array for equality constraint
* add tests for array equality
* add test for tuple and array equality
* return False if __eq__ arg is list or tuple
* write tests for equality
* update the rule of comparison for __ge__/__gt__/__lt__/__le__
* add a helper function for detecting mlx.core.array
* return true in case fo inequality
* debug minor issue regarding detecting mlx array
* add tests for inequality comparisons
* add name for contribution
* reformat files using pre-commit
* update tests for float
* update tests for inequality
* raise exception in case of invalid comparisons
* use isinstance instead of string comparison
* replace "is_convirtable_to_array" with previous logic
* remove throwing exceptions for other operations
* just a comment
* minor changes for efficiency
* optimize a utils function
* change the function name
* Update ACKNOWLEDGMENTS.md
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com > 
						
						
							
						
					 
					
						2024-03-29 06:52:30 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						913b19329c 
					 
					
						
						
							
							Add missing && when forwarding args ( #925 )  
						
						... 
						
						
						
						Without the && args would be copied and perfect forwarding won't work. 
						
						
							
						
					 
					
						2024-03-29 06:48:29 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d8cb3128f6 
					 
					
						
						
							
							bump ( #924 )  
						
						... 
						
						
						
						* bump
* fix version 
						
						
							
 
						
					 
					
						2024-03-28 16:14:55 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						5f9ba3019f 
					 
					
						
						
							
							Fix qmm_t for unaligned cases ( #923 )  
						
						
						
						
							
						
					 
					
						2024-03-28 15:34:57 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						46caf0bef0 
					 
					
						
						
							
							Remove unnecessary string copies ( #891 )  
						
						... 
						
						
						
						1. Use string_view instead of string when there is no need for copy.
2. Otherwise move string when possible. 
						
						
							
						
					 
					
						2024-03-28 13:14:59 -07:00 
						 
				 
			
				
					
						
							
							
								Jack Mousseau 
							
						 
					 
					
						
						
							
						
						45f636e759 
					 
					
						
						
							
							Add Metal debug option and capture functions ( #707 )  
						
						... 
						
						
						
						* Add Metal debug option and capture functions
* Add brief Metal debugger documentation
* doc nits
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-03-28 09:40:31 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						a7b404ff53 
					 
					
						
						
							
							Use uintptr_t instead of size_t to store funtion id ( #916 )  
						
						... 
						
						
						
						Also does some small cleanup of the compile cache code. 
						
						
							
						
					 
					
						2024-03-28 06:37:59 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						c4fd0e5ede 
					 
					
						
						
							
							Fixes   #918  bug in compile_tests ( #919 )  
						
						
						
						
							
						
					 
					
						2024-03-27 22:37:37 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						bab5386306 
					 
					
						
						
							
							Make ops aware of rvalues: astype/as_strided/copy/full ( #895 )  
						
						... 
						
						
						
						When compositing transforms lots of temporary of arrays will be created
and passed to next primitive, and by making ops accepting args by value
we can avoid lots of copies of temporary arrays. 
						
						
							
						
					 
					
						2024-03-27 22:35:55 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						aca7584635 
					 
					
						
						
							
							Fix OOB read in qmv when non-divisible by blocksize ( #917 )  
						
						
						
						
							
						
					 
					
						2024-03-27 22:18:35 -07:00 
						 
				 
			
				
					
						
							
							
								AmirHossein_Razlighi 
							
						 
					 
					
						
						
							
						
						d611251502 
					 
					
						
						
							
							Support Chaining for some of functionalities of nn.Module ( #885 ) ( #897 )  
						
						... 
						
						
						
						* add chaining support for some of the functionalities of "nn.Module"
* reformat
* change the return types
* remove return types
* add return type with forward referencing
* add tests for chaining
* add name to contributors
* Update python/mlx/nn/layers/base.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* Update python/mlx/nn/layers/base.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* update docstring
* update docstrings
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com > 
						
						
							
						
					 
					
						2024-03-27 19:58:29 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						f30b659291 
					 
					
						
						
							
							Make MLX build on x64 macOS ( #901 )  
						
						... 
						
						
						
						The arm64 macbook pros are heavy and I usually care my intel one for
mobile, it would be nice if I can play with MLX on it.
To build with x64, user must pass `MLX_ENABLE_X64_MAC` to cmake:
CMAKE_ARGS='-DMLX_ENABLE_X64_MAC=ON' python setup.py 
						
						
							
						
					 
					
						2024-03-27 06:14:29 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						90dfa43ff1 
					 
					
						
						
							
							Don't use make_unique to create shared_ptr ( #902 )  
						
						... 
						
						
						
						The code compiled because shared_ptr's constructor actually accepts
unique_ptr. 
						
						
							
						
					 
					
						2024-03-27 06:13:29 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						dc175f08d3 
					 
					
						
						
							
							Fix race in multi-stream eval ( #911 )  
						
						... 
						
						
						
						* maybe fix race
* comment 
						
						
							
						
					 
					
						2024-03-26 16:36:36 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						29221fa238 
					 
					
						
						
							
							Implement vjps for some primitives in the fast namespace ( #883 )  
						
						... 
						
						
						
						* Implement rope vjp in terms of rope
* RMSNormVJP primitive and kernel
* Add LayerNormVJP primitive and kernel 
						
						
							
						
					 
					
						2024-03-26 16:35:34 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						a789685c63 
					 
					
						
						
							
							Remove duplicate defines of StreamOrDevice and is_big_endian ( #892 )  
						
						
						
						
							
						
					 
					
						2024-03-26 15:15:11 -07:00 
						 
				 
			
				
					
						
							
							
								Jagrit Digani 
							
						 
					 
					
						
						
							
						
						240d10699c 
					 
					
						
						
							
							Implement negative padding in conv with slicing ( #907 )  
						
						... 
						
						
						
						* Implement negative padding with slicing
* Update mlx/ops.cpp
Co-authored-by: Awni Hannun <awni@apple.com >
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-03-26 14:59:19 -07:00 
						 
				 
			
				
					
						
							
							
								Jagrit Digani 
							
						 
					 
					
						
						
							
						
						925014b661 
					 
					
						
						
							
							Fix multiblock sort limits ( #906 )  
						
						... 
						
						
						
						* Fix multiblock sort limits
* Fix metal validation error 
						
						
							
						
					 
					
						2024-03-26 14:00:00 -07:00 
						 
				 
			
				
					
						
							
							
								Abdussamet Türker 
							
						 
					 
					
						
						
							
						
						5611e1a95e 
					 
					
						
						
							
							Fix unsqueeze with None ( #899 )  
						
						... 
						
						
						
						* Fix unsqueeze with None
* Clean unnecessary files 
						
						
							
						
					 
					
						2024-03-26 13:59:44 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						570f2bf29e 
					 
					
						
						
							
							pick up preivously set attributes ( #905 )  
						
						
						
						
							
						
					 
					
						2024-03-26 11:19:59 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						9948eddf11 
					 
					
						
						
							
							Fix nan and improve speed for qvm ( #903 )  
						
						
						
						
							
						
					 
					
						2024-03-26 10:41:45 -07:00 
						 
				 
			
				
					
						
							
							
								Luca Arnaboldi 
							
						 
					 
					
						
						
							
						
						a3ee03da01 
					 
					
						
						
							
							Fixing random.normal for half-precision dtype  #642  ( #904 )  
						
						... 
						
						
						
						* Fixing random.normal for half-precision dtype #642 
* Update python/tests/test_random.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com > 
						
						
							
						
					 
					
						2024-03-26 09:58:27 -07:00 
						 
				 
			
				
					
						
							
							
								Cheng 
							
						 
					 
					
						
						
							
						
						28fcd2b519 
					 
					
						
						
							
							Add missing && when forwarding args ( #894 )  
						
						... 
						
						
						
						Without the && args would be copied and perfect forwarding won't work.
Also add template utils to make sure the function only forwards array
and not vector<array>. 
						
						
							
						
					 
					
						2024-03-25 14:55:54 -07:00 
						 
				 
			
				
					
						
							
							
								Jack Mousseau 
							
						 
					 
					
						
						
							
						
						8e686764ac 
					 
					
						
						
							
							Ensure shape dimensions are within supported integer range ( #566 ) ( #704 )  
						
						... 
						
						
						
						* Ensure shape dimensions are within supported integer range (#566 )
* fix build
* fix rebase bug
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-03-25 13:29:45 -07:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						479051ce1c 
					 
					
						
						
							
							add numeric type hierarchy and issubdtype as well as a set_dtype meth… ( #427 )  
						
						... 
						
						
						
						* add numeric type hierarchy and issubdtype as well as a set_dtype method to nn.Module with predicate
numeric type hierarchy and issubtype is compatible to the [numpy hierarchy](220f0ab2c5/numpy/_core/numerictypes.py (L42)Closes  #285 .
* nits in docs
* unify type category checking
* nits in docs
* nits in docs
* more docs nits
* fix callable type
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
							
						
					 
					
						2024-03-25 12:32:59 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						bfb5bad4f0 
					 
					
						
						
							
							patch ( #893 )  
						
						
						
						
							
 
						
					 
					
						2024-03-24 21:03:59 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1e16331d9c 
					 
					
						
						
							
							post nanobind docs fixes and some updates ( #889 )  
						
						... 
						
						
						
						* post nanobind docs fixes and some updates
* one more doc nit
* fix for stubs and latex 
						
						
							
						
					 
					
						2024-03-24 15:03:27 -07:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						be98f4ab6b 
					 
					
						
						
							
							Reduce a little overhead ( #871 )  
						
						... 
						
						
						
						* some small overhead improvements
* use result_type in rms_norm
* remove release force
* fix + use non-vector version
* revert compile change
* fix ops
* a little more overhead
* a little more cleanup and overhead 
						
						
							
						
					 
					
						2024-03-22 17:29:36 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						6ee1112f30 
					 
					
						
						
							
							Fix copy donation and add partial rope ( #881 )  
						
						
						
						
							
						
					 
					
						2024-03-22 17:28:26 -07:00 
						 
				 
			
				
					
						
							
							
								Jagrit Digani 
							
						 
					 
					
						
						
							
						
						8e5a5a1ccd 
					 
					
						
						
							
							Set item bug fix ( #879 )  
						
						... 
						
						
						
						* set item shaping bug fix
* Add extra tests 
						
						
							
						
					 
					
						2024-03-22 12:11:17 -07:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						fcda3a0e66 
					 
					
						
						
							
							Increase test tolerance for fast.layer_norm ( #880 )  
						
						
						
						
							
						
					 
					
						2024-03-22 12:10:27 -07:00