Diogo 
							
						 
					 
					
						
						
							
						
						2e29d0815b 
					 
					
						
						
							
							Add tile op ( #438 )  
						
						
						
						
					 
					
						2024-01-12 23:03:16 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1b71487e1f 
					 
					
						
						
							
							docs ( #444 )  
						
						
						
						
					 
					
						2024-01-12 13:34:16 -08:00 
						 
				 
			
				
					
						
							
							
								Ayush Shridhar 
							
						 
					 
					
						
						
							
						
						1416e7b664 
					 
					
						
						
							
							Add isnan ( #423 )  
						
						
						
						
					 
					
						2024-01-12 11:16:48 -08:00 
						 
				 
			
				
					
						
							
							
								davidkoski 
							
						 
					 
					
						
						
							
						
						29081204d1 
					 
					
						
						
							
							array.swapaxes should point to swapaxes free function ( #441 )  
						
						
						
						
					 
					
						2024-01-12 11:06:16 -08:00 
						 
				 
			
				
					
						
							
							
								Avikant Srivastava 
							
						 
					 
					
						
						
							
						
						975e265f74 
					 
					
						
						
							
							feat: Add numpy constants ( #428 )  
						
						... 
						
						
						
						* add numpy constants
* feat: add unittests
* add newaxis
* add test for newaxis transformation
* refactor 
						
						
					 
					
						2024-01-11 06:47:29 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						3b4f066dac 
					 
					
						
						
							
							Correct types for vjp + tests ( #418 )  
						
						... 
						
						
						
						* correct types for vjp + tests
* fix build + comment 
						
						
					 
					
						2024-01-10 13:32:37 -08:00 
						 
				 
			
				
					
						
							
							
								Juarez Bochi 
							
						 
					 
					
						
						
							
						
						b7f905787e 
					 
					
						
						
							
							GGUF support ( #350 )  
						
						... 
						
						
						
						* Initial GGUF support for tensor fields.
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-10 13:22:48 -08:00 
						 
				 
			
				
					
						
							
							
								Chunyang Wen 
							
						 
					 
					
						
						
							
						
						e3e933c6bc 
					 
					
						
						
							
							Add type hint for Module ( #412 )  
						
						
						
						
					 
					
						2024-01-10 11:23:42 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						1d90a76d63 
					 
					
						
						
							
							in place ops behave in place, fix some overloads ( #411 )  
						
						
						
						
					 
					
						2024-01-09 16:05:38 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						961435a243 
					 
					
						
						
							
							Scatter vjp ( #394 )  
						
						... 
						
						
						
						* Add a first scatter vjp
* Implement the scatter_add vjp
* Add array.at to implement user friendly scatters 
						
						
					 
					
						2024-01-09 13:36:51 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						e9ca65c939 
					 
					
						
						
							
							Fix BN stats to not expand shape ( #409 )  
						
						... 
						
						
						
						* fix BN stats to not expand shape
* nit 
						
						
					 
					
						2024-01-09 11:54:51 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						f099ebe535 
					 
					
						
						
							
							Multi output primitives ( #330 )  
						
						... 
						
						
						
						* Multi-output primitives
---------
Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com > 
						
						
					 
					
						2024-01-08 16:39:08 -08:00 
						 
				 
			
				
					
						
							
							
								YUN, Junwoo 
							
						 
					 
					
						
						
							
						
						0b8aeddac6 
					 
					
						
						
							
							Additoinal losses ( #336 )  
						
						... 
						
						
						
						* cosine similarity loss
---------
Co-authored-by: Awni Hannun <awni@apple.com >
* Docstring nits 
						
						
					 
					
						2024-01-08 14:01:13 -08:00 
						 
				 
			
				
					
						
							
							
								Nripesh Niketan 
							
						 
					 
					
						
						
							
						
						73321b8097 
					 
					
						
						
							
							feat: add logicalAnd and logicalOR ( #386 )  
						
						... 
						
						
						
						* feat: add logicalAnd and logicalOR
* run pre-commit
* Refactor logical_and and logical_or functions
* Add acknowledgement
* Add logical AND and logical OR operators
* Refactor logical_and and logical_or functions
* Add support for logical operators on bool arrays
* Update mlx/ops.cpp
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* Update mlx/ops.cpp
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
* Add logical AND and OR operators for arrays and scalars
* Refactor vjp and jvp methods in primitives.cpp
* Add overloaded operators for logical AND and OR
* format
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com >
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-08 07:00:05 -08:00 
						 
				 
			
				
					
						
							
							
								Hazem Essam 
							
						 
					 
					
						
						
							
						
						022a944367 
					 
					
						
						
							
							Added GLU activation function and Gated activation function ( #329 )  
						
						... 
						
						
						
						* Added GLU activation function and gated activation function
* Ran pre-commit
* Ran pre commit
* Removed old sigmoid implementation to match with main
* Removed gated activation from __init__.py
* Removed unused test cases
* Removed unused imports
* format / docstring
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-08 06:13:16 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						a611b0bc82 
					 
					
						
						
							
							Removes the retain_graph flag ( #385 )  
						
						... 
						
						
						
						* Adds global tracing flag
* Removes retain_graph in favor of is_tracer 
						
						
					 
					
						2024-01-07 15:16:51 -08:00 
						 
				 
			
				
					
						
							
							
								Diogo 
							
						 
					 
					
						
						
							
						
						449b43762e 
					 
					
						
						
							
							Add inner / outer op ( #348 )  
						
						... 
						
						
						
						* inner / outer impl
* python tests
* ops list and ack
* updated descriptions
* use test helper
* removed dtype check and flatten outer to 1-D
* updated docs
* just use the reshape to flatten 
						
						
					 
					
						2024-01-07 09:01:09 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						6ea6b4258d 
					 
					
						
						
							
							Fix style check ( #395 )  
						
						
						
						
					 
					
						2024-01-07 05:54:58 -08:00 
						 
				 
			
				
					
						
							
							
								Anchen 
							
						 
					 
					
						
						
							
						
						48f6ca8c3a 
					 
					
						
						
							
							Add theta cache for Rope and mask cache for ALiBi ( #375 )  
						
						
						
						
					 
					
						2024-01-07 00:22:58 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b34bf5d52b 
					 
					
						
						
							
							fix saving for non-contiguous arrays ( #389 )  
						
						
						
						
					 
					
						2024-01-06 12:44:02 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						4c48f6460d 
					 
					
						
						
							
							Fix segfault from buffer protocol and tests ( #383 )  
						
						... 
						
						
						
						* Fix segfault from buffer protocol and tests
* Fix tf test 
						
						
					 
					
						2024-01-05 18:17:44 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						1331fa19f6 
					 
					
						
						
							
							Make array conform to the Python Buffer Protocol ( #323 )  
						
						
						
						
					 
					
						2024-01-05 15:58:33 -08:00 
						 
				 
			
				
					
						
							
							
								Daniel Strobusch 
							
						 
					 
					
						
						
							
						
						dfdb284e16 
					 
					
						
						
							
							make behaviour of dtype arguments consistent and compliant to numpy ( #379 )  
						
						... 
						
						
						
						All functions that take an optional dtype should
* have a default dtype visible in the generated docs (accomplished via `"dtype"_a = std::optional{float32}`)
* behave identical when `dtype=None` or no dtype is passed
This important when passing kw args down from a numpy function like:
```
def f(x, dtype=None):
  mx.random.uniform(dtype=dtype)
  # ...
```
NumPy functions behave like this.
It also fixes a minor bug in `tri`: #378 
Closes  #378  
						
						
					 
					
						2024-01-05 09:37:46 -08:00 
						 
				 
			
				
					
						
							
							
								mutexuan 
							
						 
					 
					
						
						
							
						
						d8f41a5c0f 
					 
					
						
						
							
							support python mlx.array creation from list of mlx.array's ( #325 )  
						
						... 
						
						
						
						* support python mlx.array creation from list of mlx.array's
* include bfloat16 in UT
* refactor so that sub array made of all python primitive types gets initialized by fill_vector
* address PR comment: arr.shape().size() -> arr.ndim()
* address PR comment: get back Dtype constness and let stack to handle type promotions automatically 
						
						
					 
					
						2024-01-04 18:53:33 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						b9e415d19c 
					 
					
						
						
							
							bump pre commit and fix format ( #373 )  
						
						
						
						
					 
					
						2024-01-04 16:28:52 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						75dc537e44 
					 
					
						
						
							
							Fix the sigmoid module ( #371 )  
						
						
						
						
					 
					
						2024-01-04 13:16:36 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						cf88db44b5 
					 
					
						
						
							
							revert copy ( #366 )  
						
						
						
						
					 
					
						2024-01-04 10:43:29 -08:00 
						 
				 
			
				
					
						
							
							
								Chunyang Wen 
							
						 
					 
					
						
						
							
						
						16856a0160 
					 
					
						
						
							
							Remove useless pass ( #364 )  
						
						... 
						
						
						
						Co-authored-by: Chunyang Wen <chunyang_wen@apple.com > 
						
						
					 
					
						2024-01-04 06:34:01 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						d752f8e142 
					 
					
						
						
							
							Fix CI ( #359 )  
						
						... 
						
						
						
						* fix ci
* check for linux for fp16 
						
						
					 
					
						2024-01-04 06:33:08 -08:00 
						 
				 
			
				
					
						
							
							
								toji 
							
						 
					 
					
						
						
							
						
						d2467c320d 
					 
					
						
						
							
							Added support for python copy ( #335 )  
						
						... 
						
						
						
						* Added support for python copy
* precommit changes
* removed `_compiled_call_impl` line
* added tests and suggested changes
* ACK changes 
						
						
					 
					
						2024-01-03 20:59:40 -08:00 
						 
				 
			
				
					
						
							
							
								Diogo 
							
						 
					 
					
						
						
							
						
						0d31128a44 
					 
					
						
						
							
							use union instead of | ( #358 )  
						
						
						
						
					 
					
						2024-01-03 19:33:19 -08:00 
						 
				 
			
				
					
						
							
							
								Diogo 
							
						 
					 
					
						
						
							
						
						1ac18eac20 
					 
					
						
						
							
							simple numpy helper for tests ( #352 )  
						
						
						
						
					 
					
						2024-01-03 19:19:19 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						e7f5059fe4 
					 
					
						
						
							
							Support for quantized matmul with w and w^T ( #349 )  
						
						... 
						
						
						
						* Add the metal qvm implementation
* Add qmm_n
* Add gradient wrt to input for quantized_matmul 
						
						
					 
					
						2024-01-03 14:22:36 -08:00 
						 
				 
			
				
					
						
							
							
								Gabrijel Boduljak 
							
						 
					 
					
						
						
							
						
						c7edafb729 
					 
					
						
						
							
							implemented InstanceNorm ( #244 )  
						
						... 
						
						
						
						* implemented instancenorm
* implemented vector_norm in cpp
added linalg to mlx
* implemented vector_norm python binding
* renamed vector_norm to norm, implemented norm without provided ord
* completed the implementation of the norm
* added tests
* removed unused import in linalg.cpp
* updated python bindings
* added some tests for python bindings
* handling inf, -inf as numpy does, more extensive tests of compatibility with numpy
* added better docs and examples
* refactored mlx.linalg.norm bindings
* reused existing util for implementation of linalg.norm
* more tests
* fixed a bug with no ord and axis provided
* removed unused imports
* some style and API consistency updates to linalg norm
* remove unused includes
* fix python tests
* fixed a bug with frobenius norm of a complex-valued matrix
* complex for vector too
* addressed PR review comments
* fixed import order in __init__
* expected values in instancenorm tests are simple lists
* minor return expression style change
* added InstanceNorm to docs
* doc string nits
* added myself to individual contributors
---------
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2024-01-03 12:21:15 -08:00 
						 
				 
			
				
					
						
							
							
								Awni Hannun 
							
						 
					 
					
						
						
							
						
						dff4a3833f 
					 
					
						
						
							
							Module checks the weight on load_weights ( #337 )  
						
						... 
						
						
						
						* update module to check weights on load, also fix docs and reorganize tests
* nits + rebase
* a few more docs updates for Module
* use manual module file
* comment 
						
						
					 
					
						2024-01-02 18:55:42 -08:00 
						 
				 
			
				
					
						
							
							
								Diogo 
							
						 
					 
					
						
						
							
						
						0782a4573a 
					 
					
						
						
							
							Add Tensordot op ( #344 )  
						
						
						
						
					 
					
						2024-01-02 17:15:00 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						436bec9fd9 
					 
					
						
						
							
							Fix the implementation of the Bilinear layer ( #347 )  
						
						
						
						
					 
					
						2024-01-02 16:46:18 -08:00 
						 
				 
			
				
					
						
							
							
								Asaf Zorea 
							
						 
					 
					
						
						
							
						
						295ce9db09 
					 
					
						
						
							
							Feature expand nn linear ( #315 )  
						
						... 
						
						
						
						* Added an identity and bilinear layers
Added a reset_parameters option
Added normal init for bias
* pre-commit run
* add type hints for parameters and the return type
change Bilinear math to x_1 and x_2
change __call__ arguments to x and y instead of input and output
add explanation to the Initialization
* Remove unnecessary reshape
* Added 'i' to bilinear formula
* Changed bilinear computation to two matrix multiplications
* avoid saving intermediate results, kept y in bilinear for better clarity (can be replaced with x1)
* Changed math formula in Linear
Added more explanation to math formulas
Changed x1, x2 reshape to support all inputs sizes 
						
						
					 
					
						2024-01-02 06:08:53 -08:00 
						 
				 
			
				
					
						
							
							
								Josh Soref 
							
						 
					 
					
						
						
							
						
						44c1ce5e6a 
					 
					
						
						
							
							Spelling ( #342 )  
						
						... 
						
						
						
						* spelling: accumulates
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: across
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: additional
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: against
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: among
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: array
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: at least
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: available
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: axes
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: basically
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: bfloat
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: bounds
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: broadcast
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: buffer
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: class
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: coefficients
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: collision
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: combinations
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: committing
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: computation
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: consider
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: constructing
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: conversions
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: correctly
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: corresponding
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: declaration
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: default
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: dependency
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: destination
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: destructor
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: dimensions
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: divided
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: element-wise
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: elements
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: endianness
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: equivalent
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: explicitly
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: github
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: indices
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: irregularly
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: memory
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: metallib
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: negative
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: notable
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: optional
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: otherwise
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: overridden
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: partially
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: partition
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: perform
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: perturbations
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: positively
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: primitive
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: repeat
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: repeats
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: respect
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: respectively
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: result
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: rounding
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: separate
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: skipping
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: structure
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: the
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: transpose
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: unnecessary
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: unneeded
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
* spelling: unsupported
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com >
---------
Signed-off-by: Josh Soref <2119212+jsoref@users.noreply.github.com > 
						
						
					 
					
						2024-01-01 21:08:17 -08:00 
						 
				 
			
				
					
						
							
							
								mutexuan 
							
						 
					 
					
						
						
							
						
						350095ce6e 
					 
					
						
						
							
							fix type cast error in item() for bfloat16 ( #339 )  
						
						... 
						
						
						
						Co-authored-by: xuan <xuan@apple.com > 
						
						
					 
					
						2024-01-01 19:02:04 -08:00 
						 
				 
			
				
					
						
							
							
								Nripesh Niketan 
							
						 
					 
					
						
						
							
						
						e09bf35b28 
					 
					
						
						
							
							feat: Add Dropout3d layer to nn.layers ( #313 )  
						
						... 
						
						
						
						* feat: Add Dropout3d layer to nn.layers
* acknowledgement
* Add dropout tests to test_nn.py
* run pre-commit
* Add activation functions and dropout3d ops
* Add dropout tests for bfloat16 and float16 
						
						
					 
					
						2023-12-31 14:01:21 -08:00 
						 
				 
			
				
					
						
							
							
								Hazem Essam 
							
						 
					 
					
						
						
							
						
						e3b8da2a49 
					 
					
						
						
							
							Added implementation for Scaled RoPE. ( #261 )  
						
						... 
						
						
						
						* Added scale for RoPE
* Ran pre-commit
* Added RoPE scaling test
* Added docstring for scale parameter
* Modified docstrings 
						
						
					 
					
						2023-12-31 06:06:01 -08:00 
						 
				 
			
				
					
						
							
							
								Nripesh Niketan 
							
						 
					 
					
						
						
							
						
						5ad8fb7268 
					 
					
						
						
							
							feat: add softsign, softmax, hardswish, logsoftmax activation function ( #309 )  
						
						... 
						
						
						
						* feat: add softsign activation function
* run pre-commit
* Add Softsign activation function
* Add Softsign activation function
* Add documentation for ReLU6, Softplus, and Softsign activations
* Update activation functions in neural network layers
* Add LogSoftmax and Hardswish activations
* run pre-commit
* Update activations.py
* Added acknowledgements
* Fix activation function comments
* Fix activation functions in neural network layers 
						
						
					 
					
						2023-12-29 11:49:36 -08:00 
						 
				 
			
				
					
						
							
							
								Chunyang Wen 
							
						 
					 
					
						
						
							
						
						2aedf3e791 
					 
					
						
						
							
							Minor refactor for tree_map and tree_unflatten ( #311 )  
						
						... 
						
						
						
						* Minor refact for tree_map and tree_unflatten
* Remove the if statement
---------
Co-authored-by: Chunyang Wen <chunyang_wen@apple.com > 
						
						
					 
					
						2023-12-28 20:55:10 -08:00 
						 
				 
			
				
					
						
							
							
								Chunyang Wen 
							
						 
					 
					
						
						
							
						
						473b6b43b4 
					 
					
						
						
							
							Use defaultdict ( #307 )  
						
						... 
						
						
						
						Co-authored-by: Chunyang Wen <chunyang_wen@apple.com > 
						
						
					 
					
						2023-12-28 14:46:13 -08:00 
						 
				 
			
				
					
						
							
							
								Angelos Katharopoulos 
							
						 
					 
					
						
						
							
						
						d29770eeaa 
					 
					
						
						
							
							Update batchnorm to have the running stats in parameters ( #305 )  
						
						
						
						
					 
					
						2023-12-28 14:31:10 -08:00 
						 
				 
			
				
					
						
							
							
								Chunyang Wen 
							
						 
					 
					
						
						
							
						
						040c3bafab 
					 
					
						
						
							
							Add missing f str ( #306 )  
						
						... 
						
						
						
						Co-authored-by: Chunyang Wen <chunyang_wen@apple.com > 
						
						
					 
					
						2023-12-28 06:09:34 -08:00 
						 
				 
			
				
					
						
							
							
								Chunyang Wen 
							
						 
					 
					
						
						
							
						
						05767b026f 
					 
					
						
						
							
							Add information for dropout probability ( #304 )  
						
						... 
						
						
						
						Co-authored-by: Chunyang Wen <chunyang_wen@apple.com > 
						
						
					 
					
						2023-12-27 21:51:30 -08:00 
						 
				 
			
				
					
						
							
							
								Bahaa 
							
						 
					 
					
						
						
							
						
						ff2b58e299 
					 
					
						
						
							
							Add support for repeat ( #278 )  
						
						... 
						
						
						
						* add repeat function
* fix styling
* optimizing repeat
* fixed minor issues
* not sure why that folder is there xD
* fixed now for sure
* test repeat not repeat test
* Fixed
---------
Co-authored-by: Bahaa Eddin tabbakha <bahaa@Bahaas-MacBook-Pro.local > 
						
						
					 
					
						2023-12-27 13:11:38 -08:00 
						 
				 
			
				
					
						
							
							
								YUN, Junwoo 
							
						 
					 
					
						
						
							
						
						4417e37ede 
					 
					
						
						
							
							Transformer fix ( #167 )  
						
						... 
						
						
						
						* add transformer with dropout, fix transformer ffm, layernorm order
* precommit changes
* precommit changes
* add docstring, activation, norm_first
* run precommit
* run precommit
* add doctstring
* precommit
* style nits in docs
---------
Co-authored-by: junwoo-yun <junwoo.yun@bagelcode.com >
Co-authored-by: Awni Hannun <awni@apple.com > 
						
						
					 
					
						2023-12-27 08:48:36 -08:00