Awni Hannun
|
913fd33c9c
|
style nits in docs
|
2023-12-27 08:45:08 -08:00 |
|
YUN, Junwoo
|
47a64c480b
|
Merge branch 'main' into transformer_fix
|
2023-12-27 02:18:34 +08:00 |
|
junwoo-yun
|
0372cbbcbf
|
precommit
|
2023-12-27 02:16:26 +08:00 |
|
junwoo-yun
|
bee11e79d9
|
add doctstring
|
2023-12-27 02:15:24 +08:00 |
|
Yutaka Kondo
|
fc4e5b476b
|
Fix llama link in README.md (#289)
|
2023-12-25 20:53:20 -08:00 |
|
Daniel Strobusch
|
d58ac083f3
|
expose itemsize and nbytes as for numpy arrays (#284)
see:
* https://numpy.org/doc/stable/reference/generated/numpy.ndarray.nbytes.html
* https://numpy.org/doc/stable/reference/generated/numpy.ndarray.itemsize.html
relates to https://github.com/ml-explore/mlx-examples/pull/174
|
2023-12-25 10:34:28 -08:00 |
|
__mo_san__
|
a123c3c7d2
|
implement-batch-norm-layer (#217)
- Add batch normalization layer
---------
Co-authored-by: Robert McCraith <mccraithrobert@gmail.com>
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-25 07:32:53 -08:00 |
|
junwoo-yun
|
88a94b9db8
|
run precommit
|
2023-12-25 07:52:19 +08:00 |
|
junwoo-yun
|
0e0557b756
|
run precommit
|
2023-12-25 07:50:54 +08:00 |
|
junwoo-yun
|
297e69017c
|
add docstring, activation, norm_first
|
2023-12-25 07:39:42 +08:00 |
|
Angelos Katharopoulos
|
9e6b8c9f48
|
Refactor the reduction kernels (#277)
|
2023-12-24 14:47:57 -08:00 |
|
Zach Schillaci
|
22fee5a383
|
Remove redundant assert in losses.py (#281)
|
2023-12-24 08:39:08 -08:00 |
|
Daniel Strobusch
|
7365d142a3
|
random.uniform must respect dtype, even if lower precision than "low" (#280)
Fix an edge case where random uniform returns a float32 array, even if a lower precision dtype is wanted due to adding the float32 "low" array.
|
2023-12-24 07:04:43 -08:00 |
|
Awni Hannun
|
8b227fa9af
|
fix no metal build (#276)
|
2023-12-23 19:18:10 -08:00 |
|
Vidit Agarwal
|
8c3da54c7d
|
Fix failing test for log cosh loss (#275)
* fix assert statement in log_cosh_loss
* reformatted by pre-commit black
|
2023-12-23 16:26:46 -08:00 |
|
Vidit Agarwal
|
acf1721b98
|
Corrected the example of value_and_grad (#274)
* Corrected the example for mx.value_and_grad
* Reformat through pre-commit/black
|
2023-12-23 11:06:38 -08:00 |
|
Finn Voorhees
|
f91f450141
|
Fix argmax returns documentation (#263)
|
2023-12-22 20:33:17 -08:00 |
|
Ronan Collobert
|
cd3616a463
|
Revisit autorelease memory pools (#260)
* make general autorelease pool part of metal device
* make things simpler
* no metal backend support
* new_memory_pool -> new_scoped_memory_pool
|
2023-12-22 11:01:26 -08:00 |
|
Nicholas Santavas
|
d35fa1db41
|
Add Hinge, Huber and LogCosh losses (#199)
|
2023-12-22 10:28:10 -08:00 |
|
Justin Deschenaux
|
e8deca84e0
|
Add dropout2d (#250)
|
2023-12-22 08:02:29 -08:00 |
|
Angelos Katharopoulos
|
8385f93cea
|
Bumping the version (#256)
|
2023-12-21 18:33:14 -08:00 |
|
Awni Hannun
|
2118c3dbfa
|
fix (#255)
|
2023-12-21 18:18:41 -08:00 |
|
Awni Hannun
|
a002797d52
|
A temporary fix (#254)
|
2023-12-21 17:59:15 -08:00 |
|
Angelos Katharopoulos
|
1d053e0d1d
|
Fix the alibi test that was left unchanged (#252)
|
2023-12-21 14:59:25 -08:00 |
|
Hazem Essam
|
0aa65c7a6b
|
Added ALiBi implementation (#232)
|
2023-12-21 14:36:38 -08:00 |
|
Daniel Strobusch
|
794feb83df
|
support arange for bfloat16 (#245)
|
2023-12-21 14:33:43 -08:00 |
|
Angelos Katharopoulos
|
2c7df6795e
|
Make sure that arrays are freed when saving (#247)
|
2023-12-21 14:08:24 -08:00 |
|
Angelos Katharopoulos
|
b3916cbf2b
|
Improve names of quantization arguments (#235)
* Change the default quantization group_size to 64
* Rename groups to group_size and width to bits
|
2023-12-20 16:53:53 -08:00 |
|
Angelos Katharopoulos
|
57fe918cf8
|
Adds C++ and nn quantization utilities (#230)
* Add C++ de-/quantize ops
* Add quantize functions to the docs and tests
* Add a QuantizedLinear module
|
2023-12-20 14:17:38 -08:00 |
|
Justin Deschenaux
|
4912ff3ec2
|
Add Lion optimizer (#209)
* Add Lion optimizer
* Update acknowledgements also with past contributions
|
2023-12-20 13:54:58 -08:00 |
|
Awni Hannun
|
f40d17047d
|
Indexing bug (#233)
* fix
* test
|
2023-12-20 10:44:01 -08:00 |
|
Angelos Katharopoulos
|
2807c6aff0
|
Implements divide for integer types and adds floor_divide op (#228)
* Add floor_divide
* Add floor_divide to the tests
* Add floor_divide to the docs
|
2023-12-19 20:12:19 -08:00 |
|
davidkoski
|
de892cb66c
|
fix for non-macos build issue on cblas.h (#227)
|
2023-12-19 17:01:59 -08:00 |
|
davidkoski
|
37024d899c
|
fixes for building with swiftpm (#225)
- clbas is part of veclib (compile failure)
- add SWIFTPM_BUNDLE #define to allow loading the metallib from a swiftpm resource bundle
|
2023-12-19 16:22:10 -08:00 |
|
Diogo
|
137f55bf28
|
fail early if readinto does not exist (#221)
|
2023-12-19 13:27:17 -08:00 |
|
Emircan Erol
|
e549f84532
|
Triplet Loss (#211)
* Triplet Loss
* Requested Changes
* Margin to alpha
|
2023-12-19 12:37:12 -08:00 |
|
Angelos Katharopoulos
|
dfa9f4bc58
|
An initial quantized matmul implementation (#205)
* Add quantized matvec
* Add quantized matrix matrix with 2nd matrix transposed
* Add quantized matmul tests
* Add a slow cpu quantized matmul
* Add a slightly faster vectorized cpu version
|
2023-12-18 23:18:57 -08:00 |
|
Abe Leininger
|
e6872a4149
|
Added linspace (#181)
* linspace ops support
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-18 19:57:55 -08:00 |
|
Juarez Bochi
|
f4f6e17d45
|
Fix cross-attention (#210)
* Fix cross-attention
With the current code, ln2 is a no-op. Its output should be passed to the cross-attention layer
* Add name to contributors
|
2023-12-18 12:27:27 -08:00 |
|
Angelos Katharopoulos
|
4d4af12c6f
|
Adds round op and primitive (#203)
|
2023-12-18 11:32:48 -08:00 |
|
Awni Hannun
|
477397bc98
|
Citation + Contributor acknowledgment section (#207)
* cite
* nits
* nits
* comment
|
2023-12-18 10:07:00 -08:00 |
|
jojopuppet
|
18cca64c81
|
Add smoothed L1 loss and enhancements to cross entropy loss (#166)
* Add smooth_l1_loss
* Add labels moothing for cross entropy loss
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-18 07:26:21 -08:00 |
|
Awni Hannun
|
0e5807bbcb
|
include optional (#202)
|
2023-12-17 22:01:35 -08:00 |
|
Cyril Zakka, MD
|
8eb56beb3a
|
Added clip function (#159)
* Added clip
* Added Python bindings
* Formatting
* Added cpp tests
* Added Python tests
* python bindings work
* rebase
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-17 20:00:29 -08:00 |
|
Awni Hannun
|
ee0c2835c5
|
Docs updates (#198)
Reorganize NN docs + a few other tidbits.
|
2023-12-17 13:20:55 -08:00 |
|
Awni Hannun
|
90d04072b7
|
fix build w/ flatten (#195)
|
2023-12-17 11:58:45 -08:00 |
|
__mo_san__
|
52e1589a52
|
implemented Flatten Module (#149)
* implemented flatten op
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-16 21:54:37 -08:00 |
|
YUN, Junwoo
|
eebd7c275d
|
Add optimizers (AdaMax, AdaDelta, RMSprop) and ordering optimizer classes (#142)
* Add AdaMax, AdaDelta, RMSprop
|
2023-12-16 21:43:15 -08:00 |
|
Austin Liu
|
a67bbfe745
|
Update docs (#177) (#190)
* update docs (fix #177)
* reorder
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-16 06:52:18 -08:00 |
|
Awni Hannun
|
104c34f906
|
setite negative indexing bug (#189)
|
2023-12-16 06:44:47 -08:00 |
|