dc-dc-dc
|
c043d27682
|
more cleanup
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
d3b1bea251
|
remove utils header
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
472ce433f8
|
removed custom json parser for nlohmann
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
99b9c1dac5
|
fix doc
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
c88d3174aa
|
added save bindings and fixed header
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
9a39254959
|
add python bindings for loading
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
dcfa2700f6
|
use the correct type
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
801c1ad3a4
|
ignore __metadata__
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
f4d876d35f
|
load changes were not needed after all
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
60132a16de
|
uint64_t instead of uint32_t
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
c432de8013
|
removed debugging from load
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
18a1c335d0
|
updated tests
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
f09bcc7d50
|
saving works
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
5aa0b1f632
|
expand dtype support
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
ba869e5e71
|
remove safe file
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
5b4155d4d0
|
more tests
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
42baa095d1
|
it's working
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
9be3ea69ee
|
switch to unordered map
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
fef579cec1
|
fixed array parsing
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
91495382fd
|
more cleanup
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
d0285db98c
|
parse the number from json
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
be075a82bf
|
small cleanup
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
bd422decc4
|
covering more cases
|
2023-12-22 01:16:04 -05:00 |
|
dc-dc-dc
|
87ec7b3cf9
|
initial commit
|
2023-12-22 01:16:04 -05:00 |
|
Angelos Katharopoulos
|
8385f93cea
|
Bumping the version (#256)
|
2023-12-21 18:33:14 -08:00 |
|
Awni Hannun
|
2118c3dbfa
|
fix (#255)
|
2023-12-21 18:18:41 -08:00 |
|
Awni Hannun
|
a002797d52
|
A temporary fix (#254)
|
2023-12-21 17:59:15 -08:00 |
|
Angelos Katharopoulos
|
1d053e0d1d
|
Fix the alibi test that was left unchanged (#252)
|
2023-12-21 14:59:25 -08:00 |
|
Hazem Essam
|
0aa65c7a6b
|
Added ALiBi implementation (#232)
|
2023-12-21 14:36:38 -08:00 |
|
Daniel Strobusch
|
794feb83df
|
support arange for bfloat16 (#245)
|
2023-12-21 14:33:43 -08:00 |
|
Angelos Katharopoulos
|
2c7df6795e
|
Make sure that arrays are freed when saving (#247)
|
2023-12-21 14:08:24 -08:00 |
|
Angelos Katharopoulos
|
b3916cbf2b
|
Improve names of quantization arguments (#235)
* Change the default quantization group_size to 64
* Rename groups to group_size and width to bits
|
2023-12-20 16:53:53 -08:00 |
|
Angelos Katharopoulos
|
57fe918cf8
|
Adds C++ and nn quantization utilities (#230)
* Add C++ de-/quantize ops
* Add quantize functions to the docs and tests
* Add a QuantizedLinear module
|
2023-12-20 14:17:38 -08:00 |
|
Justin Deschenaux
|
4912ff3ec2
|
Add Lion optimizer (#209)
* Add Lion optimizer
* Update acknowledgements also with past contributions
|
2023-12-20 13:54:58 -08:00 |
|
Awni Hannun
|
f40d17047d
|
Indexing bug (#233)
* fix
* test
|
2023-12-20 10:44:01 -08:00 |
|
Angelos Katharopoulos
|
2807c6aff0
|
Implements divide for integer types and adds floor_divide op (#228)
* Add floor_divide
* Add floor_divide to the tests
* Add floor_divide to the docs
|
2023-12-19 20:12:19 -08:00 |
|
davidkoski
|
de892cb66c
|
fix for non-macos build issue on cblas.h (#227)
|
2023-12-19 17:01:59 -08:00 |
|
davidkoski
|
37024d899c
|
fixes for building with swiftpm (#225)
- cblas is part of veclib (compile failure)
- add SWIFTPM_BUNDLE #define to allow loading the metallib from a swiftpm resource bundle
|
2023-12-19 16:22:10 -08:00 |
|
Diogo
|
137f55bf28
|
fail early if readinto does not exist (#221)
|
2023-12-19 13:27:17 -08:00 |
|
Emircan Erol
|
e549f84532
|
Triplet Loss (#211)
* Triplet Loss
* Requested Changes
* Margin to alpha
|
2023-12-19 12:37:12 -08:00 |
|
Angelos Katharopoulos
|
dfa9f4bc58
|
An initial quantized matmul implementation (#205)
* Add quantized matvec
* Add quantized matrix matrix with 2nd matrix transposed
* Add quantized matmul tests
* Add a slow cpu quantized matmul
* Add a slightly faster vectorized cpu version
|
2023-12-18 23:18:57 -08:00 |
|
Abe Leininger
|
e6872a4149
|
Added linspace (#181)
* linspace ops support
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-18 19:57:55 -08:00 |
|
Juarez Bochi
|
f4f6e17d45
|
Fix cross-attention (#210)
* Fix cross-attention
With the current code, ln2 is a no-op. Its output should be passed to the cross-attention layer
* Add name to contributors
|
2023-12-18 12:27:27 -08:00 |
|
Angelos Katharopoulos
|
4d4af12c6f
|
Adds round op and primitive (#203)
|
2023-12-18 11:32:48 -08:00 |
|
Awni Hannun
|
477397bc98
|
Citation + Contributor acknowledgment section (#207)
* cite
* nits
* nits
* comment
|
2023-12-18 10:07:00 -08:00 |
|
jojopuppet
|
18cca64c81
|
Add smoothed L1 loss and enhancements to cross entropy loss (#166)
* Add smooth_l1_loss
* Add label smoothing for cross entropy loss
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-18 07:26:21 -08:00 |
|
Awni Hannun
|
0e5807bbcb
|
include optional (#202)
|
2023-12-17 22:01:35 -08:00 |
|
Cyril Zakka, MD
|
8eb56beb3a
|
Added clip function (#159)
* Added clip
* Added Python bindings
* Formatting
* Added cpp tests
* Added Python tests
* python bindings work
* rebase
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-17 20:00:29 -08:00 |
|
Awni Hannun
|
ee0c2835c5
|
Docs updates (#198)
Reorganize NN docs + a few other tidbits.
|
2023-12-17 13:20:55 -08:00 |
|
Awni Hannun
|
90d04072b7
|
fix build w/ flatten (#195)
|
2023-12-17 11:58:45 -08:00 |
|