Angelos Katharopoulos
e9b32747b4
Add grad checkpointing and PE in the transformer example ( #387 )
...
* Add grad checkpointing and PE in the transformer example
* Remove other frameworks from LM example
* Remove the other frameworks from MNIST example
* Improve the transformer LM example
* Fix black and change LR
2024-02-01 13:04:03 -08:00
AtomicVar
2ba5d3db14
Refactor activation function and loss calculation ( #325 )
2024-01-16 13:42:56 -08:00
Vidyasagar Bhargava
647e48870a
updated README ( #184 )
2023-12-24 06:19:53 -08:00
Kashif Rasul
0371d90ccb
fashion-mnist example ( #180 )
...
* fashion mnist example
* fix from review
2023-12-23 07:34:45 -08:00
Awni Hannun
27c0a8c002
Add llms subdir + update README ( #145 )
...
* add llms subdir + update README
* nits
* use same pre-commit as mlx
* update readmes a bit
* format
2023-12-20 10:22:25 -08:00
jj701
bd742ec03c
Adding Requirements.txt
2023-12-11 20:45:39 -06:00
Awni Hannun
31bc57c4ff
add copyright in source
2023-11-30 11:08:53 -08:00
Awni Hannun
b243c1d8f4
a few examples
2023-11-29 08:17:26 -08:00