Angelos Katharopoulos
e9b32747b4
Add grad checkpointing and PE in the transformer example ( #387 )
...
* Add grad checkpointing and PE in the transformer example
* Remove other frameworks from LM example
* Remove the other frameworks from MNIST example
* Improve the transformer LM example
* Fix black and change LR
2024-02-01 13:04:03 -08:00
Awni Hannun
5aa652d3c2
remove simplify ( #379 )
2024-01-26 13:54:49 -08:00
Awni Hannun
27c0a8c002
Add llms subdir + update README ( #145 )
...
* add llms subdir + update README
* nits
* use same pre-commit as mlx
* update readmes a bit
* format
2023-12-20 10:22:25 -08:00
Awni Hannun
98f4346c81
black format
2023-12-09 14:15:25 -08:00
Ondřej Sojka
5a0c36d8b6
Fix link in docstring in wikitext function
2023-12-07 14:19:55 +01:00
Awni Hannun
443037859a
Merge pull request #5 from jaglinux/patch-1
...
Update README.md
2023-12-06 21:07:13 -08:00
Leon Ericsson
e488831e03
bug fix in transformer_lm example
2023-12-06 15:41:46 -08:00
Jagadish Krishnamoorthy
5fc117025a
Update README.md
...
Fixed typo
2023-12-06 00:30:37 -08:00
Awni Hannun
31bc57c4ff
add copyright in source
2023-11-30 11:08:53 -08:00
Awni Hannun
b243c1d8f4
a few examples
2023-11-29 08:17:26 -08:00