Volodymyr Kyrylov
7979b84a9e
transformer_lm: add --dataset enwik8 ( #838 )
...
* transformer_lm: add --dataset enwik8
* nits
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2024-06-26 11:59:01 -07:00
Kosti
cfeef6d9d2
Typo: SGD->AdamW ( #471 )
2024-02-20 15:47:17 -08:00
Lee Harrold
70465b8cda
clean up loss function extraction ( #433 )
...
Co-authored-by: Lee Harrold <lhharrold@sep.com>
2024-02-12 05:46:00 -08:00
Awni Hannun
f45a1ab83c
Update a few examples to use compile ( #420 )
...
* update a few examples to use compile
* update mnist
* add compile to vae and rename some stuff for simplicity
* update reqs
* use state in eval
* GCN example with RNG + dropout
* add a bit of prefetching
2024-02-08 13:00:41 -08:00
Angelos Katharopoulos
e9b32747b4
Add grad checkpointing and PE in the transformer example ( #387 )
...
* Add grad checkpointing and PE in the transformer example
* Remove other frameworks from LM example
* Remove the other frameworks from MNIST example
* Improve the transformer LM example
* Fix black and change LR
2024-02-01 13:04:03 -08:00
Awni Hannun
5aa652d3c2
remove simplify ( #379 )
2024-01-26 13:54:49 -08:00
Awni Hannun
27c0a8c002
Add llms subdir + update README ( #145 )
...
* add llms subdir + update README
* nits
* use same pre-commit as mlx
* update readmes a bit
* format
2023-12-20 10:22:25 -08:00
Awni Hannun
98f4346c81
black format
2023-12-09 14:15:25 -08:00
Ondřej Sojka
5a0c36d8b6
Fix link in docstring in wikitext function
2023-12-07 14:19:55 +01:00
Awni Hannun
443037859a
Merge pull request #5 from jaglinux/patch-1
...
Update README.md
2023-12-06 21:07:13 -08:00
Leon Ericsson
e488831e03
bug fix in transformer_lm example
2023-12-06 15:41:46 -08:00
Jagadish Krishnamoorthy
5fc117025a
Update README.md
...
Fixed typo
2023-12-06 00:30:37 -08:00
Awni Hannun
31bc57c4ff
add copyright in source
2023-11-30 11:08:53 -08:00
Awni Hannun
b243c1d8f4
a few examples
2023-11-29 08:17:26 -08:00