mirror of
https://github.com/ml-explore/mlx-examples.git
synced 2025-06-24 01:17:28 +08:00

* Add grad checkpointing and PE in the transformer example * Remove other frameworks from LM example * Remove the other frameworks from MNIST example * Improve the transformer LM example * Fix black and change LR
316 B
316 B
Transformer LM
This is an example of a decoder-only Transformer LM. The only dependency is MLX.
Run the example on the GPU with:
python main.py --gpu
By default the dataset is the PTB corpus. Choose a different dataset with the --dataset
option.