Add grad checkpointing and PE in the transformer example (#387)

* Add grad checkpointing and PE in the transformer example

* Remove other frameworks from LM example

* Remove the other frameworks from MNIST example

* Improve the transformer LM example

* Fix black and change LR
This commit is contained in:
Angelos Katharopoulos
2024-02-01 13:04:03 -08:00
committed by GitHub
parent ec14583c2a
commit e9b32747b4
8 changed files with 36 additions and 946 deletions

View File

@@ -10,5 +10,3 @@ python main.py --gpu
```
By default the dataset is the [PTB corpus](https://paperswithcode.com/dataset/penn-treebank). Choose a different dataset with the `--dataset` option.
To run the PyTorch, Jax or TensorFlow examples install the respective framework.