add speculative decoding example for llama (#149)

* speculative decoding

* add sample 0

* spec decode gives same results as regular decode

* rebase

* use accept reject criteria

* switch to t5

* update readme

* readme nit

* nits

* nits

* nits

---------

Co-authored-by: Benjamin Anderson <benjamin@Benjamins-MBP.lan>
Co-authored-by: Awni Hannun <awni@apple.com>
This commit is contained in:
Benjamin Anderson
2023-12-28 17:20:43 -06:00
committed by GitHub
parent 07c163d9d9
commit 09566c7257
7 changed files with 775 additions and 1 deletions

View File

@@ -0,0 +1,3 @@
mlx>=0.0.6
transformers
numpy