mirror of
https://github.com/ml-explore/mlx-examples.git
synced 2025-09-01 12:49:50 +08:00
add speculative decoding example for llama (#149)
* speculative decoding * add sample 0 * spec decode gives same results as regular decode * rebase * use accept reject criteria * switch to t5 * update readme * readme nit * nits * nits * nits --------- Co-authored-by: Benjamin Anderson <benjamin@Benjamins-MBP.lan> Co-authored-by: Awni Hannun <awni@apple.com>
This commit is contained in:

committed by
GitHub

parent
07c163d9d9
commit
09566c7257
3
llms/speculative_decoding/requirements.txt
Normal file
3
llms/speculative_decoding/requirements.txt
Normal file
@@ -0,0 +1,3 @@
|
||||
mlx>=0.0.6
|
||||
transformers
|
||||
numpy
|
Reference in New Issue
Block a user