Commit Graph

4 Commits

Author SHA1 Message Date
Awni Hannun
782f5a71b7 reorg + fixes to caching, unify prompt caching across types and use cases for e.g. caching during a chat 2024-10-05 14:49:39 -07:00
Angelos Katharopoulos
324184d670 Fix the cache_prompt (#979) 2024-09-06 20:19:27 -07:00
Awni Hannun
b1186e2a81 Docs on prompt scaling (#963)
* docs on prompt scaling

* remove unused var

* nits
2024-08-29 15:05:17 -07:00
Angelos Katharopoulos
1003a8b2dd Add the ability to load the KV cache from a file (#956) 2024-08-28 22:11:45 -07:00