Angelos Katharopoulos
|
7f8c961287
|
Fix setattr for the TokenizerWrapper (#961)
|
2024-08-28 14:47:33 -07:00 |
|
Awni Hannun
|
9f10728145
|
fix yi (#852)
|
2024-06-27 06:38:19 -07:00 |
|
Awni Hannun
|
ee60e2a9d5
|
Kv cache (#643)
* in place kv_cache
* fix
* fix kv cache size
* partially fix kv cache dtype
* step kv cache
* multiple of step size
* more teests + kv cache
* more kv cache
* udpate all models to use kv cache
|
2024-05-08 08:18:13 -07:00 |
|
Awni Hannun
|
2146bcd7ee
|
Quantize embedding / Update quantize API (#680)
* more async eval
* quantize embedding / update quantize api
* more updates for quantize
* update for quantize embeddings
* update sd quant API
* update sdxl quants
* error for datasets < batch_size
* async
* fix config loading
* fix quant
* fix tests
* fix req
* remove lm head if tie weights is true
* fix test
|
2024-04-18 18:16:10 -07:00 |
|
Angelos Katharopoulos
|
e55a9e8cb4
|
Add an SPM detokenizer that doesn't trim initial space (#681)
|
2024-04-15 14:15:25 -07:00 |
|
Angelos Katharopoulos
|
1278994b56
|
Add streaming detokenizers (#651)
|
2024-04-08 22:36:01 -07:00 |
|