Awni Hannun
2146bcd7ee
Quantize embedding / Update quantize API ( #680 )
...
* more async eval
* quantize embedding / update quantize api
* more updates for quantize
* update for quantize embeddings
* update sd quant API
* update sdxl quants
* error for datasets < batch_size
* async
* fix config loading
* fix quant
* fix tests
* fix req
* remove lm head if tie weights is true
* fix test
2024-04-18 18:16:10 -07:00
Awni Hannun
b8a348c1b8
Switch to fast RMS/LN Norm ( #603 )
...
* use nn.RMSNorm, use sdpa, cleanup
* bump mlx versions
* minor update
* use fast layer norm
* version bump
* update requirement for whisper
* update requirement for gguf
2024-03-23 07:13:51 -07:00
Awni Hannun
ec14583c2a
work with tuple shape ( #393 )
2024-02-01 13:03:47 -08:00
Vaibhav Srivastav
d4c3a9cb54
[Whisper] Add HF Hub upload option. ( #254 )
...
* Add HF Hub upload option.
* up.
* Add missing requirements.
2024-01-08 06:18:24 -08:00
Awni Hannun
172a60056f
update whisper readme and requirements
2023-12-07 13:01:44 -08:00
Awni Hannun
b243c1d8f4
a few examples
2023-11-29 08:17:26 -08:00