Commit Graph

6 Commits

Author SHA1 Message Date
Awni Hannun
2146bcd7ee
Quantize embedding / Update quantize API (#680)
* more async eval

* quantize embedding / update quantize api

* more updates for quantize

* update for quantize embeddings

* update sd quant API

* update sdxl quants

* error for datasets < batch_size

* async

* fix config loading

* fix quant

* fix tests

* fix req

* remove lm head if tie weights is true

* fix test
2024-04-18 18:16:10 -07:00
Awni Hannun
b8a348c1b8
Switch to fast RMS/LN Norm (#603)
* use nn.RMSNorm, use sdpa, cleanup

* bump mlx versions

* minor update

* use fast layer norm

* version bump

* update requirement for whisper

* update requirement for gguf
2024-03-23 07:13:51 -07:00
Awni Hannun
ec14583c2a
work with tuple shape (#393) 2024-02-01 13:03:47 -08:00
Vaibhav Srivastav
d4c3a9cb54
[Whisper] Add HF Hub upload option. (#254)
* Add HF Hub upload option.

* up.

* Add missing requirements.
2024-01-08 06:18:24 -08:00
Awni Hannun
172a60056f update whisper readme and requirements 2023-12-07 13:01:44 -08:00
Awni Hannun
b243c1d8f4 a few examples 2023-11-29 08:17:26 -08:00