Awni Hannun
95840f32e2
Fix whipser conversion for safetensors models ( #935 )
...
* fix whipser conversion for safetensor only. error in mlx lm for existing paths
* fix tests
2024-08-14 10:22:04 -07:00
Awni Hannun
33905447f9
Whisper updates to allow HF models ( #923 )
...
* simplify conversion and update convert for HF models
* use npz for compat
* fixes
* fixes
* fix gguf
* allow user supplied path
2024-08-09 11:11:58 -07:00
madroid
6775d6cb3f
Whisper: Add pip distribution configuration to support pip installations. ( #739 )
...
* Whisper: rename whisper to mlx_whisper
* Whisper: add setup.py config for publish
* Whisper: add assets data to setup config
* Whisper: pre-commit for setup.py
* Whisper: Update README.md
* Whisper: Update README.md
* nits
* fix package data
* nit in readme
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2024-05-01 09:00:02 -07:00
Awni Hannun
2146bcd7ee
Quantize embedding / Update quantize API ( #680 )
...
* more async eval
* quantize embedding / update quantize api
* more updates for quantize
* update for quantize embeddings
* update sd quant API
* update sdxl quants
* error for datasets < batch_size
* async
* fix config loading
* fix quant
* fix tests
* fix req
* remove lm head if tie weights is true
* fix test
2024-04-18 18:16:10 -07:00
Awni Hannun
78c431dc25
cleanup whisper a little ( #639 )
2024-03-30 13:13:58 -07:00
Vaibhav Srivastav
d4c3a9cb54
[Whisper] Add HF Hub upload option. ( #254 )
...
* Add HF Hub upload option.
* up.
* Add missing requirements.
2024-01-08 06:18:24 -08:00
bofeng huang
bf9926489e
[Whisper] Add word timestamps and confidence scores ( #201 )
...
* Add word timestamps and confidence scores
* Create a separate forward_with_cross_qk function
* Move multiple ops from np to mlx, clean comments
* Save alignment_heads
* Cast qk to fp32
* Add test for word-level timestamps and confidence scores
* format + readme
* nit
---------
Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-07 10:01:29 -08:00
Awni Hannun
a5d6d0436c
Support Hugging Face models ( #215 )
...
* support hf direct models
2024-01-03 15:13:26 -08:00
bofeng huang
581a5733a1
[Whisper] Load customized MLX model & Quantization ( #191 )
...
* Add option to load customized mlx model
* Add quantization
* Apply reviews
* Separate model conversion and loading
* Update test
* Fix benchmark
* Add notes about conversion
* Improve doc
2023-12-29 10:22:15 -08:00