25 Commits

Author SHA1 Message Date
Awni Hannun
ec14583c2a
work with tuple shape (#393) 2024-02-01 13:03:47 -08:00
Yousif
7575125d5d
Added lora support for Phi-2 (#302)
* Added lora support for Phi-2

* Added Phi-2 support in fuse and convert

* format + readme

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-12 13:45:30 -08:00
Alexandre Boucaud
3ac731dd4f
Fix TypeError in whisper benchmark script (#306)
* Add missing keyword to the decoding options

* Reverting last commit

* Fixing transcribe keyword in benchmark.py

* Add argument name to load_model

This is intended to avoid confusion
2024-01-12 13:07:15 -08:00
Awni Hannun
c1342b8e89
Use pip for mlx data with speech commands (#307)
* update to use pypi mlx data

* nit in readme
2024-01-12 11:06:33 -08:00
Awni Hannun
80d18671ad
[Lora] Fix generate (#282)
* fix generate

* update readme, fix test, better default

* nits

* typo
2024-01-10 16:13:06 -08:00
Vaibhav Srivastav
bb35e878cb
[Whisper] Add load from Hub. (#255)
* Add load from Hub.

* Up.
2024-01-08 06:20:00 -08:00
Vaibhav Srivastav
d4c3a9cb54
[Whisper] Add HF Hub upload option. (#254)
* Add HF Hub upload option.

* up.

* Add missing requirements.
2024-01-08 06:18:24 -08:00
bofeng huang
bf9926489e
[Whisper] Add word timestamps and confidence scores (#201)
* Add word timestamps and confidence scores

* Create a separate forward_with_cross_qk function

* Move multiple ops from np to mlx, clean comments

* Save alignment_heads

* Cast qk to fp32

* Add test for word-level timestamps and confidence scores

* format + readme

* nit

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-07 10:01:29 -08:00
Awni Hannun
a5d6d0436c
Support Hugging Face models (#215)
* support hf direct models
2024-01-03 15:13:26 -08:00
bofeng huang
581a5733a1
[Whisper] Load customized MLX model & Quantization (#191)
* Add option to load customized mlx model

* Add quantization

* Apply reviews

* Separate model conversion and loading

* Update test

* Fix benchmark

* Add notes about conversion

* Improve doc
2023-12-29 10:22:15 -08:00
Dimo
07c163d9d9
[Whisper] Large-v3 requires 128 Mel frequency bins (#193)
* Large-v3 requires 128 Mel frequency bins

* extract correct model dimensions and use argparse

* format

* format

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2023-12-28 13:50:35 -08:00
bofeng huang
e1e56a625b
Fix benchmark (#200) 2023-12-28 11:29:39 -08:00
Awni Hannun
27c0a8c002
Add llms subdir + update README (#145)
* add llms subdir + update README

* nits

* use same pre-commit as mlx

* update readmes a bit

* format
2023-12-20 10:22:25 -08:00
Awni Hannun
b863e7cca0 format 2023-12-14 16:56:50 -08:00
Stv.X
cbae83e011 Corrected spelling of terms in whisper/README.md 2023-12-14 08:15:26 +08:00
bofenghuang
4b1a06c0cb Fix fp16 2023-12-13 11:07:47 +01:00
Awni Hannun
74c4ed40d2
Merge pull request #76 from bofenghuang/add-whisper-large-v3
Add whisper-large-v3
2023-12-12 20:22:31 -08:00
bofenghuang
94705ed38b Add large v3 2023-12-12 17:26:52 +01:00
Awni Hannun
6e723a015a whisper default in fp16 2023-12-12 07:37:35 -08:00
Awni Hannun
172a60056f update whisper readme and requirements 2023-12-07 13:01:44 -08:00
Awni Hannun
54952a0d80
Merge pull request #12 from chatgpt-1/main
Fix: timestamp extraction bug in transcribe function
2023-12-07 08:53:30 -08:00
adhishthite
9cf82a0d43 Benchmark all models if user allows. 2023-12-07 00:07:42 +05:30
crackerben
6cbc029450 Fix timestamp extraction bug in transcribe function 2023-12-06 20:34:30 +08:00
Awni Hannun
31bc57c4ff add copyright in source 2023-11-30 11:08:53 -08:00
Awni Hannun
b243c1d8f4 a few examples 2023-11-29 08:17:26 -08:00