* initial encodec

* works

* nits

* use fast group norm

* fix for rnn layer

* fix mlx version

* use custom LSTM kernel

* audio encodec

* fix example, support batched inference

* nits
This commit is contained in:
Awni Hannun
2024-09-23 11:39:25 -07:00
committed by GitHub
parent 796d5e40e4
commit 9bb2dd62f3
10 changed files with 1267 additions and 0 deletions

View File

@@ -27,6 +27,7 @@ Some more useful examples are listed below.
### Audio Models
- Speech recognition with [OpenAI's Whisper](whisper).
- Audio compression and generation with [Meta's EnCodec](encodec).
### Multimodal models