Vaibhav Srivastav
|
0eaa323c10
|
Fix conversion + inference errors. - Mistral (#176)
* Fix conversion + inference errors.
* wire rope_theta throuugh to nn.RoPE
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-22 14:10:25 -08:00 |
|
Todsaporn Banjerdkit
|
7ae445f6c7
|
feat: add mistral tps (#173)
* feat: add mistral tps
* eval params before timing + format
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2023-12-22 07:55:57 -08:00 |
|
Awni Hannun
|
3cf436b529
|
Quantize example (#162)
* testing quantization
* conversion + quantization working
* one config processor
* quantization in mistral / nits in llama
* args for quantization
* llama / mistral conversion in good shape
* phi2 quantized
* mixtral
* qwen conversion
|
2023-12-21 12:59:37 -08:00 |
|
Daniel Strobusch
|
43b6522af2
|
rename --model_path to --model-path (#151)
use same argument convention for mistral/mixtral as for llama convert.
|
2023-12-21 06:28:57 -08:00 |
|
Awni Hannun
|
27c0a8c002
|
Add llms subdir + update README (#145)
* add llms subdir + update README
* nits
* use same pre-commit as mlx
* update readmes a bit
* format
|
2023-12-20 10:22:25 -08:00 |
|