refactor(qwen): moving qwen into mlx-lm (#312)

* refactor(qwen): moving qwen into mlx-lm

* chore: update doc

* chore: fix type hint

* add qwen model support in convert

* chore: fix doc

* chore: only load model in quantize_model

* chore: make the convert script only copy tokenizer files instead of loading and re-saving them

* chore: update docstring

* chore: remove unnecessary try catch

* chore: clean up tokenizer handling and update transformers to 4.37

* nits in README

---------

Co-authored-by: Awni Hannun <awni@apple.com>
Anchen
2024-01-22 15:00:07 -08:00
committed by GitHub
parent de15532da8
commit 30be4c4734
8 changed files with 80 additions and 309 deletions


@@ -1,4 +1,4 @@
mlx
numpy
-transformers
+transformers>=4.37.0
protobuf