mlx-examples/llms/mlx_lm
Anchen 30be4c4734
refactor(qwen): moving qwen into mlx-lm (#312)
* refactor(qwen): moving qwen into mlx-lm

* chore: update doc

* chore: fix type hint

* add qwen model support in convert

* chore: fix doc

* chore: only load model in quantize_model

* chore: make the convert script only copy tokenizer files instead of load it and save

* chore: update docstring

* chore: remove unnecessary try catch

* chore: clean up for tokenizer and update  transformers 4.37

* nits in README

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-22 15:00:07 -08:00
..
models refactor(qwen): moving qwen into mlx-lm (#312) 2024-01-22 15:00:07 -08:00
__init__.py Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
convert.py chore: fix the convert.py script for weights are not sanitized and support quant for non-32 dimensions (#340) 2024-01-19 21:07:21 -08:00
generate.py refactor(qwen): moving qwen into mlx-lm (#312) 2024-01-22 15:00:07 -08:00
README.md Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
requirements.txt refactor(qwen): moving qwen into mlx-lm (#312) 2024-01-22 15:00:07 -08:00
UPLOAD.md Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
utils.py refactor(qwen): moving qwen into mlx-lm (#312) 2024-01-22 15:00:07 -08:00

Generate Text with MLX and 🤗 Hugging Face

This an example of large language model text generation that can pull models from the Hugging Face Hub.

For more information on this example, see the README in the parent directory.