mlx-examples/llms/mlx_lm
Baptiste Canton 42672f5446
add an option to apply the tokenizer chat template (#338)
* add an option to apply the tokenizer chat template

* fix the option to apply the tokenizer chat template

* better error messages for chat template issues

* apply the chat template by default when possible

* nit in comment'

* rebase

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-22 19:52:42 -08:00
..
models refactor(qwen): moving qwen into mlx-lm (#312) 2024-01-22 15:00:07 -08:00
__init__.py Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
convert.py chore: fix the convert.py script for weights are not sanitized and support quant for non-32 dimensions (#340) 2024-01-19 21:07:21 -08:00
generate.py add an option to apply the tokenizer chat template (#338) 2024-01-22 19:52:42 -08:00
README.md Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
requirements.txt refactor(qwen): moving qwen into mlx-lm (#312) 2024-01-22 15:00:07 -08:00
UPLOAD.md Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
utils.py refactor(qwen): moving qwen into mlx-lm (#312) 2024-01-22 15:00:07 -08:00

Generate Text with MLX and 🤗 Hugging Face

This an example of large language model text generation that can pull models from the Hugging Face Hub.

For more information on this example, see the README in the parent directory.