mlx-examples/d.py at 8fb82fee43634d7c67124162e51f07bf427294bc - mlx-examples - Gitea for Geophysics

zhangyiss/mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-08-30 02:53:41 +08:00

L Lllvvuu 8fb82fee43

Merge branch 'main' into feat/batch_generate

2024-10-09 15:13:12 -04:00

12 lines

283 B

Python

Raw Blame History

 import mlx_lm
 model, tokenizer = mlx_lm.load("/Users/llwu/models/mlx/Meta-Llama-3.1-8B-4bit")
 for s in mlx_lm.stream_generate(
     model,
     tokenizer,
     prompt=["Meta Llama 3.1 is a ", "Google Gemma 2 is a "],
     max_tokens=20,
 ):
     print(s[0].ljust(30) + s[1], flush=True)