Mirror of https://github.com/ml-explore/mlx-examples.git
Synced 2025-06-25 01:41:19 +08:00
* chore: fix the load quantization model
* change to explicitly check for quantization config

| Name |
|---|
| deepseek-coder |
| llama |
| mistral |
| mixtral |
| phi2 |
| qwen |
| speculative_decoding |