feat(mlx-lm): export the GGUF (fp16) format model weights from fuse.py (#555)

* wip
* wip
* feat: convert mlx model to gguf f16
* chore: convert norm layer to float32 to avoid overflow issue
* chore: add support for mixtral
* chore: clean up
* chore: remove unused import statement
* chore: clean up weight name mapping
* version and readme
* actual version bump

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2025-12-16 02:08:55 +08:00 · 2024-03-22 04:34:11 +11:00
parent 8f906c859a
commit fe96ef342f
4 changed files with 351 additions and 6 deletions
--- a/llms/mlx_lm/version.py
+++ b/llms/mlx_lm/version.py
@@ -1,3 +1,3 @@
 # Copyright © 2023-2024 Apple Inc.

-__version__ = "0.3.0"
+__version__ = "0.4.0"