feat(mlx_lm): Nemotron (#949)

* feat: Nemotron

https://huggingface.co/nvidia/Minitron-4B-Base

This is basically Llama with partial RoPE and LayerNorm instead of
RMSNorm. Also they add 1 to the LayerNorm weight for some reason.
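For context, here is a minimal MLX sketch of those two tweaks. It is
illustrative only, not the code added in this commit; the class name
LayerNorm1P (borrowed from HF's NemotronLayerNorm1P) and the
rope_percent value are assumptions.

    import mlx.core as mx
    import mlx.nn as nn

    class LayerNorm1P(nn.LayerNorm):
        # Apply (weight + 1) rather than weight, i.e. the "+1" trick above.
        def __call__(self, x: mx.array) -> mx.array:
            return mx.fast.layer_norm(x, self.weight + 1.0, self.bias, self.eps)

    # Partial RoPE: rotate only the first `dims` features of each head and
    # pass the rest through unchanged; nn.RoPE supports this via `dims`.
    head_dim = 128
    rope_percent = 0.5  # assumed value; the real one comes from config.json
    rope = nn.RoPE(int(head_dim * rope_percent), traditional=False, base=10000)

    x = mx.random.normal((1, 8, 16, head_dim))  # (batch, heads, seq, head_dim)
    y = rope(x)  # first 64 dims rotated, last 64 left as-is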

* fixup! feat: Nemotron

* nits

---------

Co-authored-by: Awni Hannun <awni@apple.com>
commit fc93c55723 (parent b1186e2a81)
Date: 2024-08-29 21:08:57 -07:00
2 changed files with 228 additions and 0 deletions


@@ -93,6 +93,7 @@ def linear_to_lora_layers(
         "llama",
         "phi",
         "mixtral",
+        "nemotron",
         "stablelm",
         "qwen2",
         "qwen2_moe",