mlx-examples/llms/mlx_lm/models
Awni Hannun b8a348c1b8
Switch to fast RMS/LN Norm (#603)
* use nn.RMSNorm, use sdpa, cleanup

* bump mlx versions

* minor update

* use fast layer norm

* version bump

* update requirement for whisper

* update requirement for gguf
2024-03-23 07:13:51 -07:00
..
__init__.py Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
base.py Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
cohere.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
gemma.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
llama.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
mixtral.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
olmo.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
phi.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
phixtral.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
plamo.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
qwen2.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
qwen.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
stablelm.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
starcoder2.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00