mlx-examples/llms/mlx_lm/models
Gökdeniz Gülmez 2c1c9e9024
MiniCPM implementation (#685)
* Added support for the MiniCPM architecture

* Added support for the MiniCPM architecture

* Updated utils.py and LORA.md

* Updated utils.py and LORA.md

* Update implementation details for MiniCPM architecture

* Cleaning up

* fixed the missing lm.head layer problem

* Refactor Model class to dynamically handle tied and untied word embeddings

* Quick update

* added a dynamic rope scaling base calucaltion

* Added support for the MiniCPM architecture

* Added support for the MiniCPM architecture

* Updated utils.py and LORA.md

* Updated utils.py and LORA.md

* Update implementation details for MiniCPM architecture

* Cleaning up

* fixed the missing lm.head layer problem

* Refactor Model class to dynamically handle tied and untied word embeddings

* added a dynamic rope scaling base calucaltion

* quick fix and clean up

* clean up again

* removed the MiniCPMNorm class as its not used

* forgot something, sorry

* format

* version bump

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-04-25 15:29:28 -07:00
..
__init__.py Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
base.py Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
cohere.py Quantize embedding / Update quantize API (#680) 2024-04-18 18:16:10 -07:00
dbrx.py - Removed unused Python imports (#683) 2024-04-16 07:50:32 -07:00
gemma.py Quantize embedding / Update quantize API (#680) 2024-04-18 18:16:10 -07:00
llama.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
minicpm.py MiniCPM implementation (#685) 2024-04-25 15:29:28 -07:00
mixtral.py Fix argpartition call in Mixtral and other MOES (#676) 2024-04-12 11:00:56 -07:00
olmo.py Quantize embedding / Update quantize API (#680) 2024-04-18 18:16:10 -07:00
phi3.py Add support for phi-3 (#712) 2024-04-23 09:20:00 -07:00
phi.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
phixtral.py Fix argpartition call in Mixtral and other MOES (#676) 2024-04-12 11:00:56 -07:00
plamo.py Configurable LR schedulers (#604) 2024-03-29 13:41:10 -07:00
qwen2_moe.py Add support for qwen2moe (#640) 2024-04-02 11:33:29 -07:00
qwen2.py Quantize embedding / Update quantize API (#680) 2024-04-18 18:16:10 -07:00
qwen.py Switch to fast RMS/LN Norm (#603) 2024-03-23 07:13:51 -07:00
stablelm.py Stable lm 2 (#666) 2024-04-08 14:18:55 -07:00
starcoder2.py Quantize embedding / Update quantize API (#680) 2024-04-18 18:16:10 -07:00