mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-08-12 12:16:44 +08:00

Gökdeniz Gülmez 2c1c9e9024 MiniCPM implementation (#685)	2024-04-25 15:29:28 -07:00
* Added support for the MiniCPM architecture
* Updated utils.py and LORA.md
* Update implementation details for MiniCPM architecture
* Cleaning up
* fixed the missing lm.head layer problem
* Refactor Model class to dynamically handle tied and untied word embeddings
* Quick update
* added a dynamic rope scaling base calculation
* quick fix and clean up
* clean up again
* removed the MiniCPMNorm class as it's not used
* forgot something, sorry
* format
* version bump
---------
Co-authored-by: Awni Hannun <awni@apple.com>
__init__.py	Mlx llm package (#301)	2024-01-12 10:25:56 -08:00
base.py	Mlx llm package (#301)	2024-01-12 10:25:56 -08:00
cohere.py	Quantize embedding / Update quantize API (#680)	2024-04-18 18:16:10 -07:00
dbrx.py	- Removed unused Python imports (#683)	2024-04-16 07:50:32 -07:00
gemma.py	Quantize embedding / Update quantize API (#680)	2024-04-18 18:16:10 -07:00
llama.py	Switch to fast RMS/LN Norm (#603)	2024-03-23 07:13:51 -07:00
minicpm.py	MiniCPM implementation (#685)	2024-04-25 15:29:28 -07:00
mixtral.py	Fix argpartition call in Mixtral and other MOES (#676)	2024-04-12 11:00:56 -07:00
olmo.py	Quantize embedding / Update quantize API (#680)	2024-04-18 18:16:10 -07:00
phi3.py	Add support for phi-3 (#712)	2024-04-23 09:20:00 -07:00
phi.py	Switch to fast RMS/LN Norm (#603)	2024-03-23 07:13:51 -07:00
phixtral.py	Fix argpartition call in Mixtral and other MOES (#676)	2024-04-12 11:00:56 -07:00
plamo.py	Configurable LR schedulers (#604)	2024-03-29 13:41:10 -07:00
qwen2_moe.py	Add support for qwen2moe (#640)	2024-04-02 11:33:29 -07:00
qwen2.py	Quantize embedding / Update quantize API (#680)	2024-04-18 18:16:10 -07:00
qwen.py	Switch to fast RMS/LN Norm (#603)	2024-03-23 07:13:51 -07:00
stablelm.py	Stable lm 2 (#666)	2024-04-08 14:18:55 -07:00
starcoder2.py	Quantize embedding / Update quantize API (#680)	2024-04-18 18:16:10 -07:00