mirror of
https://github.com/ml-explore/mlx-examples.git
synced 2025-06-28 03:41:17 +08:00
* Added support for the MiniCPM architecture * Updated utils.py and LORA.md * Update implementation details for MiniCPM architecture * Cleaning up * fixed the missing lm.head layer problem * Refactor Model class to dynamically handle tied and untied word embeddings * Quick update * added a dynamic rope scaling base calculation * quick fix and clean up * clean up again * removed the MiniCPMNorm class as it's not used * forgot something, sorry * format * version bump --------- Co-authored-by: Awni Hannun <awni@apple.com> |
||
---|---|---|
__init__.py | ||
base.py | ||
cohere.py | ||
dbrx.py | ||
gemma.py | ||
llama.py | ||
minicpm.py | ||
mixtral.py | ||
olmo.py | ||
phi3.py | ||
phi.py | ||
phixtral.py | ||
plamo.py | ||
qwen2_moe.py | ||
qwen2.py | ||
qwen.py | ||
stablelm.py | ||
starcoder2.py |