mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-10-29 01:18:08 +08:00

Files

Gökdeniz Gülmez 2c1c9e9024 MiniCPM implementation (#685)

* Added support for the MiniCPM architecture

* Updated utils.py and LORA.md

* Update implementation details for MiniCPM architecture

* Cleaning up

* fixed the missing lm.head layer problem

* Refactor Model class to dynamically handle tied and untied word embeddings

* Quick update

* added a dynamic rope scaling base calculation

* quick fix and clean up

* clean up again

* removed the MiniCPMNorm class as it's not used

* forgot something, sorry

* format

* version bump

---------

Co-authored-by: Awni Hannun <awni@apple.com>

2024-04-25 15:29:28 -07:00

__init__.py

feat: move lora into mlx-lm (#337)

2024-01-23 08:44:37 -08:00

datasets.py

Support for OpenAI’s fine-tuning dataset format (#548)

2024-03-19 16:45:46 -07:00

lora.py

cast around lora adapters (#613)

2024-03-24 19:34:51 -07:00

trainer.py

Quantize embedding / Update quantize API (#680)

2024-04-18 18:16:10 -07:00

utils.py

MiniCPM implementation (#685)

2024-04-25 15:29:28 -07:00