mlx-examples/llms/mlx_lm/tuner
Prince Canuma dfa4dd6c93
Add support for cohere2 (#1157)
* add support for cohere2

* revert to act_fn to silu

* fix tests and sliding window attention

* add tests

* add to tuner

* fix sliding window

* add coauthor :)

Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com>

* Add rotating kvcache to save space

* some nits

* style

* nits

---------

Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com>
Co-authored-by: N8 <n8@n8programs.com>
Co-authored-by: Awni Hannun <awni@apple.com>
2024-12-16 08:01:03 -08:00
..
__init__.py LoRA: Extract small function (#614) 2024-06-02 06:38:42 -07:00
datasets.py LoRA: Support HuggingFace dataset via data parameter (#996) 2024-09-30 07:36:21 -07:00
dora.py Feature: QDoRA (#891) 2024-09-30 08:01:11 -07:00
lora.py Allow the entire model to be targed for LoRA and DoRA fine tuning: LoRA and DoRA embeddings with small DoRALinear bug fix (#914) 2024-08-16 07:38:36 -07:00
trainer.py [MLX LM] Sampler refactor + a few improvements (#1094) 2024-11-07 16:15:24 -08:00
utils.py Add support for cohere2 (#1157) 2024-12-16 08:01:03 -08:00