Prince Canuma
|
bf2da36fc6
|
Fix Cohere2: mask shape error (long context) (#1202)
* fix mask shape error (long context)
* Update llms/mlx_lm/models/cohere2.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* revert layer_idx
* black formatting
* Update cohere2.py
* format
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
Co-authored-by: Awni Hannun <awni@apple.com>
|
2025-01-12 12:58:08 -08:00 |
|
Alex Barron
|
d4ef909d4a
|
Length masking for batch inputs (#1173)
* length masking
* add mask to mlx_lm model interface
* remove lengths
* fix test:
* comment + fix
|
2024-12-18 19:43:52 -08:00 |
|
Prince Canuma
|
dfa4dd6c93
|
Add support for cohere2 (#1157)
* add support for cohere2
* revert to act_fn to silu
* fix tests and sliding window attention
* add tests
* add to tuner
* fix sliding window
* add coauthor :)
Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com>
* Add rotating kvcache to save space
* some nits
* style
* nits
---------
Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com>
Co-authored-by: N8 <n8@n8programs.com>
Co-authored-by: Awni Hannun <awni@apple.com>
|
2024-12-16 08:01:03 -08:00 |
|