Prince Canuma
|
ad93729dce
|
Update llms/mlx_lm/models/cohere2.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
|
2025-01-12 15:23:19 +01:00 |
|
Prince Canuma
|
1107364c3a
|
fix mask shape error (long context)
|
2025-01-11 21:56:00 +01:00 |
|
Alex Barron
|
d4ef909d4a
|
Length masking for batch inputs (#1173)
* length masking
* add mask to mlx_lm model interface
* remove lengths
* fix test:
* comment + fix
|
2024-12-18 19:43:52 -08:00 |
|
Prince Canuma
|
dfa4dd6c93
|
Add support for cohere2 (#1157)
* add support for cohere2
* revert to act_fn to silu
* fix tests and sliding window attention
* add tests
* add to tuner
* fix sliding window
* add coauthor :)
Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com>
* Add rotating kvcache to save space
* some nits
* style
* nits
---------
Co-authored-by: n8programs <43304488+N8python@users.noreply.github.com>
Co-authored-by: N8 <n8@n8programs.com>
Co-authored-by: Awni Hannun <awni@apple.com>
|
2024-12-16 08:01:03 -08:00 |
|