Files
mlx-examples/llms/mlx_lm/tuner
Prince Canuma d661440dbb Add support for qwen2moe (#640)
* add sparsemoe block and update decoder logic

* update file name to match HF

* update name

* Code formatting

* update gates calculation

* add support for Qwen2MoE.

* fix pytest

* code formatting and fix missing comma in utils

* Remove decoder sparse step.

Co-authored-by: bozheng-hit <dsoul0621@gmail.com>

* remove gate layer anti-quantisation

* remove unused argument

---------

Co-authored-by: bozheng-hit <dsoul0621@gmail.com>
2024-04-02 11:33:29 -07:00
..
2024-01-23 08:44:37 -08:00
2024-03-24 19:34:51 -07:00
2024-04-02 11:33:29 -07:00