mlx-examples/llms/mlx_lm/fuse.py
Anchen 854ad8747a feat(mlx-lm): add de-quant for fuse.py (#365)
* feat(mlx-lm): add de-quant for fuse

* chore: disable quantization in to_linear when de-quant is enabled

* chore: add better error handling for adapter file not found
2024-01-25 18:59:32 -08:00

2.8 KiB