mlx/tests/metal_tests.cpp at 1a48713d32268ba9ffaa8ff744c55e9fce9356a8

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-16 01:49:05 +08:00

Files

Juarez Bochi 4fe2fa2a64 GGUF: Avoid dequantization when format is compatible (#426 )

* GGUF: Don't dequantize q4_1

* Fix weight order. First in low bits

* Add unpacking for q4_0

* Don't dequantize q8_0

* rebase quants and split file

* don't quantize every weight

* reapply patch

* error handling

---------

Co-authored-by: Awni Hannun <awni@apple.com>

2024-01-23 15:43:57 -08:00

14 KiB

Raw Blame History

View Raw

14 KiB Raw Blame History

14 KiB

Raw Blame History