mlx/tests
Juarez Bochi 4fe2fa2a64
GGUF: Avoid dequantization when format is compatible (#426)
* GGUF: Don't dequantize q4_1

* Fix weight order. First in low bits

* Add unpacking for q4_0

* Don't dequantize q8_0

* rebase quants and split file

* don't quantize every weight

* reapply patch

* error handling

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-23 15:43:57 -08:00
..
allocator_tests.cpp Metal validation (#432) 2024-01-11 11:57:24 -08:00
arg_reduce_tests.cpp Metal validation (#432) 2024-01-11 11:57:24 -08:00
array_tests.cpp Make array conform to the Python Buffer Protocol (#323) 2024-01-05 15:58:33 -08:00
autograd_tests.cpp Split multi output (#461) 2024-01-16 13:33:55 -08:00
blas_tests.cpp copyright + ack 2023-11-30 11:12:53 -08:00
CMakeLists.txt linalg.norm (#187) 2023-12-26 19:42:04 -08:00
creations_tests.cpp Spelling (#342) 2024-01-01 21:08:17 -08:00
device_tests.cpp copyright + ack 2023-11-30 11:12:53 -08:00
eval_tests.cpp Removes the retain_graph flag (#385) 2024-01-07 15:16:51 -08:00
fft_tests.cpp copyright + ack 2023-11-30 11:12:53 -08:00
graph_optimize_tests.cpp Multi output primitives (#330) 2024-01-08 16:39:08 -08:00
linalg_tests.cpp Multi output primitives (#330) 2024-01-08 16:39:08 -08:00
load_tests.cpp GGUF: Load and save metadata (#446) 2024-01-19 14:06:05 -08:00
metal_tests.cpp GGUF: Avoid dequantization when format is compatible (#426) 2024-01-23 15:43:57 -08:00
ops_tests.cpp Fix round to round half-cases to even (#482) 2024-01-17 15:27:23 -08:00
random_tests.cpp random.uniform must respect dtype, even if lower precision than "low" (#280) 2023-12-24 07:04:43 -08:00
scheduler_tests.cpp copyright + ack 2023-11-30 11:12:53 -08:00
tests.cpp copyright + ack 2023-11-30 11:12:53 -08:00
utils_tests.cpp Added mx.stack c++ frontend impl (#123) 2023-12-14 13:21:19 -08:00
vmap_tests.cpp Removes the retain_graph flag (#385) 2024-01-07 15:16:51 -08:00