mlx/python at d15fa13daf23f2c87c8071a241980c39388492a5 - mlx

mirror of https://github.com/ml-explore/mlx.git synced 2025-08-08 18:16:41 +08:00

History

Alex Barron d15fa13daf Batched Quantized Matmul + Fast Small QMV (#1503 ) * add fast qmv for small dims * fix test * batched cpu * add batched template param * refactor metal quantized.cpp		2024-10-21 16:23:17 -07:00
..
mlx	fix submodule stubs (#1492 )	2024-10-15 16:23:37 -07:00
src	fix gumbel (#1495 )	2024-10-17 13:52:39 -07:00
tests	Batched Quantized Matmul + Fast Small QMV (#1503 )	2024-10-21 16:23:17 -07:00