mlx/tests at 3de8ce3f3c81612962835b00cfa7e461a12b35b2 - mlx

mirror of https://github.com/ml-explore/mlx.git synced 2025-06-25 01:41:17 +08:00

History

Brian Keene 1865299a30 Metal shaders for memory efficient self attention on large sequences (#964 ) * Metal shaders for efficient self attention on large sequences Updated fast attention: GEMM-ified with Steel primitives Uses flash attention 1 for scale correction * more compiler silencing * Address rebase issues * Templatize kernel instantiation, revise cpu bindings * Safer writes to output * Permit batch size > 1 * Numerical fixes for sdpa self attention * Re-enable test, remove unused variable * add benchmarking script * Disable sdpa prior to perf tuning, and simplify tests for per-patch CI		2024-06-03 09:16:19 -07:00
..
mlx_tests.py	Make MLX build on x64 macOS (#901 )	2024-03-27 06:14:29 -07:00
mpi_test_distributed.py	Comms (#1097 )	2024-05-23 17:04:02 -07:00
test_array.py	feat: Added dlpack device (#1165 )	2024-05-31 12:29:01 -07:00
test_autograd.py	stable cumprod grad at 0 (#1167 )	2024-05-31 12:28:42 -07:00
test_bf16.py	bump pre commit and fix format (#373 )	2024-01-04 16:28:52 -08:00
test_blas.py	Fix matvec vector stride bug (#1168 )	2024-05-29 12:18:28 -07:00
test_compile.py	Fix a couple bugs (#1161 )	2024-05-28 15:18:18 -07:00
test_constants.py	feat: Add numpy constants (#428 )	2024-01-11 06:47:29 -08:00
test_conv.py	Add groups to 2-D convolutions (#1129 )	2024-05-22 20:01:44 -07:00
test_device.py	Adds device context manager (#679 )	2024-02-14 14:14:58 -08:00
test_eval.py	Shared events for synchronization + async eval (#998 )	2024-04-17 06:16:02 -07:00
test_fast_sdpa.py	Metal shaders for memory efficient self attention on large sequences (#964 )	2024-06-03 09:16:19 -07:00
test_fast.py	No copy command encoder (#986 )	2024-04-11 21:15:36 -07:00
test_fft.py	Metal FFT for powers of 2 up to 2048 (#915 )	2024-04-11 21:40:06 -07:00
test_graph.py	Multi output primitives (#330 )	2024-01-08 16:39:08 -08:00
test_init.py	Make shape a tuple (#591 )	2024-01-30 13:11:01 -08:00
test_linalg.py	Implemented Cholesky on CPU (#1119 )	2024-05-17 12:31:59 -07:00
test_load.py	Fix logsumexp edge case (#740 )	2024-02-25 08:39:55 -08:00
test_losses.py	Feat: Add weights argument in BCE Loss and tests (#620 )	2024-02-07 09:39:52 -08:00
test_metal.py	Reset peak memory (#1074 )	2024-05-03 17:12:51 -07:00
test_nn.py	Make sure 0 is represented in the quantization (#1016 )	2024-04-19 19:47:26 -07:00
test_ops.py	Fix multi-block sort stride management (#1169 )	2024-05-31 11:10:54 -07:00
test_optimizers.py	Treate 'minimum' differently in cosine decay (#1138 )	2024-05-20 08:00:48 -07:00
test_quantized.py	Rename block sparse (#1149 )	2024-05-22 07:48:34 -07:00
test_random.py	Implementation of mlx.random.multivariate_normal (#502 ) (#877 )	2024-04-09 13:50:12 -07:00
test_reduce.py	Add GPU support for uint64/int64 reductions (#569 )	2024-01-31 11:18:04 -08:00
test_tree.py	Add isort pre-commit and run (#68 )	2023-12-08 11:31:47 -08:00
test_upsample.py	Upsample with bicubic interpolation (#967 )	2024-04-10 15:47:22 -07:00
test_vmap.py	Adds radians and degrees (#1011 )	2024-04-22 11:17:49 -07:00