mlx/mlx/backend/metal
Awni Hannun 8993382aaa
Buffer Donation (#519)
* buffer donation

* fix to move shared pointer

* format

* gpu in place for copy and binary

* revert ops test

* cpu in place

* a little cleanup

* remove useless bench
2024-01-26 16:30:33 -08:00
..
kernels Fix oob reads in gemv kernel (#523) 2024-01-22 12:06:04 -08:00
mps copyright + ack 2023-11-30 11:12:53 -08:00
allocator.cpp Support disable metal buffer cache to prevent performance degradation caused by large memory caching (#390) 2024-01-18 08:33:34 -08:00
allocator.h Memory allocation (#292) 2024-01-02 11:59:19 -08:00
CMakeLists.txt An initial quantized matmul implementation (#205) 2023-12-18 23:18:57 -08:00
conv.cpp Update GEMM (#424) 2024-01-17 12:42:39 -08:00
copy.cpp Buffer Donation (#519) 2024-01-26 16:30:33 -08:00
copy.h copyright + ack 2023-11-30 11:12:53 -08:00
device.cpp bump pre commit and fix format (#373) 2024-01-04 16:28:52 -08:00
device.h Revisit autorelease memory pools (#260) 2023-12-22 11:01:26 -08:00
fft.cpp copyright + ack 2023-11-30 11:12:53 -08:00
indexing.cpp Metal validation (#432) 2024-01-11 11:57:24 -08:00
matmul.cpp Update GEMM (#424) 2024-01-17 12:42:39 -08:00
matmul.h Update GEMM (#424) 2024-01-17 12:42:39 -08:00
metal.cpp Buffer Donation (#519) 2024-01-26 16:30:33 -08:00
metal.h Support disable metal buffer cache to prevent performance degradation caused by large memory caching (#390) 2024-01-18 08:33:34 -08:00
primitives.cpp Buffer Donation (#519) 2024-01-26 16:30:33 -08:00
quantized.cpp Allow arbitrary first dimension in quantization kernels. (#458) 2024-01-16 00:46:21 -08:00
reduce.cpp Metal validation (#432) 2024-01-11 11:57:24 -08:00
scan.cpp copyright + ack 2023-11-30 11:12:53 -08:00
softmax.cpp copyright + ack 2023-11-30 11:12:53 -08:00
sort.cpp Spelling (#342) 2024-01-01 21:08:17 -08:00
utils.h Spelling (#342) 2024-01-01 21:08:17 -08:00