zhangyiss/mlx - mlx - Gitea for Geophysics

mirror of https://github.com/ml-explore/mlx.git synced 2025-12-16 01:49:05 +08:00

Author	SHA1	Message	Date
Awni Hannun	7064fed1b1	Minor update on MPI docs (#1805 )	2025-01-28 11:00:08 -08:00
Awni Hannun	516ded618b	Dynamic slicing (#1741 ) * dynamic slice and slice update * python bindings + tests + fix set item * fix compile issue * comment * fix jit	2025-01-07 14:02:16 -08:00
Awni Hannun	b51d70a83c	export docs (#1747 )	2025-01-03 15:04:17 -08:00
Awni Hannun	c9d30aa6ac	MLX in C++ example (#1736 ) * MLX in C++ example * nits * fix docs	2025-01-02 19:09:04 -08:00
Awni Hannun	ae69cb15e9	shapeless compile in docs and partially shapeless reshape (#1742 )	2025-01-02 16:24:42 -08:00
Danilo Peixoto	92ec632ad5	Fix Distributed Communication documentation (#1731 ) * Add missing `size()` method call for group	2025-01-02 14:08:38 -08:00
Awni Hannun	4ba0c24a8f	Export / import functions to / from a file (#1642 ) * export and import functions * refactor + works for few primitives * nit * allow primitives with state * nit * nit * simplify serialize / deserialize * fix for constants * python bindings * maybe fix serialize failure case * add example * more primitives, training kind of works * same result for python and c++ * some fixes * fix export * template it up * some simplificatoin * rebase * allow kwargs and multiple functions * exporter * more primitives for exporting * deal with endianness * handle invalid stream * add docstring	2024-12-24 11:19:13 -08:00
Awni Hannun	c3628eea49	Add `mx.finfo` and use it when making causal mask (#1726 ) * finfo * fixes * docs	2024-12-19 14:52:41 -08:00
Cheng	f9640e049d	Install mlx.dll into the same dir with python bindings on Windows (#1690 ) * Install mlx.dll into the same dir with python bindings on Windows * Set BUILD_SHARED_LIBS for dlfcn-win32 * Update cmake requirements to 3.25 * Fix cmake style	2024-12-13 19:50:39 -08:00
Awni Hannun	4e1e9520e1	Flatten and unflatten (#1692 ) * flatten and unflatten * fix grad * fix shape infer * use squeeze + unsqueeze in get_item	2024-12-11 21:51:37 -08:00
Awni Hannun	40c62c1321	Use int64 stride everywhere (#1671 ) * use int64 stride everywhere * fix ext * fix ext * more shape + cleanup * one more * few more	2024-12-09 11:09:02 -08:00
Awni Hannun	7cbb4aef17	Doc fix (#1615 )	2024-11-22 11:12:25 -08:00
Alex Barron	c79f6a4a8c	3 and 6 bit quantization (#1613 ) * Support 3 and 6 bit quantization	2024-11-22 10:22:13 -08:00
Cocoa	0d5e7716ad	fix typo: accross -> across (#1609 ) Signed-off-by: Cocoa <i@uwucocoa.moe>	2024-11-20 15:30:51 -08:00
Awni Hannun	2419edd5b2	Faster indexing math in a few kernels (#1589 ) * wip: faster compiled kernels * faster general unary with uint specialization * index type in compiled, unary, binary, ternary, copy * fix jit * jit fix * specialize gather + scatter * nit in docs	2024-11-18 19:52:00 -08:00
Awni Hannun	9f0d5c12fc	Fully wrap the command encoder (#1572 ) * fully wrap the command encoder * use consistent style + fix extensions	2024-11-08 11:50:21 -08:00
Chris Offner	46d8b16ab4	Fix vmap example in docs (#1556 )	2024-11-02 17:44:14 -07:00
Chris Offner	42533931fa	Fix typo "it's" -> "its" (#1555 )	2024-11-02 06:06:34 -07:00
Alex Barron	9e516b71ea	Add dispatchThreads to custom kernel doc (#1551 ) * add dispatchThreads info * update * add link	2024-11-01 13:07:48 -07:00
Léo	917252a5a1	Add favicon to docs (#1545 ) * add sphinx's html_favicon config * removed unneeded newline * ran pre-commit hooks	2024-10-30 13:54:13 -07:00
Awni Hannun	0eb56d5be0	Wired (#1510 ) * expose residency sets as wire/unwire * returns wired size * fix * runtime support check * fix os check * fix test * fix no metal build * docs * nit * nits in docs * nits	2024-10-25 09:35:33 -07:00
Paul Hansel	f70764a162	Fix typo in build docs (#1522 )	2024-10-24 20:55:06 -07:00
Kashif Rasul	3ddc07e936	Eigenvalues and eigenvectors (#1334 ) * initial eigvalsh * add compute_vectors * add compute_vectors_ * return a pair * add eigh to return only eigenvectors * fixed typo * merge merge Eighvalsh and Eigh into a single primitive * use the same primate with the flag * fix primatives * use MULTI * fix eval_gpu * fix decleration * rename EighPrimitive to Eigh * tests * tests * fix rebase and format * cleanup lapack * format * add cblas.h --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-10-22 12:18:48 -07:00
Awni Hannun	c26208f67d	Remove Hazard tracking with Fences (#1509 ) * remove hazard tracking * with fence map * no hazard tracking with fences * nits * fix fence retain * cleanup * fix quantized rebase	2024-10-21 19:33:32 -07:00
Awni Hannun	343aa46b78	No more 3.8 (#1493 )	2024-10-16 17:51:38 -07:00
Awni Hannun	3f86399922	Real and Imag (#1490 ) * real and imag * fix * fix	2024-10-15 16:23:15 -07:00
Awni Hannun	e1c9600da3	Add `mx.random.permutation` (#1471 ) * random permutation * comment	2024-10-08 19:42:19 -07:00
Angelos Katharopoulos	9b12093739	Add the roll op (#1455 )	2024-10-07 17:21:42 -07:00
Awni Hannun	195b429d99	Put along axis + fixe for partition grad (#1430 ) * put along axis, fixes for partition grad * zeros for arg reduce	2024-09-23 10:03:38 -07:00
Nripesh Niketan	6af5ca35b2	feat: add cross_product (#1252 ) * feat: add cross_product * lint * python binding * refactor: Improve error message for cross_product function * refactor: more close to numpy cross product * refactor: improve error message for cross_product function * finish * fix acks * allow old numpy * doc --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-17 13:12:43 -07:00
Awni Hannun	4f46e9c997	More fixes for arrays with large sizes (#1405 ) * compile works for big arrays when contiguous * style * nits in docs * a bunch more stuff * update jit * update jit * use constant for shapes and strides and remove elem_to_loc overload * use kernel instantiation * docs nits * update binary and ternary * comments	2024-09-17 12:46:31 -07:00
Awni Hannun	02efb310ca	Xcode 160 (#1384 ) * xcode 16.0 with debug tests * limit nproc for builds * vmap bug * assert bug * run python tests in debug mode * fix view, bool copies preserve bits' * actual view fix	2024-09-10 15:15:17 -07:00
Max-Heinrich Laves	efeb9c0f02	Transposed Convolution (#1245 ) * initial implementation for conv_transpose ran pre-commit implemented conv_transpose updated conv_general docstring updated conv_general docstring updated code comments removed commented run_conv_checks updated acknowledgments added missing entry to ops.rst added op to nn.layers resolved merge conflicts * removed ConvolutionTranspose primitive as suggested by reviewer removed ConvolutionTranspose primitive as suggested by reviewer * remove transpose flag, add another test --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-06 19:52:38 -07:00
Awni Hannun	ba3e913c7a	Simplifications for MLX C (#1396 ) * simplifications for MLX C * use vectors instead of map * update examples	2024-09-06 19:16:50 -07:00
Awni Hannun	9592766939	add std as method (#1387 ) * add std as method * add std as method	2024-09-01 19:49:16 -07:00
Angelos Katharopoulos	cdb59faea6	Adds send/recv ops in distributed (#1366 )	2024-08-26 23:01:37 -07:00
Alex Barron	b96e105244	Add `grid_sample` example to `metal_kernel` docs (#1352 ) * Add `zero_outputs` and `atomic_outputs` options to `metal_kernel` * add grid sample to docs * zero_outputs -> init_value * add missing header for linux	2024-08-23 18:24:16 -07:00
Awni Hannun	3b4d5484c7	Bump extension MLX version (#1350 ) * Bump extension MLX version * fix some docs nits	2024-08-23 12:38:34 -07:00
Alex Barron	0fd2a1f4b0	Custom Metal Kernels from Python (#1325 ) * start * simple kernels working * restructure * inverse example working * docs + fixes * missing file * fix imports * address comments * add docs + fix test * Review comments + refactor to a single function * update docs * remove hashing * fix contig bug in test * back to a class * trailing whitespace * fix tests * match c++ and python apis * add link + make args kw_only	2024-08-22 13:46:29 -07:00
Alex Barron	32668a7317	CPU mx.linalg.cholesky_inverse and mx.linalg.tri_inv (#1307 ) * add cholesky inv + tri inv * always run tri_inv on cpu * consistent naming	2024-08-08 15:18:02 -07:00
Awni Hannun	30bbea2f08	Add gemv masked to JIT plus some fixes (#1310 ) * add gemv masked to JIT plus some fixes * some cleanup * add utils * fix * fix 2 * more cleaning * fix * remove unused mps matmul support * one more nit * revert	2024-08-07 13:38:07 -07:00
Awni Hannun	aa1d6cadad	Fix docs latex build and nits (#1297 ) * fix docs latex build and nits * fix stub gen and try to clean up building	2024-07-29 11:44:06 -07:00
Yaroslav	8cfb9fc0b8	Update requirements.txt (#1291 )	2024-07-26 12:59:52 -07:00
Anton Belov	5029894662	[Issue #1187 ] Add nan_to_num function initial attempt (#1247 ) * initial attempt, working with wrong types * not compiling; mx.float16 and mx.bfloat16 tests added * fix nan to num * nit --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-07-25 09:57:37 -07:00
Awni Hannun	baf9fa5f42	Einsum (#1269 ) * einsum initial * fix comma break * sum axis was wrong * small cleanups * python binding * changed bindings to resemble numpy * remove todo comment * comment changes * add count of operands/inputs * fail fast if operands list is empty * ignore comma if no output * einsum path matching numpy * getting somewhere with path * remove print * it passes the first test * moved einsum tests to seperate file * seperated einsum path * moved einsum naive * remove space from equation * fast fail if no operands passed * update tests and remove printf * small cleanup * some more cleanups * removed python helper file * ack * utilize std for finding min in vector * duplicate def * remove the tuple as it was unreadable * moved einsum_naive back to ops * remaining isn't needed * avoid creating another set * cleanup * greedy path, start of naive einsum * more einsum * fix some bugs * some more fixes, tests pass * benchmark * some simplify * fix einsum and test Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com> * add a bunch more tests and fix a bunch more bugs * some docs nits --------- Co-authored-by: dc-dc-dc <dgcruz983@gmail.com> Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-07-25 09:36:44 -07:00
fgranqvist	50eff6a10a	Implement sampling from laplace distribution. (#1279 )	2024-07-24 15:15:37 +02:00
Awni Hannun	8c01a7893b	minor fix in optimizer + docs (#1264 )	2024-07-12 12:18:02 -07:00
Awni Hannun	218047c75a	docs fixes (#1263 )	2024-07-11 15:59:07 -07:00
Angelos Katharopoulos	5c1fa64fb0	Custom transforms (#1246 )	2024-07-10 18:00:01 -07:00
Alex Barron	a3c287354f	Fast Hadamard Transform (#1249 ) * Working hadamard for powers of 2 * working for m2^k add scale and check contiguity * add size check * clean up * fix test * add grads + vmap * gpu only * skip on linux * test typo * add cpu impl * remove gpu only tests * fix linux build + add is_equivalent	2024-07-09 20:39:01 -07:00

197 Commits