zhangyiss/mlx - mlx - Gitea for Geophysics

mirror of https://github.com/ml-explore/mlx.git synced 2025-07-28 04:39:28 +08:00

Author	SHA1	Message	Date
Awni Hannun	e142aaf8a1	Option for precise softmax (#953 ) * precise softmax * Add an equivalency check * Make the threadgroup memory definition fixed * precise cpu softmax * precise option on cpu * remove print --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-04-04 08:32:35 -07:00
Cheng	90dfa43ff1	Don't use make_unique to create shared_ptr (#902 ) The code compiled because shared_ptr's constructor actually accepts unique_ptr.	2024-03-27 06:13:29 -07:00
Angelos Katharopoulos	6ee1112f30	Fix copy donation and add partial rope (#881 )	2024-03-22 17:28:26 -07:00
nicolov	105d236889	Add vmap for SVD and inverse (#849 )	2024-03-21 13:18:27 -07:00
Jagrit Digani	cec8661113	Add a SliceUpdate op and primitive (#850 ) * Enable copy to work with int64 strides * Fix uniform buffer indices or copy kernel arguments * Update utils.h * Remove manual unrolling of elem to loc loop * GPU copy updated to handle negative strides * Add slice update primitive	2024-03-20 10:39:25 -07:00
Awni Hannun	19ec023256	vmap matmul and admm (#836 )	2024-03-14 14:38:22 -07:00
Angelos Katharopoulos	76c919b4ec	NumberOfElements for shapeless compile and vmap fixes (#802 )	2024-03-13 10:34:14 -07:00
Jagrit Digani	776c3d226d	Convolution update (#651 ) * Init steel conv and update Conv primitive * Update slow CPU implementation to support flipping and input dilation winograd conv routing Co-authored-by: Awni Hannun <awni@apple.com>	2024-02-28 20:11:16 -08:00
Awni Hannun	e6418781ab	Fix logsumexp edge case (#740 ) * fix logsumexp * fix inf constant * also fix power grad * fix ternary dispatch	2024-02-25 08:39:55 -08:00
Rifur13	126c9869c8	Implement the 'where' primitive for conditional selection (#664 )	2024-02-22 15:10:48 -08:00
Awni Hannun	5798256fcf	Shapeless compilation for some graphs (#687 ) * shapeless compilation for some graphs * update compile benchmark * default compile a few activations * buffer donation * bugfix * shapeless fix * update tests to work for cpu and gpu fusion * test kwargs * add kwargs to compile * Recompile when python arguments change * no compile for tanh * some constant tests --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-02-19 21:43:54 -08:00
Jack Mousseau	0925af43b0	Remove unused variables (#706 )	2024-02-18 12:50:10 -08:00
Awni Hannun	e88e474fd1	Reduce vmap + some fixes (#601 )	2024-02-01 11:30:28 -08:00
Angelos Katharopoulos	0de5988f92	Custom VJP and checkpointing (#541 ) * Implement custom_vjp and checkpointing * Add a dependency management primitive * Change the eval order to deep branches first * Add graph depth tracking to the array	2024-01-30 16:04:45 -08:00
Awni Hannun	b207c2c86b	Power VJP fix for 0 (#505 )	2024-01-20 01:17:40 -08:00
Jagrit Digani	78102a47ad	Update GEMM (#424 ) * Organize and collect metal subroutine templates and elements in `metal/kernels/steel/` * Update gemm elements for better performance * Add split-K specialization for gemm * Add `addmm` primitive, op and bindings for fused matmul and bias addition * Update tests and benchmarks as needed	2024-01-17 12:42:39 -08:00
Awni Hannun	a2bf7693dd	Primitive's VJP takes outputs as input (#475 ) Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-01-16 19:03:53 -08:00
Angelos Katharopoulos	d8fabaa12b	Split multi output (#461 ) * Multi-output split primitive * Add the multi-output split to the ArrayIterator * Add some grad tests for split	2024-01-16 13:33:55 -08:00
Tristan Bilot	f44c132f4a	Add scatter_min VJP (#462 )	2024-01-16 00:37:40 -08:00
Tristan Bilot	6022d4129e	scatter_max vjp + bindings + tests (#431 ) Co-authored-by: DjamelMesbah <djamel.mesbah@adservio.fr>	2024-01-14 14:12:15 -08:00
Angelos Katharopoulos	961435a243	Scatter vjp (#394 ) * Add a first scatter vjp * Implement the scatter_add vjp * Add array.at to implement user friendly scatters	2024-01-09 13:36:51 -08:00
Awni Hannun	f099ebe535	Multi output primitives (#330 ) * Multi-output primitives --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-01-08 16:39:08 -08:00
Nripesh Niketan	73321b8097	feat: add logicalAnd and logicalOR (#386 ) * feat: add logicalAnd and logicalOR * run pre-commit * Refactor logical_and and logical_or functions * Add acknowledgement * Add logical AND and logical OR operators * Refactor logical_and and logical_or functions * Add support for logical operators on bool arrays * Update mlx/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * Update mlx/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * Add logical AND and OR operators for arrays and scalars * Refactor vjp and jvp methods in primitives.cpp * Add overloaded operators for logical AND and OR * format --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-08 07:00:05 -08:00
Angelos Katharopoulos	e7f5059fe4	Support for quantized matmul with w and w^T (#349 ) * Add the metal qvm implementation * Add qmm_n * Add gradient wrt to input for quantized_matmul	2024-01-03 14:22:36 -08:00
Angelos Katharopoulos	b3916cbf2b	Improve names of quantization arguments (#235 ) * Change the default quantization group_size to 64 * Rename groups to group_size and width to bits	2023-12-20 16:53:53 -08:00
Angelos Katharopoulos	dfa9f4bc58	An initial quantized matmul implementation (#205 ) * Add quantized matvec * Add quantized matrix matrix with 2nd matrix transposed * Add quantized matmul tests * Add a slow cpu quantized matmul * Add a slightly faster vectorized cpu version	2023-12-18 23:18:57 -08:00
Angelos Katharopoulos	4d4af12c6f	Adds round op and primitive (#203 )	2023-12-18 11:32:48 -08:00
Awni Hannun	e5851e52b1	Add move and swap axis, and vmap for slice, concat, and gather (#158 ) * add move and swap axis, and vmap for slice, concat, and gather	2023-12-14 12:59:12 -08:00
Luca Arnaboldi	b93c4cf378	Floor and Ceil (#150 ) * Implements Floor and Ceil Ops	2023-12-14 10:00:23 -08:00
Angelos Katharopoulos	2b714714e1	Add the remainder op (#85 ) * Add remainder in the C++ backend * Add the python binding and test	2023-12-08 15:08:52 -08:00
Awni Hannun	46a39e5b1f	copyright + ack	2023-11-30 11:12:53 -08:00
Angelos Katharopoulos	d1f86272a2	angelos's commit files	2023-11-29 10:42:59 -08:00

1 2

82 Commits