zhangyiss/mlx - mlx - Gitea for Geophysics

mirror of https://github.com/ml-explore/mlx.git synced 2025-06-26 02:33:21 +08:00

Author	SHA1	Message	Date
Angelos Katharopoulos	37d98ba6ff	No gil eval (#565 )	2024-01-26 22:03:52 -08:00
Awni Hannun	8993382aaa	Buffer Donation (#519 ) * buffer donation * fix to move shared pointer * format * gpu in place for copy and binary * revert ops test * cpu in place * a little cleanup * remove useless bench	2024-01-26 16:30:33 -08:00
Awni Hannun	07f35c9d8a	Fix a few issues: docs for flatten, erf, dequantize validation (#560 ) * doc flatten * erf doc * check values for dequantize * format	2024-01-26 15:16:46 -08:00
Jagrit Digani	bf17ab5002	Add more checks and clearer error messages to conv operations (#563 ) * Add more checks and clearer error messages to conv operations	2024-01-26 15:13:26 -08:00
Awni Hannun	8fa6b322b9	Compile front-end (#476 ) * fix tests for linux * make a move on compile * basic compile scaffold works * compile binding * clean * fix * fix grad, more tests * basic python tests * fix segfault on python exit * compile works with python closures * fix test * fix python globals bug, and erase * simplify * more cpp tests * bug fix with move function and compile at exit * simplify inputs also * enable and disable compiler * remove simplify * simplify tests use compile now * fix multi-output with compile * clear output tree from cache when function goes out of scope * ../python/src/transforms.cpp * remove closure capture * comments	2024-01-26 13:45:30 -08:00
David Koski	874b739f3c	Fix cache key in RoPE (#561 )	2024-01-26 13:10:02 -08:00
taher	077c1ee64a	QR factorization (#310 ) * add qr factorization --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-26 09:27:31 -08:00
Rifur13	2463496471	[Fix] mx.allclose bug with infinite values (#539 ) * Added isclose op and fixed comparison with inf values * Added 'equal_nan' to match numpy * format * Add test * Update python/src/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * Update python/src/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * Addressed CR comments * Update python/src/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * nits --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-25 20:47:06 -08:00
Awni Hannun	f27ec5e097	More helpful error message in vjp transform + concate bug (#543 ) * more helpful message in vjp transform * fix concatenate on mismatch dims * typo * typo	2024-01-24 09:58:33 -08:00
Awni Hannun	f30e63353a	Minor updates to address a few issues (#537 ) * docs on arg indices return type * arange with nan * undo isort	2024-01-23 22:24:41 -08:00
Hazem Essam	37fc9db82c	Added Adafactor (#415 ) * Added adafactor * Added Adafactor and ran pre-commit * modified operations * Added docstrings * Switched two ops to fix a bug * added underscore for internal functions and removed the plus sign in the last return statment * Removed parameter rms from the optimizer state because its not needed * Added simple MNIST test for Adafactor and temporary training log * remove test files * nits in docs * comment nit --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 15:11:27 -08:00
AtomicVar	755dcf6137	Enable cross_entropy loss to handle dense targets (#517 ) * Enable cross_entropy loss to handle dense targets Dense targets means probabilities or one-hot encodings. * better shape check of weights * nits in docstring --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 12:17:22 -08:00
LeonEricsson	6b4b30e3fc	Common neural network initializers `nn.initializers` (#456 ) * initial commit: constant, normal, uniform * identity, glorot and he initializers * docstrings * rm file * nits * nits * nits * testing suite * docs * nits in docs * more docs * remove unused template * rename packakge to nn.innit * docs, receptive field * more docs --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 06:47:20 -08:00
Awni Hannun	98c37d3a22	use axes in tensordot (#525 )	2024-01-22 21:17:00 -08:00
Awni Hannun	7a34e46677	Quantize with groups of 32 (#511 ) * allow quantize with group sizes of 32 * missing cpu dispatch * remove print * Fix qvm for group_size 32 --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-01-21 06:19:05 -08:00
Awni Hannun	d52383367a	format (#510 )	2024-01-20 10:33:46 -08:00
Arda Orçun	363d3add6d	Add ValuError message for Adamax (#508 ) * ValuError message added * beta errors added * some corrections and testing * Learning rate limitation deleted	2024-01-20 07:56:15 -08:00
Awni Hannun	b207c2c86b	Power VJP fix for 0 (#505 )	2024-01-20 01:17:40 -08:00
Awni Hannun	6bf779e72b	fix array from list for > 32 bit types (#501 )	2024-01-19 15:49:25 -08:00
Juarez Bochi	ddf50113c5	GGUF: Load and save metadata (#446 ) * gguf metadata --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-19 14:06:05 -08:00
Anchen	f6feb61f92	feat: add support for saving safetensors in the `save_weights` (#497 ) * feat: add save safetensors support in module save_weights * chore: checking missing changes * Update python/mlx/nn/layers/base.py Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * chore: update docstring for load_weights --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-01-19 06:19:33 -08:00
Awni Hannun	c4ec836523	fix isinf for integer types (#494 )	2024-01-19 05:31:10 -08:00
AtomicVar	550d4bf7c0	Update binary_cross_entropy function to handle both logits and probabilities (#492 )	2024-01-18 19:22:23 -08:00
Ethan	a749a91c75	Support disable metal buffer cache to prevent performance degradation caused by large memory caching (#390 ) * support disable metal buffer cache, due to large unused memory buffered when llm generated long context tokens * Run format and add "cache_enabled" feature tests	2024-01-18 08:33:34 -08:00
toji	49a52610b7	Added formatter structure and a boolean value formatter (#354 ) * added formatter structure and a boolean value formatter --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-18 07:49:41 -08:00
AtomicVar	d1fef34138	Add Gaussian NLL loss function (#477 ) * Add Gaussian NLL loss function --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-18 06:44:44 -08:00
Angelos Katharopoulos	9c111f176d	Fix split optimization for array iterator (#484 )	2024-01-18 05:50:25 -08:00
Angelos Katharopoulos	90c234b7ac	Fix round to round half-cases to even (#482 )	2024-01-17 15:27:23 -08:00
Jagrit Digani	78102a47ad	Update GEMM (#424 ) * Organize and collect metal subroutine templates and elements in `metal/kernels/steel/` * Update gemm elements for better performance * Add split-K specialization for gemm * Add `addmm` primitive, op and bindings for fused matmul and bias addition * Update tests and benchmarks as needed	2024-01-17 12:42:39 -08:00
Awni Hannun	a2bf7693dd	Primitive's VJP takes outputs as input (#475 ) Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-01-16 19:03:53 -08:00
Angelos Katharopoulos	d8fabaa12b	Split multi output (#461 ) * Multi-output split primitive * Add the multi-output split to the ArrayIterator * Add some grad tests for split	2024-01-16 13:33:55 -08:00
Yashraj Singh	e72458a3fa	implemented isposinf and isneginf in one PR (#470 ) * ran precommit * updated docs	2024-01-16 06:48:07 -08:00
Angelos Katharopoulos	c15fe3e61b	Allow arbitrary first dimension in quantization kernels. (#458 ) * Allow arbitrary first dim on qmm_t and qmv * Allow arbitrary first dim on qmm and qvm * Specialized aligned vs unaligned case * Add more checks for valid quantizations	2024-01-16 00:46:21 -08:00
Tristan Bilot	f44c132f4a	Add scatter_min VJP (#462 )	2024-01-16 00:37:40 -08:00
Matthew Ernst	92a2fdd577	Adds isinf (#445 ) * adds isinf Signed-off-by: matthewfernst <matthew.f.ernst@gmail.com> * use stream + nits * typo --------- Signed-off-by: matthewfernst <matthew.f.ernst@gmail.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-15 19:50:44 -08:00
Tristan Bilot	6022d4129e	scatter_max vjp + bindings + tests (#431 ) Co-authored-by: DjamelMesbah <djamel.mesbah@adservio.fr>	2024-01-14 14:12:15 -08:00
Awni Hannun	41cc7bdfdb	Fix stub generation, change graph exporting for arrows to go to outputs (#455 )	2024-01-14 14:06:16 -08:00
Diogo	2e29d0815b	Add tile op (#438 )	2024-01-12 23:03:16 -08:00
Awni Hannun	1b71487e1f	docs (#444 )	2024-01-12 13:34:16 -08:00
Ayush Shridhar	1416e7b664	Add isnan (#423 )	2024-01-12 11:16:48 -08:00
davidkoski	29081204d1	array.swapaxes should point to swapaxes free function (#441 )	2024-01-12 11:06:16 -08:00
Avikant Srivastava	975e265f74	feat: Add numpy constants (#428 ) * add numpy constants * feat: add unittests * add newaxis * add test for newaxis transformation * refactor	2024-01-11 06:47:29 -08:00
Awni Hannun	3b4f066dac	Correct types for vjp + tests (#418 ) * correct types for vjp + tests * fix build + comment	2024-01-10 13:32:37 -08:00
Juarez Bochi	b7f905787e	GGUF support (#350 ) * Initial GGUF support for tensor fields. --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-10 13:22:48 -08:00
Chunyang Wen	e3e933c6bc	Add type hint for Module (#412 )	2024-01-10 11:23:42 -08:00
Awni Hannun	1d90a76d63	in place ops behave in place, fix some overloads (#411 )	2024-01-09 16:05:38 -08:00
Angelos Katharopoulos	961435a243	Scatter vjp (#394 ) * Add a first scatter vjp * Implement the scatter_add vjp * Add array.at to implement user friendly scatters	2024-01-09 13:36:51 -08:00
Awni Hannun	e9ca65c939	Fix BN stats to not expand shape (#409 ) * fix BN stats to not expand shape * nit	2024-01-09 11:54:51 -08:00
Awni Hannun	f099ebe535	Multi output primitives (#330 ) * Multi-output primitives --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-01-08 16:39:08 -08:00
YUN, Junwoo	0b8aeddac6	Additoinal losses (#336 ) * cosine similarity loss --------- Co-authored-by: Awni Hannun <awni@apple.com> * Docstring nits	2024-01-08 14:01:13 -08:00

1 2 3 4

168 Commits