zhangyiss/mlx - mlx - Gitea for Geophysics

mirror of https://github.com/ml-explore/mlx.git synced 2025-11-08 13:28:15 +08:00

Author	SHA1	Message	Date
Awni Hannun	3c2f192345	Propagate nans in binary ops (#579) * propagate nans in binary ops * handle empty matmul * cpu minimum/maximum propagate nan * benchmark maximum * add min as well * throw on negative indices with full * verbose on linux * fix matmul for zero K	2024-01-29 11:19:38 -08:00
Angelos Katharopoulos	37d98ba6ff	No gil eval (#565)	2024-01-26 22:03:52 -08:00
Awni Hannun	8993382aaa	Buffer Donation (#519) * buffer donation * fix to move shared pointer * format * gpu in place for copy and binary * revert ops test * cpu in place * a little cleanup * remove useless bench	2024-01-26 16:30:33 -08:00
Awni Hannun	07f35c9d8a	Fix a few issues: docs for flatten, erf, dequantize validation (#560) * doc flatten * erf doc * check values for dequantize * format	2024-01-26 15:16:46 -08:00
Jagrit Digani	bf17ab5002	Add more checks and clearer error messages to conv operations (#563) * Add more checks and clearer error messages to conv operations	2024-01-26 15:13:26 -08:00
Awni Hannun	8fa6b322b9	Compile front-end (#476) * fix tests for linux * make a move on compile * basic compile scaffold works * compile binding * clean * fix * fix grad, more tests * basic python tests * fix segfault on python exit * compile works with python closures * fix test * fix python globals bug, and erase * simplify * more cpp tests * bug fix with move function and compile at exit * simplify inputs also * enable and disable compiler * remove simplify * simplify tests use compile now * fix multi-output with compile * clear output tree from cache when function goes out of scope * ../python/src/transforms.cpp * remove closure capture * comments	2024-01-26 13:45:30 -08:00
David Koski	874b739f3c	Fix cache key in RoPE (#561)	2024-01-26 13:10:02 -08:00
taher	077c1ee64a	QR factorization (#310) * add qr factorization --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-26 09:27:31 -08:00
Rifur13	2463496471	[Fix] mx.allclose bug with infinite values (#539) * Added isclose op and fixed comparison with inf values * Added 'equal_nan' to match numpy * format * Add test * Update python/src/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * Update python/src/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * Addressed CR comments * Update python/src/ops.cpp Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * nits --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-25 20:47:06 -08:00
Angelos Katharopoulos	87b7fa9ba2	Bump the version (#554) v0.0.11	2024-01-25 11:01:05 -08:00
Danilo Peixoto	624065c074	Fix package installation for CI (#521) Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-01-25 09:43:34 -08:00
Awni Hannun	f27ec5e097	More helpful error message in vjp transform + concate bug (#543) * more helpful message in vjp transform * fix concatenate on mismatch dims * typo * typo	2024-01-24 09:58:33 -08:00
Awni Hannun	f30e63353a	Minor updates to address a few issues (#537) * docs on arg indices return type * arange with nan * undo isort	2024-01-23 22:24:41 -08:00
Juarez Bochi	4fe2fa2a64	GGUF: Avoid dequantization when format is compatible (#426) * GGUF: Don't dequantize q4_1 * Fix weight order. First in low bits * Add unpacking for q4_0 * Don't dequantize q8_0 * rebase quants and split file * don't quantize every weight * reapply patch * error handling --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 15:43:57 -08:00
Hazem Essam	37fc9db82c	Added Adafactor (#415) * Added adafactor * Added Adafactor and ran pre-commit * modified operations * Added docstrings * Switched two ops to fix a bug * added underscore for internal functions and removed the plus sign in the last return statment * Removed parameter rms from the optimizer state because its not needed * Added simple MNIST test for Adafactor and temporary training log * remove test files * nits in docs * comment nit --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 15:11:27 -08:00
AtomicVar	755dcf6137	Enable cross_entropy loss to handle dense targets (#517) * Enable cross_entropy loss to handle dense targets Dense targets means probabilities or one-hot encodings. * better shape check of weights * nits in docstring --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 12:17:22 -08:00
LeonEricsson	6b4b30e3fc	Common neural network initializers `nn.initializers` (#456) * initial commit: constant, normal, uniform * identity, glorot and he initializers * docstrings * rm file * nits * nits * nits * testing suite * docs * nits in docs * more docs * remove unused template * rename packakge to nn.innit * docs, receptive field * more docs --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 06:47:20 -08:00
Awni Hannun	86e0c79467	remove stale benchmarks (#527)	2024-01-22 22:17:58 -08:00
Awni Hannun	98c37d3a22	use axes in tensordot (#525)	2024-01-22 21:17:00 -08:00
Sugato Ray	f326dd8334	Update README.md (#524) Add conda install option in docs.	2024-01-22 20:53:54 -08:00
Jagrit Digani	6d3bee3364	Fix oob reads in gemv kernel (#523)	2024-01-22 12:06:04 -08:00
Danilo Peixoto	ecb174ca9d	Type annotations for `mlx.core` module (#512)	2024-01-21 12:53:12 -08:00
Awni Hannun	7a34e46677	Quantize with groups of 32 (#511) * allow quantize with group sizes of 32 * missing cpu dispatch * remove print * Fix qvm for group_size 32 --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-01-21 06:19:05 -08:00
Nripesh Niketan	92c22c1ea3	feat: Update isort version to 5.13.2 (#514)	2024-01-21 06:11:48 -08:00
Awni Hannun	d52383367a	format (#510)	2024-01-20 10:33:46 -08:00
Arda Orçun	363d3add6d	Add ValuError message for Adamax (#508) * ValuError message added * beta errors added * some corrections and testing * Learning rate limitation deleted	2024-01-20 07:56:15 -08:00
Awni Hannun	b207c2c86b	Power VJP fix for 0 (#505)	2024-01-20 01:17:40 -08:00
Awni Hannun	6bf779e72b	fix array from list for > 32 bit types (#501)	2024-01-19 15:49:25 -08:00
Juarez Bochi	ddf50113c5	GGUF: Load and save metadata (#446) * gguf metadata --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-19 14:06:05 -08:00
Arda Orçun	6589c869d6	Added MSE message (#500) * Added MSE message * changed wrong line. * Update examples/python/linear_regression.py Co-authored-by: Awni Hannun <awni.hannun@gmail.com> --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-01-19 06:27:50 -08:00
Anchen	f6feb61f92	feat: add support for saving safetensors in the `save_weights` (#497) * feat: add save safetensors support in module save_weights * chore: checking missing changes * Update python/mlx/nn/layers/base.py Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * chore: update docstring for load_weights --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-01-19 06:19:33 -08:00
Awni Hannun	c4ec836523	fix isinf for integer types (#494)	2024-01-19 05:31:10 -08:00
AtomicVar	550d4bf7c0	Update binary_cross_entropy function to handle both logits and probabilities (#492)	2024-01-18 19:22:23 -08:00
Awni Hannun	f6e911ced0	version bump (#490) * version bump * Fix the dev version string --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com> v0.0.10	2024-01-18 12:00:24 -08:00
Awni Hannun	3d99a8d31d	Fix format / build (#489)	2024-01-18 10:01:59 -08:00
Ethan	a749a91c75	Support disable metal buffer cache to prevent performance degradation caused by large memory caching (#390) * support disable metal buffer cache, due to large unused memory buffered when llm generated long context tokens * Run format and add "cache_enabled" feature tests	2024-01-18 08:33:34 -08:00
toji	49a52610b7	Added formatter structure and a boolean value formatter (#354) * added formatter structure and a boolean value formatter --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-18 07:49:41 -08:00
AtomicVar	d1fef34138	Add Gaussian NLL loss function (#477) * Add Gaussian NLL loss function --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-18 06:44:44 -08:00
Angelos Katharopoulos	9c111f176d	Fix split optimization for array iterator (#484)	2024-01-18 05:50:25 -08:00
Awni Hannun	78e5f2d17d	usage doc for function transformations (#481)	2024-01-17 17:10:53 -08:00
Angelos Katharopoulos	90c234b7ac	Fix round to round half-cases to even (#482)	2024-01-17 15:27:23 -08:00
Angelos Katharopoulos	135fd796d2	Fix detach for multi-output primitives (#480)	2024-01-17 14:08:07 -08:00
Jagrit Digani	78102a47ad	Update GEMM (#424) * Organize and collect metal subroutine templates and elements in `metal/kernels/steel/` * Update gemm elements for better performance * Add split-K specialization for gemm * Add `addmm` primitive, op and bindings for fused matmul and bias addition * Update tests and benchmarks as needed	2024-01-17 12:42:39 -08:00
Diogo	556cdf0e06	Resolves build issues with the extension example (#419) * resolved extension build issues and added test to ci * missing gguflib * rebased * force mlx install from fix branch * linux build issue * point to git install and comment out ci tests	2024-01-17 12:07:05 -08:00
Awni Hannun	275db7221a	Command buffer reports errors (#479) * command buffer reports errors * typo * simplify	2024-01-17 11:53:30 -08:00
AtomicVar	4a9012cba0	Sort some APIs docs by names (a-z) (#472)	2024-01-16 19:37:50 -08:00
Awni Hannun	a2bf7693dd	Primitive's VJP takes outputs as input (#475) Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-01-16 19:03:53 -08:00
Angelos Katharopoulos	d8fabaa12b	Split multi output (#461) * Multi-output split primitive * Add the multi-output split to the ArrayIterator * Add some grad tests for split	2024-01-16 13:33:55 -08:00
Avikant Srivastava	4e290d282f	feat: add time based seed to random.h (#457) * random seed from time * fix: chrono * refactor: snake case	2024-01-16 07:32:28 -08:00
Yashraj Singh	e72458a3fa	implemented isposinf and isneginf in one PR (#470) * ran precommit * updated docs	2024-01-16 06:48:07 -08:00

555 Commits