zhangyiss/mlx - mlx - Gitea for Geophysics

mirror of https://github.com/ml-explore/mlx.git synced 2025-08-27 08:46:41 +08:00

Author	SHA1	Message	Date
Diogo	35431a4ac8	Adds device context manager (#679 )	2024-02-14 14:14:58 -08:00
Awni Hannun	ccf1645995	Custom primitive + RoPE fat op (#676 ) * extensions start * rope custom op * fix build * docs + rope benchmark * fix test * Add a Metal kernel for RoPE * Fix position of traditional * transform tests * Move rope computation to float and fix tests * Fix the test and a typo * change to fast * fix no metal build --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-02-14 14:04:25 -08:00
Jagrit Digani	1a48713d32	Update gather and scatter to not use Argument Encoder (#683 ) * Replace argument encoder usage for gather and scatter * Use constant address space for shapes and strides * Split gather and scatter to improve compile times * Enable the GPU tests * Update the CI config * Fix scatter dispatch for scalar indices * Remove arg encoder utils --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-02-14 13:42:13 -08:00
Awni Hannun	1eb04aa23f	Fix empty array construction in cpp (#684 )	2024-02-13 23:34:17 -08:00
Noah Farr	0c65517e91	Return empty array when repeats is 0 in mx.repeat (#681 ) * Return empty array when repeats is 0 * Add test case for repeats = 0	2024-02-13 17:49:31 -08:00
Vijay Krish	2fdc2462c3	Faster gather and scatter. (#682 ) Reduce unnecessary integer ops, especially since these kernels are integer bound. Increase number of iterations for benchmarks for better smoothing. Github Issue #506 Co-authored-by: Vijay Krishnamoorthy <vijay_krish@apple.com>	2024-02-13 17:47:41 -08:00
Hinrik Snær Guðmundsson	be6e9d6a9f	Fixed wording in extensions.rst (#678 ) changed "learn how add" -> "learn how to add"	2024-02-13 08:39:02 -08:00
Gabrijel Boduljak	e54cbb7ba6	Pooling layers (#357 ) Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-02-12 22:08:13 -08:00
Angelos Katharopoulos	40c108766b	Quantized matmul fix (#677 ) * Fix qmv for small or unaligned matrices * Fix qmm	2024-02-12 18:54:21 -08:00
Mike Drob	4cc70290f7	PR Builder Workflow (#659 )	2024-02-12 17:47:21 -08:00
Awni Hannun	74caa68d02	nit in readme (#675 )	2024-02-12 12:25:04 -08:00
Awni Hannun	3756381358	Faster bfloat quantized mat-vec and vec-mat (#663 )	2024-02-11 21:53:16 -08:00
Awni Hannun	d12573daa6	quote file name (#670 )	2024-02-11 10:33:30 -08:00
Nripesh Niketan	0dbc4c7547	feat: Update pre-commit-config.yaml (#667 )	2024-02-11 06:08:20 -08:00
Vijay Krish	06072601ce	Scatter optimization : Eliminate 64b integer divide. (#662 ) Launch 2D grid to eliminate divide and mod in device code, since 64b integer division is very expensive. Github Issue #506 Co-authored-by: Vijay Krishnamoorthy <vijay_krish@apple.com>	2024-02-10 08:49:51 -08:00
Angelos Katharopoulos	11d2c8f7a1	Linux build for CI of other packages (#660 )	2024-02-09 18:17:04 -08:00
Awni Hannun	7f3f8d8f8d	Fix the softmax fix (#661 )	2024-02-09 17:02:13 -08:00
Awni Hannun	b96be943dc	bug fix (#658 )	2024-02-09 16:50:45 -08:00
Abdussamet Türker	b670485185	Remainder negative numerator bug fixed (#641 ) Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-02-09 16:49:14 -08:00
Diogo	b57bd0488d	Metadata support for safetensors (#639 ) * metadata support for safetensors * aliases making it a little more readable * addressing comments * python binding tests	2024-02-08 19:33:15 -08:00
Angelos Katharopoulos	221f8d3fc2	Bump the version to 0.2 (#656 )	2024-02-08 11:27:12 -08:00
Awni Hannun	5c03efaf29	Compile docs (#653 ) * compile docs * docs nits + comments	2024-02-08 11:21:50 -08:00
LeonEricsson	7dccd42133	updated calls to use loc & scale (#643 )	2024-02-08 09:01:59 -08:00
Awni Hannun	1b97b2958b	Compile with capture (#629 ) * Simple kernel generation * Remove the generate kernel from graph_utils * fix multi-output with compile * fuse with stopgrad * v1 input, output capture in compile * cleanup tree update with visitor update * nit * remove todo * state for model, optional explicit init and more pure optimizer steps * move learning rate to state * add lr to opt state, some fixes in capture * fix optim * update tuple of containers as well * fix stream for compiled output * rng state for compile * nit * updates and comments --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-02-07 17:29:22 -08:00
Awni Hannun	e5e816a5ef	fix sequential with empty modules at end (#647 )	2024-02-07 13:22:27 -08:00
Angelos Katharopoulos	28eac18571	Kernel generation (#614 ) Generate reusable element-wise kernels given a computation graph.	2024-02-07 13:15:59 -08:00
Noah Farr	5fd11c347d	Add loc and scale to random.normal (#638 ) * Add loc and scale to random.normal * Add tests for loc and scale for random.normal * Run pre-commit hooks * Fix code review	2024-02-07 11:49:59 -08:00
Aryan Gupta	ef73393a19	Feat: Add weights argument in BCE Loss and tests (#620 )	2024-02-07 09:39:52 -08:00
Angelos Katharopoulos	ea406d5e33	CI change (#645 ) * CI update * Skip large binary test for now * Upgrade pip * Add proper env variable skipping * Update the CI * Fix workflow name * Set the low memory flag for the tests * Change build process * Add pip upgrade * Use a venv * Add a missing env activate * Add setuptools * Add twine upload back * Re-enable automatic release builds	2024-02-07 06:04:34 -08:00
Awni Hannun	146bd69470	Skip compile when transforming (#635 ) * skip compile when transforming * simplify message	2024-02-05 21:28:37 -08:00
Jagrit Digani	316ff490b3	Remove masks from BlockLoader and clear out load case for invalid thread (#634 )	2024-02-05 16:00:17 -08:00
Awni Hannun	d40a04f8dc	minor fixes (#631 ) * minor fixes * var with ddof >= nelements	2024-02-05 13:27:49 -08:00
Awni Hannun	d75ae52ecd	Compile primitive (#571 ) * Compiled primitive with basic binary, unary graph-level fusion	2024-02-05 06:51:22 -08:00
Avikant Srivastava	31fea3758e	feat: enhancement of the error message for mlx.core.mean (#608 ) * add error message	2024-02-05 01:21:49 -08:00
Awni Hannun	e319383ef9	Faster gather (#626 ) * faster gather * update copyright	2024-02-04 17:25:44 -08:00
Awni Hannun	5c3ac52dd7	fix test (#627 )	2024-02-04 16:18:03 -08:00
David Koski	ebfd3618b0	fixes for building and running on iOS (#619 ) * fixes for building and running on iOS * per suggestion just use Accelerate	2024-02-04 12:29:17 -08:00
Avikant Srivastava	11a9fd40f0	fix: handle linspace function when num is 1 (#602 ) * fix: handle linspace function when num is 1 * add comment * fix test case * remove breakpoint	2024-02-04 11:03:49 -08:00
Daniel Strobusch	4fd2fb84a6	make python array SupportsAbs conform (like numpy) (#624 )	2024-02-04 09:31:02 -08:00
Daniel Strobusch	9852af1a19	fix "shape" docstring. (#623 )	2024-02-04 09:21:22 -08:00
minghuaw	16750f3c51	Fix typo in CMakeLists.txt (#616 )	2024-02-03 05:59:26 -08:00
Awni Hannun	95b5fb8245	minor changes (#613 )	2024-02-02 11:48:35 -08:00
AtomicVar	83f63f2184	Add Margin Ranking Loss (#536 )	2024-02-02 10:57:31 -08:00
Awni Hannun	cb6156d35d	Fix eval in trace bugs (#612 ) * Fix eval in trace bugs * comment nit	2024-02-02 09:57:12 -08:00
Piotr Rybiec	506d43035c	typo fix (#607 )	2024-02-01 17:39:55 -08:00
Angelos Katharopoulos	36cff34701	Bump the version (#604 )	2024-02-01 11:41:38 -08:00
Awni Hannun	e88e474fd1	Reduce vmap + some fixes (#601 )	2024-02-01 11:30:28 -08:00
David Koski	601c6d6aa8	Fix for AdaDelta (#603 ) - state was being read from parameter "s" - but being stored in parameter "u"	2024-02-01 09:56:27 -08:00
Angelos Katharopoulos	ba8d6bf365	Change the transformer to norm_first by default (#599 )	2024-01-31 12:55:30 -08:00
Sugato Ray	4a5f3b21bb	Add `py.typed` to support PEP-561 (type-hinting) for `mlx` (#588 ) * Add `py.typed` to support PEP-561 (type-hinting) This adds support for type-hinting information as laid in [PEP-561](https://peps.python.org/pep-0561/). * add py.typed to MANIFEST.in	2024-01-31 12:05:42 -08:00


1366 Commits