zhangyiss/mlx - mlx - Gitea for Geophysics

mirror of https://github.com/ml-explore/mlx.git synced 2025-10-12 11:53:18 +08:00

Author	SHA1	Message	Date
Diogo	137f55bf28	fail early if readinto does not exist (#221 )	2023-12-19 13:27:17 -08:00
Emircan Erol	e549f84532	Triplet Loss (#211 ) * Triplet Loss * Requested Changes * Margin to alpha	2023-12-19 12:37:12 -08:00
Angelos Katharopoulos	dfa9f4bc58	An initial quantized matmul implementation (#205 ) * Add quantized matvec * Add quantized matrix matrix with 2nd matrix transposed * Add quantized matmul tests * Add a slow cpu quantized matmul * Add a slightly faster vectorized cpu version	2023-12-18 23:18:57 -08:00
Abe Leininger	e6872a4149	Added linspace (#181 ) * linspace ops support --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-18 19:57:55 -08:00
Juarez Bochi	f4f6e17d45	Fix cross-attention (#210 ) * Fix cross-attention With the current code, ln2 is a no-op. Its output should be passed to the cross-attention layer * Add name to contributors	2023-12-18 12:27:27 -08:00
Angelos Katharopoulos	4d4af12c6f	Adds round op and primitive (#203 )	2023-12-18 11:32:48 -08:00
jojopuppet	18cca64c81	Add smoothed L1 loss and enhancements to cross entropy loss (#166 ) * Add smooth_l1_loss * Add label smoothing for cross entropy loss --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-18 07:26:21 -08:00
Cyril Zakka, MD	8eb56beb3a	Added clip function (#159 ) * Added clip * Added Python bindings * Formatting * Added cpp tests * Added Python tests * python bindings work * rebase --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-17 20:00:29 -08:00
Awni Hannun	ee0c2835c5	Docs updates (#198 ) Reorganize NN docs + a few other tidbits.	2023-12-17 13:20:55 -08:00
Awni Hannun	90d04072b7	fix build w/ flatten (#195 )	2023-12-17 11:58:45 -08:00
__mo_san__	52e1589a52	implemented Flatten Module (#149 ) * implemented flatten op --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-16 21:54:37 -08:00
YUN, Junwoo	eebd7c275d	Add optimizers (AdaMax, AdaDelta, RMSprop) and ordering optimizer classes (#142 ) * Add AdaMax, AdaDelta, RMSprop	2023-12-16 21:43:15 -08:00
Awni Hannun	104c34f906	setitem negative indexing bug (#189 )	2023-12-16 06:44:47 -08:00
Diogo	dc2edc762c	added tri / tril / triu (#170 ) * added tri / tril / triu * fixed tests * ctest tests * tri overload and simplified tests * changes from comment * more tests for m * ensure assert if not 2-D * remove broadcast_to * minor tweaks --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-15 17:30:34 -08:00
Awni Hannun	2e02acdc83	add base kwarg to rope (#186 )	2023-12-15 16:47:59 -08:00
Víctor Aguilar	f24200db2c	accross -> across (#183 )	2023-12-15 13:46:50 -08:00
Jason	e28b57e371	Added mx.stack c++ frontend impl (#123 ) * stack C++ operation + python bindings	2023-12-14 13:21:19 -08:00
Awni Hannun	e5851e52b1	Add move and swap axis, and vmap for slice, concat, and gather (#158 ) * add move and swap axis, and vmap for slice, concat, and gather	2023-12-14 12:59:12 -08:00
Luca Arnaboldi	b93c4cf378	Floor and Ceil (#150 ) * Implements Floor and Ceil Ops	2023-12-14 10:00:23 -08:00
Stv.X	1e0c78b970	Fixed typo in some proprietary terms. (#161 )	2023-12-13 19:48:00 -08:00
Awni Hannun	25f70d4ca4	Fix divide types + floor divide (//) (#138 ) * divide types * fix black + test	2023-12-11 20:20:58 -08:00
Diogo	02de234ef0	Activations LeakyReLU / PReLU / Softplus / Mish (#109 ) * Leaky_relu / prelu / softplus / mish * added tests * updated bench * remove torch refs, add init to PReLU * added arXiv reference to mish * added missing docs	2023-12-11 19:40:57 -08:00
Nicholas Santavas	f5df47ec6e	Add Step, ELU, SELU, Swish activation functions (#117 ) * Add Step, ELU, SELU, Swish activation functions This commit adds the Step, ELU, SELU and Swish activations functions * add to the docs * review	2023-12-11 17:04:07 -08:00
Awni Hannun	b9226c367c	Fix CI format + build issue (#137 ) * fix ci * Fix python bindings build --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2023-12-11 15:01:41 -08:00
Angelos Katharopoulos	3214629601	Mlx array accessor (#128 ) * Add an accessor to interoperate with custom types * Change the docs to custom signatures	2023-12-11 13:42:55 -08:00
__mo_san__	072044e28f	fix and update binary cross entropy loss tests (#133 ) * fix conflicts * updated tests	2023-12-11 12:42:17 -08:00
Cyril Zakka, MD	e080290ba4	Added eye/identity ops (#119 ) `eye` and `identity` C++ and Python ops	2023-12-11 12:38:17 -08:00
Awni Hannun	69505b4e9b	fixes (#131 )	2023-12-11 09:26:49 -08:00
__mo_san__	f4ddd7dc44	Add Binary Cross Entropy loss (#122 ) * update BCE added tests for it ... * added binary cross entropy loss to docs * resolving conflicts for merge	2023-12-11 07:55:18 -08:00
Jason	b0cd092b7f	Added activation functions: leaky_relu relu6 softplus elu celu logsigmoid (#108 ) * added leaky_relu relu6 softplus elu celu logsigmoid * minor fixes for docstring and benchmark imports * fixed elu implementation and added tests * added tests for optional param, changed leaky_relu param to fit pytorch documentation	2023-12-10 16:31:38 -08:00
Awni Hannun	71d1fff90a	Bug fix in metal binary kernel dispatch for large arrays (#125 ) * bug fix * format	2023-12-10 16:12:31 -08:00
Awni Hannun	2d0130f80f	fix loss tests (#118 ) * fix loss tests * use none as default	2023-12-10 10:08:19 -08:00
__mo_san__	c1e1c1443f	Added Adagrad optimizer (#102 )	2023-12-10 09:22:39 -08:00
Henry Ansah	68bf1d7867	add nn module for sigmoid activation (#111 ) * add nn module for sigmoid activation * update .gitignore with .cache folder generated by jetbrains fleet ide * remove .cache folder	2023-12-10 07:00:39 -08:00
Angelos Katharopoulos	600db7d754	Fix build on Xcode 14 (#116 ) * Fix build on Xcode 14 * Style fixes	2023-12-10 06:58:52 -08:00
__mo_san__	ef7b8756c0	Add tanh activation function (#115 ) * added Adagrad optimizer ... * added Tanh activation function ... * reformatted file ... * remove unrelated stuff ... * Update activations.py	2023-12-09 19:25:38 -08:00
Enoch Kan	0b28399638	added mse_loss, nll_loss and kl_div_loss (#98 ) * added mse_loss, nll_loss and kl_div_loss * fixed axis not defined error in nll_loss * fixed axis not defined in kl_div_loss * added tests for mse, nll and kl_div * modified docstrings and added reduce helper func * updated docstring in kl_div_loss and moved helper func * added new kl divergence implementation * added reduction to test * updated docstring of kl_div_loss with correct spelling * added losses to nn.rst in docs	2023-12-09 14:25:03 -08:00
Joe Barrow	ac6dc5d3eb	Adding optional bias param to MultiHeadAttention (#104 ) * Adding optional param to * Run style-checker	2023-12-09 11:04:28 -08:00
Awni Hannun	89b90dcfec	Pr template (#99 ) * pr template * format fix	2023-12-09 09:36:56 -08:00
Angelos Katharopoulos	fd836d891b	Hashable dtype and mlx.core prefixed repr (#89 ) * Make dtype hashable * Add mlx.core prefix to our dtypes' repr * Update the dtype test	2023-12-09 09:35:28 -08:00
Awni Hannun	2520dbcf0a	add losses to the docs, fix black failure (#92 )	2023-12-09 06:06:52 -08:00
Abe Leininger	430bfb4944	Adds Nesterov momentum to SGD (#87 )	2023-12-08 23:23:36 -08:00
ShiJZ	08d51bf232	Make it easier to test new optimizers implemented: no need to change test file manually (#90 ) * add helper function get_all_optimizers() in test_optimizers.py * remove unused import	2023-12-08 21:39:08 -08:00
Kai Ma	cb9e585b8e	Style fix for loss functions (#91 ) * MLE and L1 loss functions * logsoftmax change and tests * subtract max logit for numerical stability * l1 name change * cross entropy reduction + unit tests * docstrings * l1 test name change * old loss impl + default none * style	2023-12-08 21:11:56 -08:00
Kai Ma	641d316484	MLE and L1 loss functions (#88 ) * MLE and L1 loss functions * logsoftmax change and tests * subtract max logit for numerical stability * l1 name change * cross entropy reduction + unit tests * docstrings * l1 test name change * old loss impl + default none	2023-12-08 20:21:37 -08:00
Angelos Katharopoulos	2b714714e1	Add the remainder op (#85 ) * Add remainder in the C++ backend * Add the python binding and test	2023-12-08 15:08:52 -08:00
Joe Barrow	69a24e6a1e	AdamW implementation (#72 ) * AdamW implementation without bias correction * Makes use of the underlying Adam implementation	2023-12-08 14:45:34 -08:00
Zach Schillaci	5b9be57ac3	Add isort pre-commit and run (#68 )	2023-12-08 11:31:47 -08:00
Angelos Katharopoulos	209404239b	Fix the accelerate dispatch for the power op (#70 ) - The exponent and base were swapped because accelerate is using exponent-base instead of base-exponent - Fix also the test for binary ops as it was testing op(x, x) which couldn't catch ordering errors like that	2023-12-08 10:58:03 -08:00
Zach Schillaci	d11d77e581	Spelling fixes in transformer.py (#59 )	2023-12-07 13:32:09 -08:00

60 Commits