zhangyiss/mlx - mlx - Gitea for Geophysics

mirror of https://github.com/ml-explore/mlx.git synced 2025-08-21 20:46:46 +08:00

Author	SHA1	Message	Date
m0saan	cf5a5a4a01	updated the batch norm doc string ^^	2023-12-24 23:10:37 +01:00
__mo_san__	c68a472b83	Update python/mlx/nn/layers/normalization.py Co-authored-by: Robert McCraith <mccraithrobert@gmail.com>	2023-12-24 23:10:37 +01:00
m0saan	019a85511c	improve batch norm code ^^	2023-12-24 23:10:37 +01:00
__mo_san__	b444a6a693	Update normalization.py	2023-12-24 23:10:37 +01:00
m0saan	a43b853194	refactored and updated batch norm tests ^^	2023-12-24 23:10:37 +01:00
__mo_san__	8b08f440d9	Update __init__.py	2023-12-24 23:10:34 +01:00
m0saan	82ca771e69	updated BN implementation to be more generic ^^	2023-12-24 23:10:10 +01:00
m0saan	7b0f8bda9c	updated docs and added examples to doc string ^^	2023-12-24 23:10:10 +01:00
m0saan	7ec3cadf98	added test cases for batch norm on 3D input & refactored code ^^	2023-12-24 23:10:10 +01:00
__mo_san__	eca773b62c	Update normalization.py	2023-12-24 23:10:10 +01:00
m0saan	a0b2a34e98	rebasing ...	2023-12-24 23:09:52 +01:00
m0saan	d4bf9a2976	calc running mean and var only when training	2023-12-24 23:08:45 +01:00
m0saan	c3c2fcf41d	update batch norm implementation -> fixed some bug and added support for 3D inputs	2023-12-24 23:08:45 +01:00
m0saan	e9fd1cf02d	update batch norm implementation	2023-12-24 23:08:45 +01:00
__mo_san__	ad53687ae7	Update normalization.py	2023-12-24 23:08:45 +01:00
m0saan	2b617b63bd	implemented batchnorm layer	2023-12-24 23:08:45 +01:00
Zach Schillaci	22fee5a383	Remove redundant assert in losses.py (#281 )	2023-12-24 08:39:08 -08:00
Daniel Strobusch	7365d142a3	random.uniform must respect dtype, even if lower precision than "low" (#280 ) Fix an edge case where random uniform returns a float32 array, even if a lower precision dtype is wanted due to adding the float32 "low" array.	2023-12-24 07:04:43 -08:00
Vidit Agarwal	8c3da54c7d	Fix failing test for log cosh loss (#275 ) * fix assert statement in log_cosh_loss * reformatted by pre-commit black	2023-12-23 16:26:46 -08:00
Vidit Agarwal	acf1721b98	Corrected the example of value_and_grad (#274 ) * Corrected the example for mx.value_and_grad * Reformat through pre-commit/black	2023-12-23 11:06:38 -08:00
Finn Voorhees	f91f450141	Fix argmax returns documentation (#263 )	2023-12-22 20:33:17 -08:00
Nicholas Santavas	d35fa1db41	Add Hinge, Huber and LogCosh losses (#199 )	2023-12-22 10:28:10 -08:00
Justin Deschenaux	e8deca84e0	Add dropout2d (#250 )	2023-12-22 08:02:29 -08:00
Angelos Katharopoulos	1d053e0d1d	Fix the alibi test that was left unchanged (#252 )	2023-12-21 14:59:25 -08:00
Hazem Essam	0aa65c7a6b	Added ALiBi implementation (#232 )	2023-12-21 14:36:38 -08:00
Angelos Katharopoulos	2c7df6795e	Make sure that arrays are freed when saving (#247 )	2023-12-21 14:08:24 -08:00
Angelos Katharopoulos	b3916cbf2b	Improve names of quantization arguments (#235 ) * Change the default quantization group_size to 64 * Rename groups to group_size and width to bits	2023-12-20 16:53:53 -08:00
Angelos Katharopoulos	57fe918cf8	Adds C++ and nn quantization utilities (#230 ) * Add C++ de-/quantize ops * Add quantize functions to the docs and tests * Add a QuantizedLinear module	2023-12-20 14:17:38 -08:00
Justin Deschenaux	4912ff3ec2	Add Lion optimizer (#209 ) * Add Lion optimizer * Update acknowledgements also with past contributions	2023-12-20 13:54:58 -08:00
Awni Hannun	f40d17047d	Indexing bug (#233 ) * fix * test	2023-12-20 10:44:01 -08:00
Angelos Katharopoulos	2807c6aff0	Implements divide for integer types and adds floor_divide op (#228 ) * Add floor_divide * Add floor_divide to the tests * Add floor_divide to the docs	2023-12-19 20:12:19 -08:00
Diogo	137f55bf28	fail early if readinto does not exist (#221 )	2023-12-19 13:27:17 -08:00
Emircan Erol	e549f84532	Triplet Loss (#211 ) * Triplet Loss * Requested Changes * Margin to alpha	2023-12-19 12:37:12 -08:00
Angelos Katharopoulos	dfa9f4bc58	An initial quantized matmul implementation (#205 ) * Add quantized matvec * Add quantized matrix matrix with 2nd matrix transposed * Add quantized matmul tests * Add a slow cpu quantized matmul * Add a slightly faster vectorized cpu version	2023-12-18 23:18:57 -08:00
Abe Leininger	e6872a4149	Added linspace (#181 ) * linspace ops support --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-18 19:57:55 -08:00
Juarez Bochi	f4f6e17d45	Fix cross-attention (#210 ) * Fix cross-attention With the current code, ln2 is a no-op. Its output should be passed to the cross-attention layer * Add name to contributors	2023-12-18 12:27:27 -08:00
Angelos Katharopoulos	4d4af12c6f	Adds round op and primitive (#203 )	2023-12-18 11:32:48 -08:00
jojopuppet	18cca64c81	Add smoothed L1 loss and enhancements to cross entropy loss (#166 ) * Add smooth_l1_loss * Add labels moothing for cross entropy loss --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-18 07:26:21 -08:00
Cyril Zakka, MD	8eb56beb3a	Added clip function (#159 ) * Added clip * Added Python bindings * Formatting * Added cpp tests * Added Python tests * python bindings work * rebase --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-17 20:00:29 -08:00
Awni Hannun	ee0c2835c5	Docs updates (#198 ) Reorganize NN docs + a few other tidbits.	2023-12-17 13:20:55 -08:00
Awni Hannun	90d04072b7	fix build w/ flatten (#195 )	2023-12-17 11:58:45 -08:00
__mo_san__	52e1589a52	implemented Flatten Module (#149 ) * implemented flatten op --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-16 21:54:37 -08:00
YUN, Junwoo	eebd7c275d	Add optimizers (AdaMax, AdaDelta, RMSprop) and ordering optimizer classes (#142 ) * Add AdaMax, AdaDelta, RMSprop	2023-12-16 21:43:15 -08:00
Awni Hannun	104c34f906	setite negative indexing bug (#189 )	2023-12-16 06:44:47 -08:00
Diogo	dc2edc762c	added tri / tril / triu (#170 ) * added tri / tril / triu * fixed tests * ctest tests * tri overload and simplified tests * changes from comment * more tests for m * ensure assert if not 2-D * remove broadcast_to * minor tweaks --------- Co-authored-by: Awni Hannun <awni@apple.com>	2023-12-15 17:30:34 -08:00
Awni Hannun	2e02acdc83	add base kwarg to rope (#186 )	2023-12-15 16:47:59 -08:00
Víctor Aguilar	f24200db2c	accross -> across (#183 )	2023-12-15 13:46:50 -08:00
Jason	e28b57e371	Added mx.stack c++ frontend impl (#123 ) * stack C++ operation + python bindings	2023-12-14 13:21:19 -08:00
Awni Hannun	e5851e52b1	Add move and swap axis, and vmap for slice, concat, and gather (#158 ) * add move and swap axis, and vmap for slice, concat, and gather	2023-12-14 12:59:12 -08:00
Luca Arnaboldi	b93c4cf378	Floor and Ceil (#150 ) * Implements Floor and Ceil Ops	2023-12-14 10:00:23 -08:00

1 2

91 Commits