Hazem Essam
0aa65c7a6b
Added ALiBi implementation ( #232 )
2023-12-21 14:36:38 -08:00
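For reference, a minimal sketch of the ALiBi idea behind this commit (Press et al., 2021); this is an illustration of the technique, not the exact module added in #232, and it assumes a power-of-two head count:

```python
import math
import mlx.core as mx

def alibi_bias(num_heads: int, seq_len: int) -> mx.array:
    # Each head gets a fixed slope from a geometric sequence
    # (exact for power-of-two num_heads, per the ALiBi paper).
    start = 2 ** (-(2 ** -(math.log2(num_heads) - 3)))
    slopes = mx.array([start ** (i + 1) for i in range(num_heads)])
    # bias[h, i, j] = slope[h] * (j - i): a linear penalty that grows
    # with query-key distance, added to the attention logits.
    pos = mx.arange(seq_len)
    distances = pos.reshape(1, -1) - pos.reshape(-1, 1)
    return slopes.reshape(-1, 1, 1) * distances.reshape(1, seq_len, seq_len)
```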
Angelos Katharopoulos
b3916cbf2b
Improve names of quantization arguments ( #235 )
* Change the default quantization group_size to 64
* Rename groups to group_size and width to bits
2023-12-20 16:53:53 -08:00
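A usage sketch of the renamed arguments, assuming the post-#235 `mx.quantize`/`mx.dequantize` signatures:

```python
import mlx.core as mx

w = mx.random.normal((512, 512))
# `group_size` was `groups` (default now 64); `bits` was `width`.
w_q, scales, biases = mx.quantize(w, group_size=64, bits=4)
w_hat = mx.dequantize(w_q, scales, biases, group_size=64, bits=4)
```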
Angelos Katharopoulos
57fe918cf8
Adds C++ and nn quantization utilities ( #230 )
* Add C++ de-/quantize ops
* Add quantize functions to the docs and tests
* Add a QuantizedLinear module
2023-12-20 14:17:38 -08:00
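A minimal sketch of the QuantizedLinear module, using the argument names as they stand after the later rename in #235; the `from_linear` conversion helper is an assumption about the module's interface:

```python
import mlx.core as mx
import mlx.nn as nn

layer = nn.Linear(512, 512)
# Convert the float layer to a 4-bit quantized equivalent.
q_layer = nn.QuantizedLinear.from_linear(layer, group_size=64, bits=4)
y = q_layer(mx.random.normal((1, 512)))
```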
Juarez Bochi
f4f6e17d45
Fix cross-attention ( #210 )
* Fix cross-attention
With the current code, ln2 is a no-op. Its output should be passed to the cross-attention layer
* Add name to contributors
2023-12-18 12:27:27 -08:00
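A paraphrased sketch of the fixed data flow (names are illustrative, not the exact MLX decoder-layer source):

```python
import mlx.nn as nn

class DecoderLayer(nn.Module):
    def __init__(self, dims: int, num_heads: int):
        super().__init__()
        self.self_attention = nn.MultiHeadAttention(dims, num_heads)
        self.cross_attention = nn.MultiHeadAttention(dims, num_heads)
        self.ln1 = nn.LayerNorm(dims)
        self.ln2 = nn.LayerNorm(dims)

    def __call__(self, x, memory, x_mask=None, memory_mask=None):
        y = self.ln1(x)
        x = x + self.self_attention(y, y, y, x_mask)
        y = self.ln2(x)
        # The bug: cross-attention was given `x` instead of `y`,
        # so the ln2 output was computed and then discarded.
        x = x + self.cross_attention(y, memory, memory, memory_mask)
        return x
```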
Awni Hannun
ee0c2835c5
Docs updates ( #198 )
Reorganize NN docs + a few other tidbits.
2023-12-17 13:20:55 -08:00
Awni Hannun
2e02acdc83
add base kwarg to rope ( #186 )
2023-12-15 16:47:59 -08:00
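Usage sketch, assuming `base` scales the rotary frequencies as in the usual RoPE formulation (10000 being the conventional default):

```python
import mlx.core as mx
import mlx.nn as nn

rope = nn.RoPE(dims=64, base=1000000.0)  # larger base -> slower rotation
x = mx.random.normal((1, 16, 64))        # (batch, sequence, feature)
y = rope(x)
```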
Víctor Aguilar
f24200db2c
accross -> across ( #183 )
2023-12-15 13:46:50 -08:00
Awni Hannun
25f70d4ca4
Fix divide types + floor divide (//) ( #138 )
* divide types
* fix black + test
2023-12-11 20:20:58 -08:00
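A sketch of the intended semantics (the exact result dtypes are an assumption):

```python
import mlx.core as mx

a = mx.array([7, -7])
b = mx.array([2, 2])
print(a // b)  # floor division: [3, -4], stays integer
print(a / b)   # true division: [3.5, -3.5], promotes to float
```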
Diogo
02de234ef0
Activations LeakyReLU / PReLU / Softplus / Mish ( #109 )
* Leaky_relu / prelu / softplus / mish
* added tests
* updated bench
* remove torch refs, add init to PReLU
* added arXiv reference to mish
* added missing docs
2023-12-11 19:40:57 -08:00
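Quick usage sketch of the four modules; the constructor keywords are assumed to mirror the PyTorch conventions the commit references:

```python
import mlx.core as mx
import mlx.nn as nn

x = mx.random.normal((4, 8))
activations = [
    nn.LeakyReLU(negative_slope=0.01),
    nn.PReLU(num_parameters=1, init=0.25),  # learnable slope
    nn.Softplus(),
    nn.Mish(),  # x * tanh(softplus(x)), arXiv:1908.08681
]
for f in activations:
    print(f(x).shape)
```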
Nicholas Santavas
f5df47ec6e
Add Step, ELU, SELU, Swish activation functions ( #117 )
* Add Step, ELU, SELU, Swish activation functions
This commit adds the Step, ELU, SELU and Swish activation functions
* add to the docs
* review
2023-12-11 17:04:07 -08:00
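Definitional sketch of the four functions (formulas per the standard literature, not the MLX source):

```python
import mlx.core as mx

def step(x, threshold=0.0):
    return mx.where(x > threshold, 1.0, 0.0)

def elu(x, alpha=1.0):
    return mx.where(x > 0, x, alpha * (mx.exp(x) - 1))

def selu(x):
    # Fixed constants from Klambauer et al., 2017.
    return 1.0507 * elu(x, alpha=1.67326)

def swish(x):
    return x * mx.sigmoid(x)  # a.k.a. SiLU
```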
Jason
b0cd092b7f
Added activation functions: leaky_relu relu6 softplus elu celu logsigmoid ( #108 )
* added leaky_relu relu6 softplus elu celu logsigmoid
* minor fixes for docstring and benchmark imports
* fixed elu implementation and added tests
* added tests for optional param, changed leaky_relu param to match the PyTorch documentation
2023-12-10 16:31:38 -08:00
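Usage sketch, assuming the functional forms are exposed from `mlx.nn` under the snake_case names in the commit message (with `logsigmoid` as `log_sigmoid`):

```python
import mlx.core as mx
import mlx.nn as nn

x = mx.random.normal((4, 8))
for f in (nn.leaky_relu, nn.relu6, nn.softplus,
          nn.elu, nn.celu, nn.log_sigmoid):
    print(f(x).shape)
```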
Awni Hannun
71d1fff90a
Bug fix in metal binary kernel dispatch for large arrays ( #125 )
* bug fix
* format
2023-12-10 16:12:31 -08:00
Henry Ansah
68bf1d7867
add nn module for sigmoid activation ( #111 )
* add nn module for sigmoid activation
* update .gitignore with .cache folder generated by JetBrains Fleet IDE
* remove .cache folder
2023-12-10 07:00:39 -08:00
__mo_san__
ef7b8756c0
Add tanh activation function ( #115 )
* added Adagrad optimizer ...
* added Tanh activation function ...
* reformatted file ...
* remove unrelated stuff ...
* Update activations.py
2023-12-09 19:25:38 -08:00
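Together with the Sigmoid module from #111 above, a usage sketch (both are assumed to be thin class wrappers over the corresponding element-wise ops):

```python
import mlx.core as mx
import mlx.nn as nn

x = mx.array([-2.0, 0.0, 2.0])
print(nn.Sigmoid()(x))  # 1 / (1 + exp(-x))
print(nn.Tanh()(x))
```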
Joe Barrow
ac6dc5d3eb
Adding optional bias param to MultiHeadAttention ( #104 )
* Adding optional bias param to MultiHeadAttention
* Run style-checker
2023-12-09 11:04:28 -08:00
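Usage sketch, assuming the keyword is `bias` and it toggles bias terms on the projection layers:

```python
import mlx.core as mx
import mlx.nn as nn

attn = nn.MultiHeadAttention(dims=64, num_heads=8, bias=True)
x = mx.random.normal((1, 16, 64))
y = attn(x, x, x)  # self-attention: queries, keys, values
```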
Zach Schillaci
5b9be57ac3
Add isort pre-commit and run ( #68 )
2023-12-08 11:31:47 -08:00
Zach Schillaci
d11d77e581
Spelling fixes in transformer.py ( #59 )
2023-12-07 13:32:09 -08:00
rushyam
2e126aeb7e
Feature Addition: Encoder-Decoder Transformer Architecture ( #50 )
* Implemented the decoder transformer layer and decoder transformer, and introduced the encoder-decoder transformer
* added relu layer
* add src, tgt, memory mask
---------
Co-authored-by: rushyam <rushyam@rushyams-MacBook-Air.local>
2023-12-07 07:37:36 -08:00
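A sketch of the resulting encoder-decoder interface; the constructor and call signatures here are assumptions modeled on the PyTorch-style API the commit describes:

```python
import mlx.core as mx
import mlx.nn as nn

model = nn.Transformer(dims=64, num_heads=4,
                       num_encoder_layers=2, num_decoder_layers=2)
src = mx.random.normal((1, 10, 64))  # encoder input
tgt = mx.random.normal((1, 7, 64))   # decoder input
tgt_mask = nn.MultiHeadAttention.create_additive_causal_mask(7)
out = model(src, tgt, None, tgt_mask, None)  # src/tgt/memory masks
```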
Jagrit Digani
2440fe0124
NPY loading segfault bug ( #34 )
* Fixed GIL semantics in loading and saving from Python file streams
2023-12-06 12:03:47 -08:00
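Usage sketch of the code path this fixes, saving and loading through Python file streams rather than paths (stream support is taken from the commit description):

```python
import mlx.core as mx

a = mx.arange(10)
with open("a.npy", "wb") as f:
    mx.save(f, a)
with open("a.npy", "rb") as f:
    b = mx.load(f)
```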
Markus Enzweiler
2ffaee0c0d
Updated default argument for stride to 1 in Conv2d() in the docstring ( #22 )
2023-12-06 07:17:58 -08:00
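Sketch matching the corrected docstring; note that MLX convolutions are channels-last:

```python
import mlx.core as mx
import mlx.nn as nn

conv = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3)  # stride=1
x = mx.random.normal((1, 32, 32, 3))  # (N, H, W, C)
y = conv(x)                           # (1, 30, 30, 16)
```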
Awni Hannun
46a39e5b1f
copyright + ack
2023-11-30 11:12:53 -08:00
Jagrit Digani
e6306cfee9
jagrit's commit files
2023-11-29 10:52:08 -08:00
Angelos Katharopoulos
d1f86272a2
angelos's commit files
2023-11-29 10:42:59 -08:00
Awni Hannun
8ca7f9e8e9
awni's commit files
2023-11-29 10:30:41 -08:00