mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-08-10 19:26:46 +08:00

Prince Canuma 76c3244cc5 Add support for Cohere's Command-R (#565)	2024-03-13 07:03:36 -07:00
* initial commit for command-R
* update mlp, layernorm, lm_head and model args
* add custom layernorm
* add default to tie_word_embeddings
* add layernorm weight type and refactor
* update layernorm (bias conditional) in model/layers
* fix layer norm use traditional rope
* add test
Co-authored-by: Awni Hannun <awni@apple.com>

__init__.py	Mlx llm package (#301)	2024-01-12 10:25:56 -08:00
base.py	Mlx llm package (#301)	2024-01-12 10:25:56 -08:00
cohere.py	Add support for Cohere's Command-R (#565)	2024-03-13 07:03:36 -07:00
gemma.py	[mlx-lm] Use sdpa in llama / mistral model (#515)	2024-03-07 17:41:23 -08:00
layers.py	Add support for Cohere's Command-R (#565)	2024-03-13 07:03:36 -07:00
llama.py	chore(mlx-lm): fix tie_word_embeddings for qwen2 (#566)	2024-03-12 21:34:32 -07:00
mixtral.py	Enable unit testing in Circle and start some MLX LM tests (#545)	2024-03-07 09:31:57 -08:00
olmo.py	Enable unit testing in Circle and start some MLX LM tests (#545)	2024-03-07 09:31:57 -08:00
phi.py	Enable unit testing in Circle and start some MLX LM tests (#545)	2024-03-07 09:31:57 -08:00
phixtral.py	[mlx-lm] Add precompiled normalizations (#451)	2024-02-22 12:40:55 -08:00
plamo.py	Enable unit testing in Circle and start some MLX LM tests (#545)	2024-03-07 09:31:57 -08:00
qwen2.py	chore(mlx-lm): fix tie_word_embeddings for qwen2 (#566)	2024-03-12 21:34:32 -07:00
qwen.py	Enable unit testing in Circle and start some MLX LM tests (#545)	2024-03-07 09:31:57 -08:00
stablelm.py	[mlx-lm] Use sdpa in llama / mistral model (#515)	2024-03-07 17:41:23 -08:00
starcoder2.py	chore(mlx-lm): fix tie_word_embeddings for qwen2 (#566)	2024-03-12 21:34:32 -07:00