mlx-examples/llms/mlx_lm/models
Prince Canuma 76c3244cc5
Add support for Cohere's Command-R (#565)
* initial commit for command-R

* update mlp, layernorm, lm_head and model args

* add custom layernorm

* add default to tie_word_embeddings

* add layernorm weight type and refactor

* update layernorm (bias conditional) in model/layers

* fix layer norm use traditional rope

* add test

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-03-13 07:03:36 -07:00
File          | Last commit | Date
__init__.py   | Mlx llm package (#301) | 2024-01-12 10:25:56 -08:00
base.py       | Mlx llm package (#301) | 2024-01-12 10:25:56 -08:00
cohere.py     | Add support for Cohere's Command-R (#565) | 2024-03-13 07:03:36 -07:00
gemma.py      | [mlx-lm] Use sdpa in llama / mistral model (#515) | 2024-03-07 17:41:23 -08:00
layers.py     | Add support for Cohere's Command-R (#565) | 2024-03-13 07:03:36 -07:00
llama.py      | chore(mlx-lm): fix tie_word_embeddings for qwen2 (#566) | 2024-03-12 21:34:32 -07:00
mixtral.py    | Enable unit testing in Circle and start some MLX LM tests (#545) | 2024-03-07 09:31:57 -08:00
olmo.py       | Enable unit testing in Circle and start some MLX LM tests (#545) | 2024-03-07 09:31:57 -08:00
phi.py        | Enable unit testing in Circle and start some MLX LM tests (#545) | 2024-03-07 09:31:57 -08:00
phixtral.py   | [mlx-lm] Add precompiled normalizations (#451) | 2024-02-22 12:40:55 -08:00
plamo.py      | Enable unit testing in Circle and start some MLX LM tests (#545) | 2024-03-07 09:31:57 -08:00
qwen2.py      | chore(mlx-lm): fix tie_word_embeddings for qwen2 (#566) | 2024-03-12 21:34:32 -07:00
qwen.py       | Enable unit testing in Circle and start some MLX LM tests (#545) | 2024-03-07 09:31:57 -08:00
stablelm.py   | [mlx-lm] Use sdpa in llama / mistral model (#515) | 2024-03-07 17:41:23 -08:00
starcoder2.py | chore(mlx-lm): fix tie_word_embeddings for qwen2 (#566) | 2024-03-12 21:34:32 -07:00