mlx-examples/llms/mlx_lm/models at 758597eaa80c9b07acf9588c6e04c22f06eeb3ee - mlx-examples - Gitea for Geophysics

zhangyiss/mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-12-16 02:08:55 +08:00

Files

History

Goekdeniz-Guelmez 758597eaa8 adding multi token input and correct cache handling in ssm step

2024-10-22 20:44:23 +02:00

..

__init__.py

Mlx llm package (#301 )

2024-01-12 10:25:56 -08:00

base.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

cache.py

generation works but outputs gibberish

2024-10-20 18:04:34 +02:00

cohere.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

dbrx.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

deepseek_v2.py

Tokenizer updates + tests (#1024 )

2024-10-14 10:48:46 -07:00

deepseek.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gemma2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gemma.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gpt2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gpt_bigcode.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gpt_neox.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

internlm2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

llama.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

mamba2 copy.py

generation works! trying training now

2024-10-22 18:25:59 +02:00

mamba2-other.py

adding debug statements (somehiw generating only goes through the fist MambaMixer block pass)

2024-10-16 21:09:30 +02:00

mamba2-prch.py

generation works! trying training now

2024-10-22 18:25:59 +02:00

mamba2.py

adding multi token input and correct cache handling in ssm step

2024-10-22 20:44:23 +02:00

mamba.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

minicpm.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

mixtral.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

nemotron.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

olmo.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

openelm.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phi3.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phi3small.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phi.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phimoe.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phixtral.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

plamo.py

Fix PLaMo model to support Grouped Query Attention (#1037 )

2024-10-12 15:26:50 -07:00

qwen2_moe.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

qwen2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

qwen.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

recurrent_gemma.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

stablelm.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

starcoder2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

su_rope.py

Add Phi-3.5-MoE (#946 )

2024-08-24 06:52:33 -07:00

switch_layers.py

Handle longer prompt/generation (#931 )

2024-08-16 15:28:39 -07:00