mlx-examples/llms/mlx_lm/models at cd036ccfb535e349c1e176e9bb497b7887a95ce7 - mlx-examples - Gitea for Geophysics

zhangyiss/mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-12-16 02:08:55 +08:00

Files

History

Goekdeniz-Guelmez cd036ccfb5 fix generation works too (almost)

2024-10-16 21:13:36 +02:00

..

__init__.py

Mlx llm package (#301 )

2024-01-12 10:25:56 -08:00

base.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

cache.py

Tokenizer updates + tests (#1024 )

2024-10-14 10:48:46 -07:00

cohere.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

dbrx.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

deepseek_v2.py

Tokenizer updates + tests (#1024 )

2024-10-14 10:48:46 -07:00

deepseek.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gemma2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gemma.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gpt2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gpt_bigcode.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

gpt_neox.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

internlm2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

llama.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

mamba2 copy.py

fixing loading the model

2024-10-11 20:53:29 +02:00

mamba2-other.py

adding debug statements (somehiw generating only goes through the fist MambaMixer block pass)

2024-10-16 21:09:30 +02:00

mamba2-prch.py

adding debug statements

2024-10-11 21:36:41 +02:00

mamba2.py

fix generation works too (almost)

2024-10-16 21:13:36 +02:00

mamba.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

minicpm.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

mixtral.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

nemotron.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

olmo.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

openelm.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phi3.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phi3small.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phi.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phimoe.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

phixtral.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

plamo.py

Fix PLaMo model to support Grouped Query Attention (#1037 )

2024-10-12 15:26:50 -07:00

qwen2_moe.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

qwen2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

qwen.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

recurrent_gemma.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

stablelm.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

starcoder2.py

More cache improvements (#1015 )

2024-10-07 20:45:51 -07:00

su_rope.py

Add Phi-3.5-MoE (#946 )

2024-08-24 06:52:33 -07:00

switch_layers.py

Handle longer prompt/generation (#931 )

2024-08-16 15:28:39 -07:00