mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-12-16 02:08:55 +08:00

Files

Awni Hannun ee60e2a9d5 Kv cache (#643 )

* in place kv_cache

* fix

* fix kv cache size

* partially fix kv cache dtype

* step kv cache

* multiple of step size

* more teests + kv cache

* more kv cache

* udpate all models to use kv cache

2024-05-08 08:18:13 -07:00

test_datsets.py

Support for OpenAI’s fine-tuning dataset format (#548 )

2024-03-19 16:45:46 -07:00

test_gguf.py

fix(mlx-lm): type hints in gguf.py (#621 )

2024-03-26 07:56:01 -07:00

test_lora.py

chore(mlx-lm): fix the number of validation batches configuration. (#752 )

2024-05-04 06:52:42 -07:00

test_models.py

Kv cache (#643 )

2024-05-08 08:18:13 -07:00

test_sample_utils.py

fix(mlx-lm): type hints in gguf.py (#621 )

2024-03-26 07:56:01 -07:00

test_server.py

fix(mlx-lm): broken server.py (#690 )

2024-04-18 14:26:18 -07:00

test_utils.py

LoRA on all linear transformer block layers (#546 )

2024-03-12 07:37:40 -07:00