mlx-examples/llms/tests
Awni Hannun ee60e2a9d5
Kv cache (#643)
* in place kv_cache

* fix

* fix kv cache size

* partially fix kv cache dtype

* step kv cache

* multiple of step size

* more tests + kv cache

* more kv cache

* update all models to use kv cache
2024-05-08 08:18:13 -07:00
test_datsets.py Support for OpenAI’s fine-tuning dataset format (#548) 2024-03-19 16:45:46 -07:00
test_gguf.py fix(mlx-lm): type hints in gguf.py (#621) 2024-03-26 07:56:01 -07:00
test_lora.py chore(mlx-lm): fix the number of validation batches configuration. (#752) 2024-05-04 06:52:42 -07:00
test_models.py Kv cache (#643) 2024-05-08 08:18:13 -07:00
test_sample_utils.py fix(mlx-lm): type hints in gguf.py (#621) 2024-03-26 07:56:01 -07:00
test_server.py fix(mlx-lm): broken server.py (#690) 2024-04-18 14:26:18 -07:00
test_utils.py LoRA on all linear transformer block layers (#546) 2024-03-12 07:37:40 -07:00