mlx-examples/llms/tests
Latest commit: More cache improvements (#1015) by Awni Hannun (fca087be49)
* fix rotating kv cache for chat use case

* reorg + fixes to caching; unify prompt caching across cache types and use cases (e.g. caching during a chat)

* nit in chat

* fix tests

* fix tests

* fix tests

* docs

* chat command

* comments + docs

* Define meta_state on all Cache implementations

* fixes + trim_prompt_cache api

* fix default model

---------

Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>
2024-10-07 20:45:51 -07:00
test_datsets.py Configuration-based use of HF hub-hosted datasets for training (#701) 2024-06-26 10:20:50 -07:00
test_finetune.py Feature: QDoRA (#891) 2024-09-30 08:01:11 -07:00
test_generate.py repetition_penalty and logits_bias just using logits_processors (#1004) 2024-09-30 08:49:03 -07:00
test_gguf.py fix(mlx-lm): type hints in gguf.py (#621) 2024-03-26 07:56:01 -07:00
test_models.py More cache improvements (#1015) 2024-10-07 20:45:51 -07:00
test_prompt_cache.py More cache improvements (#1015) 2024-10-07 20:45:51 -07:00
test_sample_utils.py Faster sampling with mx.compile (#937) 2024-08-15 11:29:09 -07:00
test_server.py Add /v1/models endpoint to mlx_lm.server (#984) 2024-09-28 07:21:11 -07:00
test_tuner_utils.py LoRA: Extract small function (#614) 2024-06-02 06:38:42 -07:00
test_utils_load_model.py support load model by custom get_model_classes (#899) 2024-07-25 11:01:17 -07:00
test_utils.py Fix Whisper conversion for safetensors models (#935) 2024-08-14 10:22:04 -07:00