mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-06-24 09:21:18 +08:00

Author	SHA1	Message	Date
Awni Hannun	fca087be49	More cache improvements (#1015 ) * fix rotating kv cache for chat use case * reorg + fixes to caching, unify prompt caching across types and use cases for e.g. caching during a chat * nit in chat * fix tests * fix tests * fix tests * docs * chat command * comments + docs * Define meta_state on all Cache implementations * fixes + trim_prompt_cache api * fix default model --------- Co-authored-by: Angelos Katharopoulos <a_katharopoulos@apple.com>	2024-10-07 20:45:51 -07:00
James Zhao	bf921afcbe	Make sure to import the correct "version" module when installing mlx_whisper and mlx_lm from local source code. (#969 ) * Make sure to import the correct "version" module when installing the mlx_whisper package from local source code. * Make sure to import the correct "version" module when installing the mlx_lm package from local source code * fix --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-09-03 13:16:21 -07:00
Angelos Katharopoulos	1003a8b2dd	Add the ability to load the KV cache from a file (#956 )	2024-08-28 22:11:45 -07:00
Chime Ogbuji	df6bc09d74	Configuration-based use of HF hub-hosted datasets for training (#701 ) * Add hf_dataset configuration for using HF hub-hosted datasets for (Q)LoRA training * Pre-commit formatting * Fix YAML config example * Print DS info * Include name * Add hf_dataset parameter default * Remove TextHFDataset and CompletionsHFDataset and use Dataset and CompletionsDataset instead, adding a text_key constructor argument to the former (and changing it to work with a provided data structure instead of just from a JSON file), and prompt_key and completion_key arguments to the latter with defaults for backwards compatibility. * nits * update docs --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-06-26 10:20:50 -07:00
Ivan Fioravanti	b468091f7f	Add model management functionality for local caches (#736 ) * Add model management functionality for local caches This commit introduces a set of command-line utilities for managing MLX models downloaded and saved locally in Hugging Face cache. The functionalities include scanning existing models, retrieving detailed information about a specific model, and deleting a model by its name. * Added mlx_lm.model to setup.py * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-05-03 12:20:13 -07:00
madroid	6775d6cb3f	Whisper: Add pip distribution configuration to support pip installations. (#739 ) * Whisper: rename whisper to mlx_whisper * Whisper: add setup.py config for publish * Whisper: add assets data to setup config * Whisper: pre-commit for setup.py * Whisper: Update README.md * Whisper: Update README.md * nits * fix package data * nit in readme --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-05-01 09:00:02 -07:00
Phúc H. Lê Khắc	35206806ac	Create executables for generate, lora, server, merge, convert (#682 ) * feat: create executables mlx_lm.<cmd> * nits in docs --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-04-16 16:08:49 -07:00
Awni Hannun	95f82e67a2	Fix import warning (#479 ) * fix import warning * fix version import * remove api, move convert to utils * also update circle to run external PRs	2024-02-27 08:47:56 -08:00
Awni Hannun	97c09a863d	bump version and include in package (#475 )	2024-02-21 09:40:36 -08:00
Awni Hannun	8fd953ee2b	Support for slerp merging models (#455 ) * support for slerp merging models * docs * update docs * format'	2024-02-19 20:37:15 -08:00
Awni Hannun	06ddb8414d	Fix Qwen2 and SD (#441 ) * fix qwen2 * version bump * fix list shape	2024-02-14 13:43:12 -08:00
Madroid Ma	954aa50c54	LoRA: Improve validation error for LoRA layer count exceeding model layer (#427 ) * LoRA: Improve validation error for LoRA layer count exceeding model layer This commit enhances the error handling when the specified LoRA layer count exceeds the total number of layers in the model. It clarifies the error message to provide actionable feedback for users, guiding them to adjust their input parameters accordingly. * format + nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-02-13 06:56:27 -08:00
Awni Hannun	aa7447efa2	Olmo in MLX LM (#415 ) * run olmo * format	2024-02-05 21:13:49 -08:00
Ashish	0b57f0eae6	Add StableLM-2 1.6B (#378 ) * init * stablelm * add to readme * bump version --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-26 10:28:00 -08:00
Awni Hannun	21aa8038fb	MLX LM version bump (#358 ) * version bump * include new package	2024-01-23 09:05:57 -08:00
Awni Hannun	b0870ed679	fix response + bump version (#319 )	2024-01-15 11:51:21 -08:00
Angelos Katharopoulos	1fa40067fe	Change tuple type definitions to use Tuple (#308 )	2024-01-12 11:15:09 -08:00
Awni Hannun	c6440416a2	Mlx llm package (#301 ) * fix converter * add recursive files * remove gitignore * remove gitignore * add packages properly * read me update * remove dup readme * relative * fix convert * fix community name * fix url * version	2024-01-12 10:25:56 -08:00

18 Commits