mlx-examples/llms/mlx_lm at 199df9e1105a49a5ac064ff2c6abb7bcd1b7285c - mlx-examples - Gitea for Geophysics

zhangyiss/mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-12-16 02:08:55 +08:00

Files

History

AtakanTekparmak 199df9e110 fix: Added dedicated error handling to load and get_model_path (#775 )

* fix: Added dedicated error handling to load and get_model_path

Added proper error handling to load and get_model_path by adding a dedicated exception class, because when the local path is not right, it still throws the huggingface RepositoryNotFoundError

* fix: Changed error message and resolved lack of import

* fix: Removed redundant try-catch block

* nits in message

* nits in message

---------

Co-authored-by: Awni Hannun <awni@apple.com>

2024-05-20 06:39:05 -07:00

..

support dora finetune in mlx-examples/llms/mlx_lm (#779 )

2024-05-16 08:21:26 -07:00

Support non incremental kv cache growth (#766 )

2024-05-15 12:56:24 -07:00

support dora finetune in mlx-examples/llms/mlx_lm (#779 )

2024-05-16 08:21:26 -07:00

__init__.py

Fix import warning (#479 )

2024-02-27 08:47:56 -08:00

convert.py

Create executables for generate, lora, server, merge, convert (#682 )

2024-04-16 16:08:49 -07:00

fuse.py

support dora finetune in mlx-examples/llms/mlx_lm (#779 )

2024-05-16 08:21:26 -07:00

generate.py

support dora finetune in mlx-examples/llms/mlx_lm (#779 )

2024-05-16 08:21:26 -07:00

gguf.py

fix(mlx-lm): type hints in gguf.py (#621 )

2024-03-26 07:56:01 -07:00

LORA.md

MiniCPM implementation (#685 )

2024-04-25 15:29:28 -07:00

lora.py

support dora finetune in mlx-examples/llms/mlx_lm (#779 )

2024-05-16 08:21:26 -07:00

MANAGE.md

Add model management functionality for local caches (#736 )

2024-05-03 12:20:13 -07:00

manage.py

Add model management functionality for local caches (#736 )

2024-05-03 12:20:13 -07:00

MERGE.md

Create executables for generate, lora, server, merge, convert (#682 )

2024-04-16 16:08:49 -07:00

merge.py

Create executables for generate, lora, server, merge, convert (#682 )

2024-04-16 16:08:49 -07:00

py.typed

Add py.typed to support PEP-561 (type-hinting) (#389 )

2024-01-30 21:17:38 -08:00

README.md

feat: move lora into mlx-lm (#337 )

2024-01-23 08:44:37 -08:00

requirements.txt

Quantize embedding / Update quantize API (#680 )

2024-04-18 18:16:10 -07:00

sample_utils.py

Use async eval (#670 )

2024-04-11 13:18:23 -07:00

SERVER.md

Validate server params & fix logit bias bug (#731 )

2024-04-30 07:27:40 -07:00

server.py

Add MLX Cache Limit setting for mlx_lm.generate and mlx_lm.server CLI (#744 )

2024-05-03 12:42:48 -07:00

tokenizer_utils.py

Kv cache (#643 )

2024-05-08 08:18:13 -07:00

UPLOAD.md

Mlx llm package (#301 )

2024-01-12 10:25:56 -08:00

utils.py

fix: Added dedicated error handling to load and get_model_path (#775 )

2024-05-20 06:39:05 -07:00

version.py

support dora finetune in mlx-examples/llms/mlx_lm (#779 )

2024-05-16 08:21:26 -07:00

README.md

Generate Text with MLX and 🤗 Hugging Face

This an example of large language model text generation that can pull models from the Hugging Face Hub.

For more information on this example, see the README in the parent directory.

This package also supports fine tuning with LoRA or QLoRA. For more information see the LoRA documentation.