mlx-examples/llms/mlx_lm at 4e017008169d31fe943b8ca6023f058a724e555f

zhangyiss/mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-12-16 02:08:55 +08:00

Zai Thottakath 4e01700816 Allow the entire model to be targeted for LoRA and DoRA fine tuning: LoRA and DoRA embeddings with small DoRALinear bug fix (#914)

* feature: LoRA adapter for Embeddings

* feature: wire in LoRAEmbedding into the tuner. Allow the embedding and non model.layers Linear layers to be targeted for fine tuning

* feature: DoRA adapter for Embeddings

* feature: wire in DoRAEmbedding

* bugfix: ensure self.m is recalculated when the linear layer is changed in DoRALinear.from_linear

* refactor: prefer from_base over from_linear or from_embedding. prefer fuse over to_linear or to_embedding

* cleanup: remove unused imports in test_dora.py

* refactor: remove unnecessary non_layer_modules

* cleanup: remove wrong comments for lora embedding dropout. remove unnecessary parens in dora embedding dropout

* nits

---------

Co-authored-by: Awni Hannun <awni@apple.com>

2024-08-16 07:38:36 -07:00

Example of response generation with optional arguments (#853)

2024-07-09 06:49:59 -07:00

Unify attention mask in LLMs (#911)

2024-07-25 16:45:22 -07:00

Allow the entire model to be targeted for LoRA and DoRA fine tuning: LoRA and DoRA embeddings with small DoRALinear bug fix (#914)

2024-08-16 07:38:36 -07:00

__init__.py

mlx_lm: Add Streaming Capability to Generate Function (#807)

2024-06-03 09:04:39 -07:00

convert.py

Create executables for generate, lora, server, merge, convert (#682)

2024-04-16 16:08:49 -07:00

fuse.py

Allow the entire model to be targeted for LoRA and DoRA fine tuning: LoRA and DoRA embeddings with small DoRALinear bug fix (#914)

2024-08-16 07:38:36 -07:00

generate.py

mlx_lm: Add Streaming Capability to Generate Function (#807)

2024-06-03 09:04:39 -07:00

gguf.py

Whisper updates to allow HF models (#923)

2024-08-09 11:11:58 -07:00

LORA.md

Configuration-based use of HF hub-hosted datasets for training (#701)

2024-06-26 10:20:50 -07:00

lora.py

Pass use_dora parameter to linear_to_lora_layers (#885)

2024-07-11 14:34:34 -07:00

MANAGE.md

Add model management functionality for local caches (#736)

2024-05-03 12:20:13 -07:00

manage.py

Add model management functionality for local caches (#736)

2024-05-03 12:20:13 -07:00

MERGE.md

Create executables for generate, lora, server, merge, convert (#682)

2024-04-16 16:08:49 -07:00

merge.py

Create executables for generate, lora, server, merge, convert (#682)

2024-04-16 16:08:49 -07:00

py.typed

Add py.typed to support PEP-561 (type-hinting) (#389)

2024-01-30 21:17:38 -08:00

README.md

feat: move lora into mlx-lm (#337)

2024-01-23 08:44:37 -08:00

requirements.txt

Example of response generation with optional arguments (#853)

2024-07-09 06:49:59 -07:00

sample_utils.py

Min P implementation (#926)

2024-08-15 15:45:02 -07:00

SERVER.md

Adapters loading (#902)

2024-08-01 16:18:18 -07:00

server.py

Predict stop sequence matches during streaming (#541)

2024-08-06 15:24:15 -07:00

tokenizer_utils.py

fix yi (#852)

2024-06-27 06:38:19 -07:00

UPLOAD.md

Mlx llm package (#301)

2024-01-12 10:25:56 -08:00

utils.py

Min P implementation (#926)

2024-08-15 15:45:02 -07:00

version.py

Configuration-based use of HF hub-hosted datasets for training (#701)

2024-06-26 10:20:50 -07:00

README.md

Generate Text with MLX and 🤗 Hugging Face

This is an example of large language model text generation that can pull models from the Hugging Face Hub.

For more information on this example, see the README in the parent directory.

This package also supports fine-tuning with LoRA or QLoRA. For more information, see the LoRA documentation (LORA.md).
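As a rough illustration of the text generation flow described above, the sketch below wraps the package's `load` and `generate` functions in a small helper. The helper name `generate_text` and its default `max_tokens` value are assumptions made for this example, not part of the package, and the keyword arguments reflect how `generate` is understood to behave around this commit; check `generate.py` and `utils.py` for the authoritative signatures.

```python
def generate_text(model_id: str, prompt: str, max_tokens: int = 100) -> str:
    """Load a model from the Hugging Face Hub and return a completion.

    `generate_text` is a hypothetical helper written for this example;
    only `load` and `generate` come from mlx_lm.
    """
    # Imported lazily so that importing this file does not require MLX.
    from mlx_lm import load, generate

    # load() fetches the weights and tokenizer for the given Hub repo
    # (or reuses a locally cached copy) and returns them as a pair.
    model, tokenizer = load(model_id)

    # generate() runs decoding and returns the generated text as a string.
    return generate(model, tokenizer, prompt=prompt, max_tokens=max_tokens)
```

A call might look like `generate_text("mlx-community/Mistral-7B-Instruct-v0.3-4bit", "Write a haiku about the ocean.")`, where the repo name is only an illustrative choice; any MLX-compatible model on the Hub should work the same way.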