mlx-examples/llms/mlx_lm
Gökdeniz Gülmez 50e5ca81a8
Adding full finetuning (#903)
* Adding full model weights finetuning

* Updating the LORA.md and ACKNOWLEDGMENTS.md files.

* removing --use-dora and --fulll-training and adding --fine-tune-type

* some clean up

* reformating and fixing dora training

* updated CONFIG_DEFAULTS

* update config example

* update in the config example fie

* Update LORA.md

* merge and commit

* adding argument for dora linear layer

* clean up

* clean up in the example yaml file

* fix

* final fix before sending

* small addition to re md file

* fix for loading the fully trained model by saving all the files and configs correctly

* clean up

* removing the unnesesairy files

* changing lora layers back to 16

* removed max file size

* nits

* resolve merge

* some consistency changes

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-09-29 17:12:47 -07:00
..
examples Adding full finetuning (#903) 2024-09-29 17:12:47 -07:00
models Adding support for mamba (#940) 2024-09-28 07:02:53 -07:00
tuner Adding full finetuning (#903) 2024-09-29 17:12:47 -07:00
__init__.py Make sure to import the correct "version" module when installing mlx_whisper and mlx_lm from local source code. (#969) 2024-09-03 13:16:21 -07:00
_version.py Update LLM generation docs to use chat template (#973) 2024-09-07 06:06:15 -07:00
cache_prompt.py Fix the cache_prompt (#979) 2024-09-06 20:19:27 -07:00
convert.py Create executables for generate, lora, server, merge, convert (#682) 2024-04-16 16:08:49 -07:00
fuse.py Adding full finetuning (#903) 2024-09-29 17:12:47 -07:00
generate.py Add prompt piping (#962) 2024-09-03 13:29:10 -07:00
gguf.py Fix export to gguf (#993) 2024-09-20 13:33:45 -07:00
LORA.md Adding full finetuning (#903) 2024-09-29 17:12:47 -07:00
lora.py Adding full finetuning (#903) 2024-09-29 17:12:47 -07:00
MANAGE.md Add model management functionality for local caches (#736) 2024-05-03 12:20:13 -07:00
manage.py Add model management functionality for local caches (#736) 2024-05-03 12:20:13 -07:00
MERGE.md Create executables for generate, lora, server, merge, convert (#682) 2024-04-16 16:08:49 -07:00
merge.py Create executables for generate, lora, server, merge, convert (#682) 2024-04-16 16:08:49 -07:00
py.typed Add py.typed to support PEP-561 (type-hinting) (#389) 2024-01-30 21:17:38 -08:00
README.md feat: move lora into mlx-lm (#337) 2024-01-23 08:44:37 -08:00
requirements.txt Use fast rope (#945) 2024-08-23 13:18:51 -07:00
sample_utils.py Min P implementation (#926) 2024-08-15 15:45:02 -07:00
SERVER.md Add /v1/models endpoint to mlx_lm.server (#984) 2024-09-28 07:21:11 -07:00
server.py Add /v1/models endpoint to mlx_lm.server (#984) 2024-09-28 07:21:11 -07:00
tokenizer_utils.py Fix setattr for the TokenizerWrapper (#961) 2024-08-28 14:47:33 -07:00
UPLOAD.md Mlx llm package (#301) 2024-01-12 10:25:56 -08:00
utils.py Adding full finetuning (#903) 2024-09-29 17:12:47 -07:00

Generate Text with MLX and 🤗 Hugging Face

This an example of large language model text generation that can pull models from the Hugging Face Hub.

For more information on this example, see the README in the parent directory.

This package also supports fine tuning with LoRA or QLoRA. For more information see the LoRA documentation.