mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-06-28 12:13:25 +08:00

Author	SHA1	Message	Date
madroid	39d5ca6427	LoRA: report last train info (#595 )	2024-03-19 17:29:50 -07:00
madroid	b0bcd86a40	Support for OpenAI’s fine-tuning dataset format (#548 ) * LoRA: move load_dataset to tuner/datasets.py file * LoRA: support OpenAI chat format datasets see https://platform.openai.com/docs/guides/fine-tuning/example-format * LoRA: support OpenAI completion format datasets * LoRA: formatting dataset timing to reduce memory footprint * Refactor dataset item access in PromptCompletionDataset * Update mlx_lm/LORA.md * Update mlx_lm/LORA.md * check Unsupported data format * add tests, fine-tune doc * add tests, fine-tune doc * add jinja2 for chat template * nits in readme * nits in readme --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-03-19 16:45:46 -07:00
madroid	d4e1de1d5b	add peak_memory info to training callback (#572 )	2024-03-13 20:17:10 -07:00
Awni Hannun	14fe868825	version (#570 )	2024-03-13 10:09:36 -07:00
Awni Hannun	39084e81c2	Some improvements to LoRA (#528 ) * set cache_limit * remove set cache_limit * cleanup * add gradient checkpointing * fix sort * mokey patch call for checkpoint * fix example config	2024-03-12 20:02:03 -07:00
Chime Ogbuji	e56d9015ef	LoRA on all linear transformer block layers (#546 ) * Add --lora-all-linear option to apply LoRa to all linear transfer block layers * Moved to YAML config and added specification of rank & alpha * nits in conifg, more tests * nit * run tests for prs --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-03-12 07:37:40 -07:00
Awni Hannun	ad3cf5ed98	dropout 0 as default (#549 )	2024-03-08 13:07:10 -08:00
Muhtasham Oblokulov	81e2a80026	Add Starcoder 2 (#502 ) * Add Starcoder2 model and update utils.py * Refactor model arguments and modules in starcoder2.py * Refactor FeedForward class to MLP in starcoder2.py * Fix typo * pre-commit * Refactor starcoder2.py: Update model arguments and modules * Fix LM head and MLP layers * Rename input layer norm * Update bias in linear layers * Refactor token embeddings in Starcoder2Model * Rename to standard HF attention layer name * Add LayerNorm * Add transposed token embeddings (like in Gemma) * Refactor MLP and TransformerBlock classes * Add tie_word_embeddings option to ModelArgs and update Model implementation * Add conditional check for tying word embeddings in Starcoder2Model * Fix bias in lm_head linear layer * Remove unused LayerNorm in stablelm * Update transformers dependency to use GitHub repository * fix lm head bug, revert transformer req * Update RoPE initialization in Attention class --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-03-02 19:39:23 -08:00
Ashish	261f1280f6	Update to StableLM code (#514 ) * StableLM now part of Transformers as stablelm rather than stablelm_epoch; changed config to match new changes * removing old file * reference new stablelm	2024-03-01 09:53:38 -08:00
Madroid Ma	f03c8a7b44	LoRA: adapter file Support path information (#505 ) * LoRA: adapter file Support path information * fix pre-commit lint * from os.path to pathlib.Path * Update llms/mlx_lm/tuner/trainer.py Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * rename check_checkpoints_path to checkpoints_path * fix pre-commit lint --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-02-29 22:20:49 -08:00
Awni Hannun	ab9172baac	Gemma support (#474 ) * gemma support * format * lora support for gemma	2024-02-21 08:47:13 -08:00
Madroid Ma	8eee4399f4	LoRA: Add printing and callbacks for learning rate during training (#457 ) * LoRA：Refactor TrainingCallback to enhance flexibility and extensibility This commit refactors the TrainingCallback class to accept a dictionary parameter for both on_train_loss_report and on_val_loss_report methods. By switching from multiple parameters to a single dict parameter, this change significantly improves the class's flexibility and makes it easier to extend with new training or validation metrics in the future without altering the method signatures. This approach simplifies the addition of new information to be logged or processed and aligns with best practices for scalable and maintainable code design. * LoRA: Add printing and callbacks for learning rate during training	2024-02-20 13:07:21 -08:00
Awni Hannun	e4d5630698	Basic CircleCI (#449 ) * basic style checks for circleci * format * fix config	2024-02-16 22:13:55 -08:00
Madroid Ma	0ba466369f	LoRA: add training callbacks (#414 ) * LoRA: add training callbacks * LoRA: add trained tokens print & callback	2024-02-16 06:04:57 -08:00
Madroid Ma	726b1ddec0	fix: check LoRA layers number error (#446 )	2024-02-16 06:03:33 -08:00
Chime Ogbuji	e446598f62	Passing parameterized loss and batching to trainer (#391 )	2024-02-13 07:03:25 -08:00
Madroid Ma	954aa50c54	LoRA: Improve validation error for LoRA layer count exceeding model layer (#427 ) * LoRA: Improve validation error for LoRA layer count exceeding model layer This commit enhances the error handling when the specified LoRA layer count exceeds the total number of layers in the model. It clarifies the error message to provide actionable feedback for users, guiding them to adjust their input parameters accordingly. * format + nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-02-13 06:56:27 -08:00
Awni Hannun	d4666615bb	Lazy import + refactor Lora layer addition (#426 ) * lazy model import in mlx_lm * change lora loading * fix olmo lora * remove a bunch of unused stuff from plamo * move phixtral to mlx-lm and out of llms/	2024-02-12 10:51:02 -08:00
Ivan Fioravanti	4576946151	Add checkpoints directory for adapter weights (#431 ) * Add checkpoints directory for adapter weights The code was modified to create a checkpoints directory if it doesn't exist yet. Adapter weights are now saved to this checkpoints directory during the training iterations. Corrected indentation of Save adapter weights code because it was part of "if eval" * Fixing a blank added by mistake	2024-02-12 10:50:05 -08:00
Nripesh Niketan	f1ef378a58	Feat: update pre-commit rev (#432 )	2024-02-11 07:23:27 -08:00
Anchen	0a49ba0697	fix(mlx-lm): apply lora layer doesn't update the lora weights (#396 )	2024-01-31 11:51:26 -08:00
Anchen	614de6652f	chore(mlx-lm): add reset lora layers helper (#377 ) * chore(mlx-lm): add reset lora layers helper * chore: rename the func * chore: update docstring * Update llms/mlx_lm/tuner/utils.py Co-authored-by: Awni Hannun <awni.hannun@gmail.com> --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-01-29 20:54:49 -08:00
Anchen	854ad8747a	feat(mlx-lm): add de-quant for fuse.py (#365 ) * feat(mlx-lm): add de-quant for fuse * chore: disable quant in to linear when de-quant enabled * chore: add better error handling for adapter file not found	2024-01-25 18:59:32 -08:00
Anchen	f51e98fcf1	chore(mlx-lm): truncate the input sentence to max seq len in lora iterate_batches (#373 ) * chore(mlx-lm): pass max seq len to evaluate in training loop * chore: make sure the batch seq not exceed max len * chore: update comment * chore: add warning before truncate input	2024-01-25 12:38:04 -08:00
Anchen	b1dec281b3	feat(mlx-lm): add lora hypeparameters in lora layer (#366 ) * feat(mlx-lm): add lora hypeparameters in lora layer * chore: address comments	2024-01-24 08:11:25 -08:00
Anchen	ab91ac1075	chore(mlx-lm): add load model with adapter and fix bug in sample (#360 ) * chore: add load model with adapter support and fix bug in sample * chore: ignore temp during calculating prob in sample	2024-01-23 19:47:39 -08:00
Anchen	362e88a744	feat: move lora into mlx-lm (#337 ) * feat: Add lora and qlora training to mlx-lm --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 08:44:37 -08:00

1 2

77 Commits