mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-08-29 18:26:37 +08:00

Examples in the MLX framework

mlx

Go to file

Anupam Mediratta 5c89d1f6a6 Add instruct tuning support to LoRA training Fixes #484 Add support for instruct tuning with input/output pairs and alternative loss functions. * llms/mlx_lm/lora.py - Add `CompletionsDataset` class to support input/output pairs. - Modify `Dataset` class to handle different dataset types. - Update `main` function to include new dataset type. * llms/mlx_lm/tuner/trainer.py - Modify `default_loss` function to support alternative loss functions. - Add new `instruct_loss` function for instruct tuning. * llms/mlx_lm/LORA.md - Add instructions for instruct tuning with input/output pairs. - Update documentation to include alternative loss functions. * llms/tests/test_datasets.py - Add tests for `CompletionsDataset` and `create_dataset` functions. * llms/tests/test_trainer.py - Add tests for `default_loss` and `instruct_loss` functions. --- For more details, open the [Copilot Workspace session](https://copilot-workspace.githubnext.com/ml-explore/mlx-examples/issues/484?shareId=XXXX-XXXX-XXXX-XXXX).		2025-01-20 11:42:13 +05:30
.circleci	Add support for fewshot and apply chat template lm_eval functionality (#1180 )	2025-01-06 07:58:43 -08:00
bert	- Removed unused Python imports (#683 )	2024-04-16 07:50:32 -07:00
cifar	- Removed unused Python imports (#683 )	2024-04-16 07:50:32 -07:00
clip	feat(clip): add linear probe evaluation script (#960 )	2024-10-24 21:56:17 -07:00
cvae	Update a few examples to use compile (#420 )	2024-02-08 13:00:41 -08:00
encodec	MusicGen (#1020 )	2024-10-11 10:16:20 -07:00
flux	Change Flux default max_shift to 1.15 to match the official one (#1137 )	2024-12-08 23:29:48 -08:00
gcn	- Removed unused Python imports (#683 )	2024-04-16 07:50:32 -07:00
llava	fix llava (#1149 )	2024-12-12 10:37:26 -08:00
llms	Add instruct tuning support to LoRA training	2025-01-20 11:42:13 +05:30
lora	Validation with full data set, results in NaN validation score (#879 )	2024-07-10 08:36:11 -07:00
mnist	Use stable url for MNIST (#749 )	2024-05-03 17:13:05 -07:00
musicgen	Update README.md (#1045 )	2024-10-14 06:21:25 -07:00
normalizing_flow	Update a few examples to use compile (#420 )	2024-02-08 13:00:41 -08:00
segment_anything	Segment Anything Model (#552 )	2024-06-02 16:45:51 -07:00
speechcommands	Fix data_iter in prepare_dataset from speechcommands example (#1113 )	2024-12-02 23:56:07 -08:00
stable_diffusion	Fix format (#1115 )	2024-11-20 16:15:53 -08:00
t5	MusicGen (#1020 )	2024-10-11 10:16:20 -07:00
transformer_lm	transformer_lm: add --dataset enwik8 (#838 )	2024-06-26 11:59:01 -07:00
whisper	Allow converting models from local directories (#1118 )	2024-11-24 16:41:06 -08:00
.gitignore	More cache improvements (#1015 )	2024-10-07 20:45:51 -07:00
.pre-commit-config.yaml	chore: update black pre-commit hooks to latest versions (#955 )	2024-08-26 07:54:23 -07:00
ACKNOWLEDGMENTS.md	Adding full finetuning (#903 )	2024-09-29 17:12:47 -07:00
CODE_OF_CONDUCT.md	contribution + code of conduct	2023-11-29 12:31:18 -08:00
CONTRIBUTING.md	feat: add update_config functionality (#531 )	2024-03-14 06:36:05 -07:00
LICENSE	consistent copyright	2023-11-30 11:11:04 -08:00
README.md	FLUX: update README.md (#1036 )	2024-10-14 11:21:41 -07:00

README.md

MLX Examples

This repo contains a variety of standalone examples using the MLX framework.

The MNIST example is a good starting point to learn how to use MLX.

Some more useful examples are listed below.

Text Models

MLX LM a package for LLM text generation, fine-tuning, and more.
Transformer language model training.
Minimal examples of large scale text generation with LLaMA, Mistral, and more in the LLMs directory.
A mixture-of-experts (MoE) language model with Mixtral 8x7B.
Parameter efficient fine-tuning with LoRA or QLoRA.
Text-to-text multi-task Transformers with T5.
Bidirectional language understanding with BERT.

Image Models

Generating images
- FLUX
- Stable Diffusion or SDXL
Image classification using ResNets on CIFAR-10.
Convolutional variational autoencoder (CVAE) on MNIST.

Audio Models

Speech recognition with OpenAI's Whisper.
Audio compression and generation with Meta's EnCodec.

Multimodal models

Joint text and image embeddings with CLIP.
Text generation from image and text inputs with LLaVA.
Image segmentation with Segment Anything (SAM).

Other Models

Semi-supervised learning on graph-structured data with GCN.
Real NVP normalizing flow for density estimation and sampling.

Hugging Face

Note: You can now directly download a few converted checkpoints from the MLX Community organization on Hugging Face. We encourage you to join the community and contribute new models.

Contributing

We are grateful for all of our contributors. If you contribute to MLX Examples and wish to be acknowledged, please add your name to the list in your pull request.

Citing MLX Examples

The MLX software suite was initially developed with equal contribution by Awni Hannun, Jagrit Digani, Angelos Katharopoulos, and Ronan Collobert. If you find MLX Examples useful in your research and wish to cite it, please use the following BibTex entry:

@software{mlx2023,
  author = {Awni Hannun and Jagrit Digani and Angelos Katharopoulos and Ronan Collobert},
  title = {{MLX}: Efficient and flexible machine learning on Apple silicon},
  url = {https://github.com/ml-explore},
  version = {0.0},
  year = {2023},
}