MLX Examples
This repo contains a variety of standalone examples using the MLX framework.
The MNIST example is a good starting point to learn how to use MLX.
Some more useful examples are listed below.
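
If you are new to MLX itself, here is a minimal sketch of the array, lazy evaluation, and function transformation basics that the MNIST example builds on. It is not taken from any example in this repo and assumes only that the `mlx` package is installed:

```python
import mlx.core as mx
import mlx.nn as nn

# Arrays are lazy: nothing is computed until evaluation is forced.
x = mx.random.normal((4, 8))
layer = nn.Linear(8, 2)
y = layer(x)
mx.eval(y)  # materialize the result

# Gradients come from transforming functions rather than taping operations.
def loss_fn(w):
    return mx.mean((x @ w - 1.0) ** 2)

w = mx.zeros((8, 2))
grad = mx.grad(loss_fn)(w)  # same shape as w
```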
Text Models
- Transformer language model training.
- Large scale text generation with LLaMA, Mistral, Phi-2, and more in the LLMs directory.
- A mixture-of-experts (MoE) language model with Mixtral 8x7B.
- Parameter-efficient fine-tuning with LoRA or QLoRA (a conceptual sketch of the LoRA idea follows this list).
- Text-to-text multi-task Transformers with T5.
- Bidirectional language understanding with BERT.
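
The LoRA example works by adding small trainable low-rank matrices to frozen linear layers. The class below is only a conceptual sketch of that idea in MLX, not the code in the `lora` directory; the name `LoRALinear` and the default rank and scale are illustrative:

```python
import math

import mlx.core as mx
import mlx.nn as nn

class LoRALinear(nn.Module):
    # A frozen linear layer plus a trainable low-rank update:
    # y = linear(x) + (alpha / r) * x A B
    def __init__(self, in_dims: int, out_dims: int, r: int = 8, alpha: float = 16.0):
        super().__init__()
        # The pretrained weight stays fixed; only the low-rank factors train.
        self.linear = nn.Linear(in_dims, out_dims, bias=False)
        self.linear.freeze()
        self.lora_a = mx.random.normal((in_dims, r)) * (1.0 / math.sqrt(in_dims))
        self.lora_b = mx.zeros((r, out_dims))  # zero init: training starts at the base model
        self.scale = alpha / r

    def __call__(self, x):
        return self.linear(x) + self.scale * ((x @ self.lora_a) @ self.lora_b)

layer = LoRALinear(16, 4)
print(layer(mx.random.normal((2, 16))).shape)
```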
Image Models
- Generating images with Stable Diffusion.
Audio Models
- Speech recognition with OpenAI's Whisper.
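
The Whisper example exposes transcription as a small Python call. The sketch below shows the general shape of that usage; the module name `whisper` and the audio path are assumptions here, so check the example's own README for the exact interface:

```python
import whisper  # the package provided by the whisper example (assumed import name)

# "speech.wav" is a placeholder path to any audio file on disk.
result = whisper.transcribe("speech.wav")
print(result["text"])
```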
Other Models
- Semi-supervised learning on graph-structured data with GCN.
Hugging Face
Note: You can now directly download a few converted checkpoints from the MLX Community organization on Hugging Face. We encourage you to join the community and contribute new models.
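
For example, a converted checkpoint can be pulled down with `huggingface_hub`; the repository id below is a placeholder, not a specific model:

```python
from huggingface_hub import snapshot_download

# Downloads the repo contents (weights, config, tokenizer) and returns the local path.
path = snapshot_download(repo_id="mlx-community/some-converted-model")
print(path)
```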
Contributing
We are grateful for all of our contributors. If you contribute to MLX Examples and wish to be acknowledged, please add your name to the list in your pull request.
Citing MLX Examples
The MLX software suite was initially developed with equal contribution by Awni Hannun, Jagrit Digani, Angelos Katharopoulos, and Ronan Collobert. If you find MLX Examples useful in your research and wish to cite it, please use the following BibTeX entry:
@software{mlx2023,
  author = {Awni Hannun and Jagrit Digani and Angelos Katharopoulos and Ronan Collobert},
  title = {{MLX}: Efficient and flexible machine learning on Apple silicon},
  url = {https://github.com/ml-explore},
  version = {0.0},
  year = {2023},
}