mlx-examples

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-11-08 16:10:37 +08:00

Author	SHA1	Message	Date
Awni Hannun	aa7447efa2	Olmo in MLX LM (#415 ) * run olmo * format	2024-02-05 21:13:49 -08:00
Ivan Fioravanti	7fbca214b1	Add max sequence length argument in lora.py (#408 ) A new argument "--max_seq_length" has been added to the command-line parser and passed as a parameter to the main function of the lora.py script. This allows users to specify and control the maximum sequence length during training.	2024-02-04 12:28:21 -08:00
Junyang Lin	9d0dd34403	add qwen2 (#411 )	2024-02-04 08:31:38 -08:00
Angelos Katharopoulos	e9b32747b4	Add grad checkpointing and PE in the transformer example (#387 ) * Add grad checkpointing and PE in the transformer example * Remove other frameworks from LM example * Remove the other frameworks from MNIST example * Improve the transformer LM example * Fix black and change LR	2024-02-01 13:04:03 -08:00
Awni Hannun	ec14583c2a	work with tuple shape (#393 )	2024-02-01 13:03:47 -08:00
ZHAOKAI WANG	0340113e02	BUG FIX: Decoding results in garbled text when multiple tokens represent a single character (e.g., Chinese). (#398 ) * Decoding results in garbled text when multiple tokens represent a single character (e.g., Chinese). * Decoding results in garbled text when multiple tokens represent a single character (e.g., Chinese).	2024-01-31 19:27:29 -08:00
Gabrijel Boduljak	94358219cf	CLIP (ViT) (#315 ) * probably approximatelly correct CLIPTextEncoder * implemented CLIPEncoderLayer as built-in nn.TransformerEncoderLayer * replaced embedding layer with simple matrix * implemented ViT * added ViT tests * fixed tests * added pooler_output for text * implemented complete CLIPModel * implemented init * implemented convert.py and from_pretrained * fixed some minor bugs and added the README.md * removed tokenizer unused comments * removed unused deps * updated ACKNOWLEDGEMENTS.md * Feat: Image Processor for CLIP (#1) @nkasmanoff: * clip image processor * added example usage * refactored image preprocessing * deleted unused image_config.py * removed preprocessing port * added dependency to mlx-data * fixed attribution and moved photos to assets * implemented a simple port of CLIPImageProcessor * review changes * PR review changes * renamed too verbose arg * updated README.md * nits in readme / conversion * simplify some stuff, remove unneeded inits * remove more init stuff * more simplify * make test a unit test * update main readme * readme nits --------- Co-authored-by: Noah Kasmanoff <nkasmanoff@gmail.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-31 14:19:53 -08:00
Madroid Ma	ba3a9355d1	LoRA: Remove unnecessary model type judgments (#388 ) * LoRA: Remove unnecessary model type judgments 1. Supported models are already checked in the load_model function in utils, no need to repeat the check in lora 2. The checks in lora are not synchronized with those in utils * LoRA: add LoRA supported models in mlx_lm utils	2024-01-31 11:55:27 -08:00
Anchen	0a49ba0697	fix(mlx-lm): apply lora layer doesn't update the lora weights (#396 )	2024-01-31 11:51:26 -08:00
Sugato Ray	ab8bde1590	Add `py.typed` to support PEP-561 (type-hinting) (#389 ) This adds support for type-hinting information as laid in [PEP-561](https://peps.python.org/pep-0561/).	2024-01-30 21:17:38 -08:00
David Koski	f8fadf7a17	Fix token count computation to fix tps measurements (#392 )	2024-01-30 11:24:16 -08:00
Anchen	614de6652f	chore(mlx-lm): add reset lora layers helper (#377 ) * chore(mlx-lm): add reset lora layers helper * chore: rename the func * chore: update docstring * Update llms/mlx_lm/tuner/utils.py Co-authored-by: Awni Hannun <awni.hannun@gmail.com> --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-01-29 20:54:49 -08:00
Ashish	20b969b412	Replace time.time() with time.perf_counter() as it is more suited for benchmarking (#380 )	2024-01-26 14:11:38 -08:00
Awni Hannun	5aa652d3c2	remove simplify (#379 )	2024-01-26 13:54:49 -08:00
Ashish	0b57f0eae6	Add StableLM-2 1.6B (#378 ) * init * stablelm * add to readme * bump version --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-26 10:28:00 -08:00
Anchen	854ad8747a	feat(mlx-lm): add de-quant for fuse.py (#365 ) * feat(mlx-lm): add de-quant for fuse * chore: disable quant in to linear when de-quant enabled * chore: add better error handling for adapter file not found	2024-01-25 18:59:32 -08:00
Anchen	f51e98fcf1	chore(mlx-lm): truncate the input sentence to max seq len in lora iterate_batches (#373 ) * chore(mlx-lm): pass max seq len to evaluate in training loop * chore: make sure the batch seq not exceed max len * chore: update comment * chore: add warning before truncate input	2024-01-25 12:38:04 -08:00
Yiğit Ö. Ünver	0f19237fb8	docs: added missing imports (#375 ) * add: missing import * add: missing import	2024-01-25 10:44:53 -08:00
Anchen	b1dec281b3	feat(mlx-lm): add lora hypeparameters in lora layer (#366 ) * feat(mlx-lm): add lora hypeparameters in lora layer * chore: address comments	2024-01-24 08:11:25 -08:00
Anchen	5fc8668a53	fix(mlx-lm): handle legacy quant models (#369 )	2024-01-24 07:44:05 -08:00
Anchen	ab91ac1075	chore(mlx-lm): add load model with adapter and fix bug in sample (#360 ) * chore: add load model with adapter support and fix bug in sample * chore: ignore temp during calculating prob in sample	2024-01-23 19:47:39 -08:00
Juarez Bochi	f5b80c95fb	Example reading directly from gguf file (#222 ) * Draft of tiny llama from gguf * Transpose all * No transposition with new layout * Read config from gguf * Create tokenizer from gguf * move gguf and update to be similar to hf_llm * change model to HF style + updates to REAMDE * nits in REAMDE * nit readme * only use mlx for metadata * fix eos/bos tokenizer * fix tokenization * quantization runs * 8-bit works * tokenizer fix * bump mlx version --------- Co-authored-by: Juarez Bochi <juarez.bochi@grammarly.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 15:41:54 -08:00
iLoveBug	40b61c1719	fix the chinese character generation as same as PR #321 (#342 ) * fix the chinese character generation as same as PR #321 * reuse the generate logic to utils.py * format * verbose defualt * fix conflicst with colorize and character check --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 12:44:23 -08:00
Awni Hannun	21aa8038fb	MLX LM version bump (#358 ) * version bump * include new package	2024-01-23 09:05:57 -08:00
Anchen	362e88a744	feat: move lora into mlx-lm (#337 ) * feat: Add lora and qlora training to mlx-lm --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 08:44:37 -08:00
Shunta Saito	85c1ff8fd6	Add PLaMo-13B model as an LLM example (#303 ) * Convert HF weights of PLaMo and load it to a plamo model in mlx * Fix model inference part * Add bos at the beginning of the prompt * Fix convert.py to copy tokenizer.model into the converted dir * Use the required insturction format in generate.py when "--instruct" option is specified * Change filenames and update existing scripts * Add README * Add requirements.txt * Fix plamo.py to stop generation when EOS appears * Add quantization to convert.py * Use mlx>=0.0.9 for mx.core.outer() in PLaMo model * Update acknowledgements.md * Fix card text in upload_to_hub() * Not use prompt template when --instruct is not specified * Ask if you trust_remote_code for loading tokenizer of PLaMo * Check the user trusts the remote code when converting * Remove plamo directory * Update README * Add PLaMo model file * Fix the handling of cache in PLaMo and update README * Ask if trust_remote_code only when the model is PLaMo * Remove resolve_trust_remote_code from convert.py and use the latest transformers * Remove code not to add EOS * Update README to fix an example not to use noncommercial version of the model * Remove unused imports * Remove unnecessary description about the instruct model of PLaMo from README * format, nits in README * typo --------- Co-authored-by: Shunta Saito <shunta@mitmul-mbp.local> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 07:17:24 -08:00
Ivan Fioravanti	c45c2311bd	Add colorized output option to generate script (#347 ) * Add colorized output option to generate script Two new functions were added to the script that allow output to be colorized based on the T[0] probability. Changes were made to the `generate_step` function in utils.py to permit colorization. Additionally, an argument for colorization was introduced to the command-line parser. * Rename 'colorize' parameter with 'return_probability' in generate_step	2024-01-23 05:25:44 -08:00
Sugato Ray	a445ac2895	Update docs with `conda` install option (#354 )	2024-01-22 21:14:48 -08:00
Baptiste Canton	42672f5446	add an option to apply the tokenizer chat template (#338 ) * add an option to apply the tokenizer chat template * fix the option to apply the tokenizer chat template * better error messages for chat template issues * apply the chat template by default when possible * nit in comment' * rebase --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-22 19:52:42 -08:00
Anchen	8022083979	feat(lora): add de-quantized support for fuse.py (#351 ) * feat(lora): add de-quantized support for fuse.py * address comments	2024-01-22 17:32:24 -08:00
Anchen	30be4c4734	refactor(qwen): moving qwen into mlx-lm (#312 ) * refactor(qwen): moving qwen into mlx-lm * chore: update doc * chore: fix type hint * add qwen model support in convert * chore: fix doc * chore: only load model in quantize_model * chore: make the convert script only copy tokenizer files instead of load it and save * chore: update docstring * chore: remove unnecessary try catch * chore: clean up for tokenizer and update transformers 4.37 * nits in README --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-22 15:00:07 -08:00
Nripesh Niketan	de15532da8	Feat: Bump isort version (#350 )	2024-01-21 06:35:15 -08:00
Anchen	1415595409	chore(lora): support mixtral in lora example (#343 )	2024-01-20 06:07:45 -08:00
Anchen	527cea4027	chore: fix the convert.py script for weights are not sanitized and support quant for non-32 dimensions (#340 ) * chore: fix convert script for weights not sanitized and suport quant for non 32 dim * Update llms/mlx_lm/utils.py Co-authored-by: Awni Hannun <awni.hannun@gmail.com> * chore: fix typo --------- Co-authored-by: Awni Hannun <awni.hannun@gmail.com>	2024-01-19 21:07:21 -08:00
bojanbabic	61297f547b	Missing requirements needed for convert script (#320 ) * fix requirements and add eos parameter * fix black * address comment * address comments - remove new arg	2024-01-18 19:04:24 -08:00
Awni Hannun	bcc9fc3581	two minor fixes (#335 )	2024-01-18 14:18:13 -08:00
Zheng Qu	d8680a89f9	Add argument `--save-every N` to lora.py for saving model regularly (#310 )	2024-01-16 20:03:33 -08:00
LeonEricsson	b4c20cc7f7	Stable Diffusion: Input image downsampling (#276 )	2024-01-16 13:45:00 -08:00
AtomicVar	2ba5d3db14	Refactor activation function and loss calculation (#325 )	2024-01-16 13:42:56 -08:00
AtomicVar	ce7b65e8c4	Fix import order of normalizing_flow (#326 )	2024-01-16 08:45:55 -08:00
someone	2287294723	fix mlx_lm generator for chinese (#321 ) * fix generator for chinese * add REPLACEMENT_CHAR --------- Co-authored-by: cg <cg@qq.com>	2024-01-16 07:13:33 -08:00
Awni Hannun	b0870ed679	fix response + bump version (#319 )	2024-01-15 11:51:21 -08:00
Anchen	195bec2fa3	feat(mlx_lm): add mixtral support in mlx_lm (#318 ) * feat: add mixtral support in mlx_lm * chore: update doc	2024-01-15 07:18:14 -08:00
Siddharth Mishra-Sharma	19b6167d81	Normalizing flow example (#133 ) * Implement normalizing flow Real NVP example * Add requirements and basic usage to normalizing flow example * Minor changes to README in normalizing flow example * Remove trailing commas in function arguments for unified formatting in flows example * Fix minor typos, add some annotations * format + nits in README * readme fix * mov, minor changes in main, copywright * remove debug * fix * Simplified class structure in distributions; better code re-use in bijectors * Remove rogue space * change name again * nits --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-13 16:58:48 -08:00
Marcel Bischoff	cd3cff0858	Phixtral (#290 ) * initial * file * remove debug * Adding README * typo * simplify readme * nits in readmes --------- Co-authored-by: Marcel Bischoff <marcel.bischoff@awarehq.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-13 08:35:03 -08:00
Anchen	a39b735c3b	chore(mlx-lm): update phi2 model args to sync with hf config format. (#311 ) * chore(mlx-lm): update phi2 model args to sync with hf config format * chore: fix type hint	2024-01-13 07:51:45 -08:00
Yousif	7575125d5d	Added lora support for Phi-2 (#302 ) * Added lora support for Phi-2 * Added Phi-2 support in fuse and convert * format + readme --------- Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-12 13:45:30 -08:00
Alexandre Boucaud	3ac731dd4f	Fix TypeError in whisper benchmark script (#306 ) * Add missing keyword to the decoding options * Reverting last commit * Fixing transcribe keyword in benckmark.py * Add argument name to load_model This is intended to avoid confusion	2024-01-12 13:07:15 -08:00
Pedro Cuenca	ef93979973	Update model card uploaded with converted models (#309 )	2024-01-12 13:03:52 -08:00
Angelos Katharopoulos	1fa40067fe	Change tuple type definitions to use Tuple (#308 )	2024-01-12 11:15:09 -08:00

1 2 3 4 5 ...

286 Commits