Commit Graph

22 Commits

Awni Hannun
f24edfa9dc
[mlx-lm] Add precompiled normalizations (#451)
* add precompiled normalizations

* nits
2024-02-22 12:40:55 -08:00
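
The change above precompiles the small normalization functions so repeated calls reuse a compiled graph. A minimal sketch of the pattern with mx.compile, assuming an RMSNorm-style function (illustrative, not the exact mlx-lm code):

```python
import mlx.core as mx


@mx.compile
def rms_norm(x, weight, eps=1e-5):
    # Compute the norm in float32 for numerical stability, then cast back.
    orig_dtype = x.dtype
    x = x.astype(mx.float32)
    x = x * mx.rsqrt(mx.mean(mx.square(x), axis=-1, keepdims=True) + eps)
    return weight * x.astype(orig_dtype)
```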
Awni Hannun
ab9172baac
Gemma support (#474)
* gemma support

* format

* lora support for gemma
2024-02-21 08:47:13 -08:00
Awni Hannun
8fd953ee2b
Support for slerp merging models (#455)
* support for slerp merging models

* docs

* update docs

* format
2024-02-19 20:37:15 -08:00
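
Slerp here is spherical linear interpolation of model weights. A rough sketch of the operation, assuming two weight tensors and a mixing ratio t (names and fallback behavior are illustrative, not the exact mlx-lm implementation):

```python
import numpy as np


def slerp(t: float, w0: np.ndarray, w1: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    # Interpolate along the great circle between the two weight directions.
    v0 = w0.ravel() / (np.linalg.norm(w0) + eps)
    v1 = w1.ravel() / (np.linalg.norm(w1) + eps)
    theta = np.arccos(np.clip(np.dot(v0, v1), -1.0, 1.0))
    if theta < eps:
        # Nearly parallel weights: fall back to plain linear interpolation.
        return (1 - t) * w0 + t * w1
    s0 = np.sin((1 - t) * theta) / np.sin(theta)
    s1 = np.sin(t * theta) / np.sin(theta)
    return s0 * w0 + s1 * w1
```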
devonthomas35
cc671cd1c7
Mixtral: Fix non-default arg follows default exception (#450)
Mixtral models throw the following exception:
```
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/opt/homebrew/anaconda3/lib/python3.11/site-packages/mlx_lm/generate.py", line 119, in <module>
    main(args)
  File "/opt/homebrew/anaconda3/lib/python3.11/site-packages/mlx_lm/generate.py", line 96, in main
    model, tokenizer = load(args.model, tokenizer_config=tokenizer_config)
                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/anaconda3/lib/python3.11/site-packages/mlx_lm/utils.py", line 278, in load
    model = load_model(model_path)
            ^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/anaconda3/lib/python3.11/site-packages/mlx_lm/utils.py", line 221, in load_model
    model_class, model_args_class = _get_classes(config=config)
                                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/anaconda3/lib/python3.11/site-packages/mlx_lm/utils.py", line 46, in _get_classes
    arch = importlib.import_module(f"mlx_lm.models.{model_type}")
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/anaconda3/lib/python3.11/importlib/__init__.py", line 126, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "<frozen importlib._bootstrap>", line 1204, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1176, in _find_and_load
  File "<frozen importlib._bootstrap>", line 1147, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 690, in _load_unlocked
  File "<frozen importlib._bootstrap_external>", line 940, in exec_module
  File "<frozen importlib._bootstrap>", line 241, in _call_with_frames_removed
  File "/opt/homebrew/anaconda3/lib/python3.11/site-packages/mlx_lm/models/mixtral.py", line 11, in <module>
    @dataclass
     ^^^^^^^^^
  File "/opt/homebrew/anaconda3/lib/python3.11/dataclasses.py", line 1230, in dataclass
    return wrap(cls)
           ^^^^^^^^^
  File "/opt/homebrew/anaconda3/lib/python3.11/dataclasses.py", line 1220, in wrap
    return _process_class(cls, init, repr, eq, order, unsafe_hash,
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/homebrew/anaconda3/lib/python3.11/dataclasses.py", line 1027, in _process_class
    _init_fn(all_init_fields,
  File "/opt/homebrew/anaconda3/lib/python3.11/dataclasses.py", line 545, in _init_fn
    raise TypeError(f'non-default argument {f.name!r} '
TypeError: non-default argument 'model_type' follows default argument
```
2024-02-18 13:30:26 -08:00
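
The traceback boils down to dataclass field ordering: a field without a default may not follow one that has a default. A minimal reproduction and the shape of the fix (field names are illustrative, not the actual mixtral.py arguments):

```python
from dataclasses import dataclass

# Broken ordering, raises at class-definition time:
#
# @dataclass
# class ModelArgs:
#     vocab_size: int = 32000
#     model_type: str  # TypeError: non-default argument 'model_type' follows default argument


@dataclass
class ModelArgs:
    model_type: str          # non-default fields first
    vocab_size: int = 32000  # defaulted fields after


print(ModelArgs("mixtral"))
```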
Angelos Katharopoulos
f71e965d57
Change gqa to use repeat instead of concatenate (#443) 2024-02-14 17:40:11 -08:00
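
The change swaps the key/value head expansion used for grouped-query attention from concatenating copies to a single mx.repeat. An illustrative sketch with example shapes (not the exact model code):

```python
import mlx.core as mx

n_heads, n_kv_heads = 32, 8
repeats = n_heads // n_kv_heads

# (batch, kv_heads, seq_len, head_dim)
keys = mx.zeros((1, n_kv_heads, 128, 64))

# Before: build `repeats` copies and concatenate them along a new axis.
expanded = mx.concatenate([mx.expand_dims(keys, 2)] * repeats, axis=2)

# After: a single repeat along that axis produces the same result.
repeated = mx.repeat(mx.expand_dims(keys, 2), repeats, axis=2)

keys_full = repeated.reshape(1, n_heads, 128, 64)
```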
Awni Hannun
06ddb8414d
Fix Qwen2 and SD (#441)
* fix qwen2

* version bump

* fix list shape
2024-02-14 13:43:12 -08:00
Awni Hannun
d4666615bb
Lazy import + refactor Lora layer addition (#426)
* lazy model import in mlx_lm

* change lora loading

* fix olmo lora

* remove a bunch of unused stuff from plamo

* move phixtral to mlx-lm and out of llms/
2024-02-12 10:51:02 -08:00
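
Lazy model import means the architecture module is resolved from the config's model_type at load time instead of importing every model up front. A hedged sketch of the pattern (the function name and error handling are illustrative):

```python
import importlib


def get_model_classes(config: dict):
    model_type = config["model_type"]
    try:
        arch = importlib.import_module(f"mlx_lm.models.{model_type}")
    except ImportError:
        raise ValueError(f"Model type {model_type} not supported.")
    return arch.Model, arch.ModelArgs
```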
Anchen
da7adae5ec
fix(mlx-lm): lazy load hf_olmo (#424) 2024-02-08 09:02:43 -08:00
Markus Enzweiler
9b387007ab
Example of a Convolutional Variational Autoencoder (CVAE) on MNIST (#264)
* initial commit

* style fixes

* update of ACKNOWLEDGMENTS

* fixed comment

* minor refactoring; removed unused imports

* added cifar and cvae to top-level README.md

* removed mention of cuda/mps in argparse

* fixed training status output

* load_weights() with strict=True

* pretrained model update

* fixed imports and style

* requires mlx>=0.0.9

* updated with results using mlx 0.0.9

* removed mention of private repo

* simplify and combine to one file, more consistency with other examples

* few more nits

* nits

* spell

* format

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-02-06 20:02:27 -08:00
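
The example adds a convolutional variational autoencoder; its core sampling step is the reparameterization trick, sketched generically below (not the example's actual code):

```python
import mlx.core as mx


def reparameterize(mu, logvar):
    # Draw z = mu + sigma * eps with eps ~ N(0, I), keeping the sampling
    # differentiable with respect to mu and logvar.
    sigma = mx.exp(0.5 * logvar)
    eps = mx.random.normal(shape=sigma.shape)
    return mu + sigma * eps
```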
Chris McMaster
2303238e44
Update olmo.py (#419)
exit should be imported when used outside of interactive mode
2024-02-06 16:16:46 -08:00
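
The fix concerns the exit() helper that the site module provides for interactive use; importing it from sys keeps the optional-dependency check working in scripts. A hedged sketch of the pattern (the message text is illustrative):

```python
from sys import exit

try:
    import hf_olmo  # optional dependency required for OLMo models
except ImportError:
    print("The hf_olmo package is required to run OLMo models.")
    exit(1)
```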
Anchen
a7d139f484
fix(mlx-lm): olmo 1b model (#417) 2024-02-06 05:27:05 -08:00
Awni Hannun
aa7447efa2
Olmo in MLX LM (#415)
* run olmo

* format
2024-02-05 21:13:49 -08:00
Junyang Lin
9d0dd34403
add qwen2 (#411) 2024-02-04 08:31:38 -08:00
Ashish
0b57f0eae6
Add StableLM-2 1.6B (#378)
* init

* stablelm

* add to readme

* bump version

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-26 10:28:00 -08:00
Anchen
362e88a744
feat: move lora into mlx-lm (#337)
* feat: Add lora and qlora training to mlx-lm


---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-23 08:44:37 -08:00
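
LoRA training wraps selected linear layers with a frozen base weight plus a trainable low-rank update. A compact sketch of the idea (class and parameter names are illustrative, not the exact mlx-lm layer):

```python
import math

import mlx.core as mx
import mlx.nn as nn


class LoRALinear(nn.Module):
    def __init__(self, in_dims: int, out_dims: int, rank: int = 8, scale: float = 2.0):
        super().__init__()
        # Frozen base projection; only the low-rank factors below are trained.
        self.linear = nn.Linear(in_dims, out_dims, bias=False)
        self.scale = scale
        self.lora_a = mx.random.uniform(
            low=-1 / math.sqrt(in_dims), high=1 / math.sqrt(in_dims), shape=(in_dims, rank)
        )
        self.lora_b = mx.zeros((rank, out_dims))

    def __call__(self, x):
        return self.linear(x) + self.scale * ((x @ self.lora_a) @ self.lora_b)
```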
Shunta Saito
85c1ff8fd6
Add PLaMo-13B model as an LLM example (#303)
* Convert HF weights of PLaMo and load them into a plamo model in mlx

* Fix model inference part

* Add bos at the beginning of the prompt

* Fix convert.py to copy tokenizer.model into the converted dir

* Use the required instruction format in generate.py when the "--instruct" option is specified

* Change filenames and update existing scripts

* Add README

* Add requirements.txt

* Fix plamo.py to stop generation when EOS appears

* Add quantization to convert.py

* Use mlx>=0.0.9 for mx.core.outer() in PLaMo model

* Update acknowledgements.md

* Fix card text in upload_to_hub()

* Do not use the prompt template when --instruct is not specified

* Ask whether to trust_remote_code when loading the PLaMo tokenizer

* Check that the user trusts the remote code when converting

* Remove plamo directory

* Update README

* Add PLaMo model file

* Fix the handling of cache in PLaMo and update README

* Ask if trust_remote_code only when the model is PLaMo

* Remove resolve_trust_remote_code from convert.py and use the latest transformers

* Remove code not to add EOS

* Update README so the example does not use the noncommercial version of the model

* Remove unused imports

* Remove unnecessary description about the instruct model of PLaMo from README

* format, nits in README

* typo

---------

Co-authored-by: Shunta Saito <shunta@mitmul-mbp.local>
Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-23 07:17:24 -08:00
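
Because the PLaMo tokenizer ships custom code, the conversion flow asks before passing trust_remote_code to transformers. A hedged sketch of that check (prompt wording and repo id are assumptions, not the actual convert.py):

```python
from transformers import AutoTokenizer

answer = input("Loading the PLaMo tokenizer requires executing remote code. Trust it? (y/n) ")
trust = answer.strip().lower() in ("y", "yes")

tokenizer = AutoTokenizer.from_pretrained("pfnet/plamo-13b", trust_remote_code=trust)
```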
Anchen
30be4c4734
refactor(qwen): moving qwen into mlx-lm (#312)
* refactor(qwen): moving qwen into mlx-lm

* chore: update doc

* chore: fix type hint

* add qwen model support in convert

* chore: fix doc

* chore: only load model in quantize_model

* chore: make the convert script only copy tokenizer files instead of loading and saving them

* chore: update docstring

* chore: remove unnecessary try catch

* chore: clean up tokenizer handling and update to transformers 4.37

* nits in README

---------

Co-authored-by: Awni Hannun <awni@apple.com>
2024-01-22 15:00:07 -08:00
Awni Hannun
bcc9fc3581
two minor fixes (#335) 2024-01-18 14:18:13 -08:00
Anchen
195bec2fa3
feat(mlx_lm): add mixtral support in mlx_lm (#318)
* feat: add mixtral support in mlx_lm

* chore: update doc
2024-01-15 07:18:14 -08:00
Anchen
a39b735c3b
chore(mlx-lm): update phi2 model args to sync with hf config format. (#311)
* chore(mlx-lm): update phi2 model args to sync with hf config format

* chore: fix type hint
2024-01-13 07:51:45 -08:00
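
Keeping the model args in sync with the HF config usually means constructing the dataclass from the config dict while ignoring keys it does not declare. A hedged sketch of that pattern (field names are illustrative):

```python
import inspect
from dataclasses import dataclass


@dataclass
class ModelArgs:
    model_type: str
    n_positions: int = 2048
    vocab_size: int = 51200

    @classmethod
    def from_dict(cls, params: dict):
        # Drop any HF config keys the dataclass does not define.
        allowed = inspect.signature(cls).parameters
        return cls(**{k: v for k, v in params.items() if k in allowed})
```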
Angelos Katharopoulos
1fa40067fe
Change tuple type definitions to use Tuple (#308) 2024-01-12 11:15:09 -08:00
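
The motivation for this change: subscripting the builtin tuple in annotations (tuple[int, int]) only works on Python 3.9+, while typing.Tuple is portable to older interpreters. A minimal illustration (the variable name is made up):

```python
from typing import Tuple

# Portable across Python versions:
window_sizes: Tuple[int, int] = (128, 256)

# Requires Python >= 3.9:
# window_sizes: tuple[int, int] = (128, 256)
```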
Awni Hannun
c6440416a2
Mlx llm package (#301)
* fix converter

* add recursive files

* remove gitignore

* remove gitignore

* add packages properly

* read me update

* remove dup readme

* relative

* fix convert

* fix community name

* fix url

* version
2024-01-12 10:25:56 -08:00
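
The packaging bullets ("add recursive files", "add packages properly") are the usual setuptools concerns of declaring sub-packages explicitly. A hedged sketch of what such a setup.py can look like (names, version, and dependency list are assumptions, not the actual file):

```python
from setuptools import find_packages, setup

setup(
    name="mlx-lm",
    version="0.0.1",
    description="LLM inference and fine-tuning with MLX",
    packages=find_packages(),  # picks up mlx_lm and mlx_lm.models recursively
    install_requires=["mlx", "numpy", "transformers"],
)
```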