mlx-examples/llms/mlx_lm/convert.py

# Copyright © 2023-2024 Apple Inc.

import argparse

from .utils import convert


def configure_parser() -> argparse.ArgumentParser:
    """
    Configures and returns the argument parser for the script.

    Returns:
        argparse.ArgumentParser: Configured argument parser.
    """
    parser = argparse.ArgumentParser(
        description="Convert Hugging Face model to MLX format"
    )

    parser.add_argument("--hf-path", type=str, help="Path to the Hugging Face model.")
    parser.add_argument(
        "--mlx-path", type=str, default="mlx_model", help="Path to save the MLX model."
    )
    parser.add_argument(
        "-q", "--quantize", help="Generate a quantized model.", action="store_true"
    )
    parser.add_argument(
        "--q-group-size", help="Group size for quantization.", type=int, default=64
    )
    parser.add_argument(
        "--q-bits", help="Bits per weight for quantization.", type=int, default=4
    )
    parser.add_argument(
        "--dtype",
        help="Type to save the non-quantized parameters.",
        type=str,
        choices=["float16", "bfloat16", "float32"],
        default="float16",
    )
    parser.add_argument(
        "--upload-repo",
        help="The Hugging Face repo to upload the model to.",
        type=str,
        default=None,
    )
    parser.add_argument(
        "-d",
        "--dequantize",
        help="Dequantize a quantized model.",
        action="store_true",
    )
    return parser


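def main():
    """Parse command-line arguments and run the conversion."""
    parser = configure_parser()
    args = parser.parse_args()
    # argparse stores "--hf-path" as "hf_path" (hyphens become underscores),
    # so each option is forwarded to convert() as a keyword argument.
    convert(**vars(args))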


if __name__ == "__main__":
    main()