Commit Graph

  • c5cb83b9a0 Update lora readme.md, update mistral-7b-v0.1 to mistral-7b-v0.2-instruct madroid 2023-12-25 22:11:46 +0800
  • 447abb2b0f QWEN: Fix unsupported ScalarType BFloat16 (#187) Yifan 2023-12-25 22:10:01 +0800
  • 738448c2d4
    QWEN: Fix unsupported ScalarType BFloat16 (#187) Yifan 2023-12-25 22:10:01 +0800
  • b9607f9510
    QWEN: Fix unsupported ScalarType BFloat16 Yifan 2023-12-25 20:18:26 +0800
  • 35b219b7ed
    Merge branch 'ml-explore:main' into main LeonEricsson 2023-12-25 11:27:08 +0100
  • c190af6dc6 updated README (#184) Vidyasagar Bhargava 2023-12-24 19:49:53 +0530
  • 647e48870a
    updated README (#184) Vidyasagar Bhargava 2023-12-24 19:49:53 +0530
  • a43afd3234 updated README vidyasagarbhargava 2023-12-24 13:51:27 +0530
  • efa9837195 Mixtral: Stop at EOS token (#183) devonthomas35 2023-12-23 21:25:42 -0800
  • 939086e6a3
    Mixtral: Stop at EOS token (#183) devonthomas35 2023-12-23 21:25:42 -0800
  • 2794f04873 Fix precommit hooks Devon Thomas 2023-12-23 20:00:01 -0800
  • 5458722776 Fix precommit hooks Devon Thomas 2023-12-23 19:58:07 -0800
  • c79b004d27 Precommit format files Devon Thomas 2023-12-23 19:56:44 -0800
  • f2baee6fd2 Stop at EOS token Devon Thomas 2023-12-23 19:33:29 -0800
  • 909a1e6909 fashion-mnist example (#180) Kashif Rasul 2023-12-23 16:34:45 +0100
  • 0371d90ccb
    fashion-mnist example (#180) Kashif Rasul 2023-12-23 16:34:45 +0100
  • 0c2f1a3c55 fix from review Kashif Rasul 2023-12-23 16:25:53 +0100
  • ebadac12e8 use non-zero exit code on error (#177) Daniel Strobusch 2023-12-23 16:10:13 +0100
  • 848f118ac5
    use non-zero exit code on error (#177) Daniel Strobusch 2023-12-23 16:10:13 +0100
  • 5320c4ff1f fix bad convert parameter (#178) Daniel Strobusch 2023-12-23 16:09:49 +0100
  • 092e87211e
    fix bad convert parameter (#178) Daniel Strobusch 2023-12-23 16:09:49 +0100
  • 0ed2de6a61 fashion mnist example Kashif Rasul 2023-12-23 11:22:23 +0100
  • 352b488cf6
    fix bad convert parameter Daniel Strobusch 2023-12-23 09:05:45 +0100
  • 668d0f3b08
    use non-zero exit code on error Daniel Strobusch 2023-12-23 08:46:31 +0100
  • 88df9691b3 Align CLI args and some smaller fixes (#167) Alvaro Bartolome 2023-12-22 23:34:32 +0100
  • f4709cb807
    Align CLI args and some smaller fixes (#167) Alvaro Bartolome 2023-12-22 23:34:32 +0100
  • 1abaebffdf one more Awni Hannun 2023-12-22 14:33:31 -0800
  • 7296440151 standardize Awni Hannun 2023-12-22 14:32:23 -0800
  • d05fa79284 Align CLI args and minor fixes Alvaro Bartolome 2023-12-21 09:49:45 +0100
  • bdc1f4d1f6 Fix variable naming of config in mixtral/convert.py Alvaro Bartolome 2023-12-21 09:42:35 +0100
  • 192ea3dd29 Add .DS_Store files to .gitignore Alvaro Bartolome 2023-12-21 09:38:38 +0100
  • 7ca5bd7208 Fix conversion + inference errors. - Mistral (#176) Vaibhav Srivastav 2023-12-23 03:40:25 +0530
  • 0eaa323c10
    Fix conversion + inference errors. - Mistral (#176) Vaibhav Srivastav 2023-12-23 03:40:25 +0530
  • 325d1b3ba9 wire rope_theta throuugh to nn.RoPE Awni Hannun 2023-12-22 14:09:45 -0800
  • adbd188c57 Fix conversion + inference errors. Vaibhav Srivastav 2023-12-23 01:19:58 +0530
  • ce7815b931 feat: add mistral tps (#173) Todsaporn Banjerdkit 2023-12-22 22:55:57 +0700
  • 7ae445f6c7
    feat: add mistral tps (#173) Todsaporn Banjerdkit 2023-12-22 22:55:57 +0700
  • 31273bafbf eval params before timing + format Awni Hannun 2023-12-22 07:54:36 -0800
  • d7693b8f8a feat: add mistral tps katopz 2023-12-22 14:22:58 +0700
  • dc1a0ec67d stream conversion Awni Hannun 2023-12-21 15:25:51 -0800
  • e7b7c4fa92 fix typo (#169) Daniel Strobusch 2023-12-21 23:17:11 +0100
  • 188a91074b
    fix typo (#169) Daniel Strobusch 2023-12-21 23:17:11 +0100
  • e1d0c254d6 Quantize example (#162) Awni Hannun 2023-12-21 12:59:37 -0800
  • 3cf436b529
    Quantize example (#162) Awni Hannun 2023-12-21 12:59:37 -0800
  • 942a6ef620 qwen conversion Awni Hannun 2023-12-21 12:54:47 -0800
  • 9dbbd8755b mixtral Awni Hannun 2023-12-21 10:53:14 -0800
  • 029e1f1ca4
    fix typo Daniel Strobusch 2023-12-21 18:09:17 +0100
  • 17d0967617 Add support for byt5 models (#161) Juarez Bochi 2023-12-21 11:46:36 -0500
  • 4c9db80ed2
    Add support for byt5 models (#161) Juarez Bochi 2023-12-21 11:46:36 -0500
  • 9890ae6403
    Remove unused import Juarez Bochi 2023-12-21 10:08:03 -0500
  • c66eed8e9f phi2 quantized Awni Hannun 2023-12-20 21:43:54 -0800
  • c8abd7906d llama / mistral conversion in good shape Awni Hannun 2023-12-20 21:25:25 -0800
  • aced530649 args for quantization Awni Hannun 2023-12-20 21:08:03 -0800
  • 89db6ffdfe quantization in mistral / nits in llama Awni Hannun 2023-12-20 17:16:58 -0800
  • af4d82c93a one config processor Awni Hannun 2023-12-20 15:00:06 -0800
  • f6684df32a conversion + quantization working Awni Hannun 2023-12-20 14:48:30 -0800
  • 8b824fb768 testing quantization Awni Hannun 2023-12-20 14:21:40 -0800
  • 3914b149a5 update path to load weights (#164) Deven Mistry 2023-12-21 09:31:17 -0500
  • 6c574dbecf
    update path to load weights (#164) Deven Mistry 2023-12-21 09:31:17 -0500
  • be121ba181 updated results (#165) Sarthak Yadav 2023-12-21 15:30:17 +0100
  • 4addd02988
    updated results (#165) Sarthak Yadav 2023-12-21 15:30:17 +0100
  • 18f0a96cee 1. Add user warning for sequences over 2048 tokens in iterate_batches. (#166) wyanzhao 2023-12-21 06:29:31 -0800
  • 22620de3ee
    1. Add user warning for sequences over 2048 tokens in iterate_batches. (#166) wyanzhao 2023-12-21 06:29:31 -0800
  • 58f409feb0 rename --model_path to --model-path (#151) Daniel Strobusch 2023-12-21 15:28:57 +0100
  • 43b6522af2
    rename --model_path to --model-path (#151) Daniel Strobusch 2023-12-21 15:28:57 +0100
  • f93d0feb0b
    rename --model_path to --model-path Daniel Strobusch 2023-12-20 10:35:34 +0100
  • d8a4920e66 1. Add user warning for sequences over 2048 tokens in iterate_batches. wyanzhao 2023-12-20 23:35:16 -0800
  • 33e96625c8 updated results Sarthak Yadav 2023-12-21 08:07:41 +0100
  • 043220f0b7 update path to load weights deven367 2023-12-20 23:08:06 -0500
  • df44fc5008 fix typo in readme (#163) Deven Mistry 2023-12-20 22:47:41 -0500
  • 3efb1cc2cc
    fix typo in readme (#163) Deven Mistry 2023-12-20 22:47:41 -0500
  • 7c4d2ef4f4 fix typo in readme deven367 2023-12-20 22:43:04 -0500
  • 9f3a83df63
    Add support for byt5 models Juarez Bochi 2023-12-20 17:09:18 -0500
  • de2c1022e3 Removed debug message regarding orphaned weights Pawel Kowalski 2023-12-20 18:43:40 +0100
  • dac547367d Stable diffusion XL working; "add_embedding" layer not implemented Pawel Kowalski 2023-12-20 17:25:05 +0100
  • fe2291710f partial implementation of SD XL, incl. CLIP with projection, but doesn't produce good output Pawel Kowalski 2023-12-19 13:21:34 +0100
  • c63870d3cb partial implementation of SD XL, incl. CLIP with projection, but doesn't produce good output Pawel Kowalski 2023-12-19 13:21:07 +0100
  • e091fd2abc Reverting to a single model Pawel Kowalski 2023-12-14 09:57:55 +0100
  • b417c79673 moved the weight squeeze to map_unet_weights, style check Pawel Kowalski 2023-12-13 23:36:47 +0100
  • 49cbb24b49 Allow integer as attention_head_dim, reshape downloaded weights to match model if mismatch Pawel Kowalski 2023-12-12 22:23:52 +0100
  • 6c13c072b2 Use config.json in llama (#159) Pedro Cuenca 2023-12-20 19:34:44 +0100
  • ce30cc3d8f
    Use config.json in llama (#159) Pedro Cuenca 2023-12-20 19:34:44 +0100
  • 1120adab98 Merge remote-tracking branch 'upstream/main' into llama-config-json Pedro Cuenca 2023-12-20 19:23:38 +0100
  • b3f23a7191 Add llms subdir + update README (#145) Awni Hannun 2023-12-20 10:22:25 -0800
  • 27c0a8c002
    Add llms subdir + update README (#145) Awni Hannun 2023-12-20 10:22:25 -0800
  • 4dd4b670b3 Typo Pedro Cuenca 2023-12-20 19:21:37 +0100
  • 5c29379c86 Fix convert Pedro Cuenca 2023-12-20 19:20:59 +0100
  • 89280be600 Fix pop Pedro Cuenca 2023-12-20 19:20:09 +0100
  • 03824e3477 format Awni Hannun 2023-12-20 10:14:01 -0800
  • 7b5787e62c update readmes a bit Awni Hannun 2023-12-20 10:09:03 -0800
  • e4777e6698 use same pre-commit as mlx Awni Hannun 2023-12-19 09:30:32 -0800
  • 3b98fb3a0f Fix minor typos, add some annotations Siddharth Mishra-Sharma 2023-12-20 13:10:05 -0500
  • d2422f0911 nits Awni Hannun 2023-12-19 09:05:15 -0800
  • f2216965c1 add llms subdir + update README Awni Hannun 2023-12-19 09:03:26 -0800
  • 3c64f1a1dc Add config.json to Mixtral. (#158) Vaibhav Srivastav 2023-12-20 23:17:23 +0530
  • aed14618ca
    Add config.json to Mixtral. (#158) Vaibhav Srivastav 2023-12-20 23:17:23 +0530
  • 3ef1e9944a Use config.json in llama Pedro Cuenca 2023-12-20 18:09:28 +0100
  • b970e281ac
    Update mixtral/mixtral.py Vaibhav Srivastav 2023-12-20 22:37:53 +0530
  • f881821895 Add config.json to Mixtral. Vaibhav Srivastav 2023-12-20 22:32:23 +0530
  • 920bd13668 Use config.json, add model_type (#157) Pedro Cuenca 2023-12-20 17:39:37 +0100