Commit Graph

  • a6ae63226f Update README.md Seitaro Sugawara 2024-10-14 14:06:35 +0900
  • 68518a3194 FLUX: add generate images tqdm desc madroid 2024-10-13 21:47:33 +0800
  • aed4b007fc FLUX: ref dataset args madroid 2024-10-13 21:46:21 +0800
  • 7612c646f3 Fix PLaMo model to support Grouped Query Attention (#1037) Shunta Saito 2024-10-13 07:26:50 +0900
  • 537598afc6 nit Awni Hannun 2024-10-12 14:30:53 -0700
  • 55bcfbc6a5 Make llm async eval less brittle Awni Hannun 2024-10-12 14:24:16 -0700
  • 7a20389c06 FLUX: fix pre-commit lint madroid 2024-10-13 01:57:23 +0800
  • 082b27ffb2 FLUX: update README.md madroid 2024-10-13 01:51:40 +0800
  • b0de67ec03 FLUX: support huggingface dataset madroid 2024-10-13 01:38:58 +0800
  • ca88343118 FLUX: extract FluxPipeline from __init__.py to flux.py madroid 2024-10-13 01:17:37 +0800
  • e4d48333e4 Fix PLaMo model to support Grouped Query Attention Shunta Saito 2024-10-12 20:24:23 +0900
  • d8611dd69f Small typo fixed in flux README.md (#1035) Ivan Fioravanti 2024-10-12 15:14:01 +0200
  • 624a156d7b FLUX: add load_dataset to __ini__ file madroid 2024-10-12 20:04:40 +0800
  • b22611d2a9 FLUX: move dataset to single file madroid 2024-10-12 19:56:59 +0800
  • 1252536b4b FLUX: The dataset is adjusted to train.jsonl madroid 2024-10-12 19:37:34 +0800
  • a4a875773d FLUX: update README.md madroid 2024-10-12 18:58:55 +0800
  • b0e017a16c FLUX: split small functions madroid 2024-10-12 18:44:47 +0800
  • 7ffc089eb9 FLUX: update README.md madroid 2024-10-12 18:23:54 +0800
  • 84c5e6a512 Small typo fixed in flux README.md ivanfioravanti 2024-10-12 10:44:11 +0200
  • a5f2bab070 Add FLUX finetuning (#1028) Angelos Katharopoulos 2024-10-11 21:17:41 -0700
  • 2a8d483e0f Add the training data Angelos Katharopoulos 2024-10-11 21:14:27 -0700
  • 4bd0294598 Add better memory usage reporting Angelos Katharopoulos 2024-10-11 20:52:12 -0700
  • 7c8c5818f7 Add a README and requirements.txt Angelos Katharopoulos 2024-10-11 20:40:23 -0700
  • 00ba27fe6c adding debug statements Goekdeniz-Guelmez 2024-10-11 21:36:41 +0200
  • 6f88dd59d7 quick clean up and fix Goekdeniz-Guelmez 2024-10-11 21:08:13 +0200
  • 9c075a71f8 Merge branch 'adding-support-for-mamba2' of https://github.com/Goekdeniz-Guelmez/mlx-examples into adding-support-for-mamba2 Goekdeniz-Guelmez 2024-10-11 20:54:35 +0200
  • 4e1236cbf6 fixing loading the model Goekdeniz-Guelmez 2024-10-11 20:53:29 +0200
  • d72fdeb4ee MusicGen (#1020) Alex Barron 2024-10-11 10:16:20 -0700
  • 510088e304 readme Alex Barron 2024-10-11 10:12:46 -0700
  • 740bbb2433 fix readme Alex Barron 2024-10-11 10:01:07 -0700
  • 90779df767 add readme and requirements Alex Barron 2024-10-11 09:46:51 -0700
  • 76f5faba62 Add the notice Angelos Katharopoulos 2024-10-10 22:12:38 -0700
  • f7749ab043 Address comments and add some image augmentation Angelos Katharopoulos 2024-10-10 17:58:01 -0700
  • 8c3b25f88c Fix image cropping Angelos Katharopoulos 2024-10-10 03:17:23 -0700
  • fc88e3b0d0 Fix gradient accumulation averaging Angelos Katharopoulos 2024-10-10 02:45:26 -0700
  • 3c587ed618 Fixes and default args adjustment Angelos Katharopoulos 2024-10-10 02:21:08 -0700
  • 807bd66b80 Enable generation with a trained adapter Angelos Katharopoulos 2024-10-10 01:34:50 -0700
  • ecd8828e33 Further refactoring Angelos Katharopoulos 2024-10-10 00:35:44 -0700
  • f538394eec Cleanup the dreambooth Angelos Katharopoulos 2024-10-09 23:26:39 -0700
  • 446d8b6439 Revert SD dreambooth Angelos Katharopoulos 2024-10-09 22:43:36 -0700
  • b54218ea08 General updates Angelos Katharopoulos 2024-10-09 22:29:29 -0700
  • 19dc28f08a Fix time schedule Angelos Katharopoulos 2024-10-09 11:55:39 -0700
  • 27aaff8f31 Finetune all layers Angelos Katharopoulos 2024-10-08 16:31:33 -0700
  • d9c5fd5ba4 Add the raw option to txt2image Angelos Katharopoulos 2024-10-08 04:09:38 -0700
  • bb8436a441 Update dataset Angelos Katharopoulos 2024-10-08 01:09:24 -0700
  • f2ccad52f4 Add lr schedule Angelos Katharopoulos 2024-10-04 18:09:01 -0700
  • e7751e4c29 Add gradient accumulation and data parallelism Angelos Katharopoulos 2024-10-03 18:03:45 -0700
  • 7cffcdcaff Flux lora training Angelos Katharopoulos 2024-10-03 11:35:56 -0700
  • 9eef46e645 Refactor the pipeline Angelos Katharopoulos 2024-10-02 14:34:45 -0700
  • aefe60e79d Avoid upcasting and fix batch size > 1 Angelos Katharopoulos 2024-09-28 01:41:56 -0700
  • 070c58ed92 Bugfix in t5 rpos and initial generation example Angelos Katharopoulos 2024-09-28 01:09:59 -0700
  • 88603f0330 Add the tokenizers Angelos Katharopoulos 2024-09-27 15:08:41 -0700
  • 63932d777c Working clip, t5 and flux model Angelos Katharopoulos 2024-09-27 13:37:02 -0700
  • ed17f815f5 Flux implementation in examples Angelos Katharopoulos 2024-09-25 00:58:30 -0700
  • f61f4b5cf1 Start a stable diffusion dreambooth example Angelos Katharopoulos 2024-09-20 14:46:34 -0700
  • 77301a2973 nits Awni Hannun 2024-10-09 16:55:52 -0700
  • c9e9c75c66 add can_trim_prompt_cache Awni Hannun 2024-10-09 13:02:02 -0700
  • acec71b474 nit Awni Hannun 2024-10-09 10:34:59 -0700
  • 9435dfa3cd tokenizer updates + tests Awni Hannun 2024-10-09 10:32:43 -0700
  • 4360e7ccec clear cache during prompt processing (#1027) Awni Hannun 2024-10-09 16:48:32 -0700
  • 1ac75759ee clear cache during prompt processing Awni Hannun 2024-10-09 14:04:42 -0700
  • 9ee726cf18 Merge branch 'main' into feat/batch_generate L Lllvvuu 2024-10-09 15:19:22 -0400
  • 8fb82fee43 Merge branch 'main' into feat/batch_generate L Lllvvuu 2024-10-09 15:03:30 -0400
  • b7373cb44f fix long prompt generations (#1023) Awni Hannun 2024-10-09 11:09:36 -0700
  • 6a37b5106a symlinks Alex Barron 2024-10-09 09:51:16 -0700
  • 3ddd7e9923 fix long prompt generations Awni Hannun 2024-10-09 09:07:38 -0700
  • 4d2ee67402 change to from_pretrained Alex Barron 2024-10-08 15:46:04 -0700
  • 9432f1a643 add benchmarks Alex Barron 2024-10-08 14:54:09 -0700
  • d301705a2f Add MusicGen model Alex Barron 2024-10-03 15:03:32 -0700
  • fca087be49 More cache improvements (#1015) Awni Hannun 2024-10-07 20:45:51 -0700
  • 16c6f6bd88 fix default model Awni Hannun 2024-10-07 20:05:54 -0700
  • fbff8e2fd5 fixes + trim_prompt_cache api Awni Hannun 2024-10-07 16:50:03 -0700
  • 7a3d0dd459 Define meta_state on all Cache implementations Angelos Katharopoulos 2024-10-07 15:46:48 -0700
  • f6ff4f28b4 comments + docs Awni Hannun 2024-10-07 13:16:58 -0700
  • 52ffc2f477 chat command Awni Hannun 2024-10-06 21:24:46 -0700
  • c7508270c3 return logprobs farris 2024-10-05 20:59:18 -0700
  • daf79f37cf docs Awni Hannun 2024-10-05 18:34:50 -0700
  • 4dc3cc0300 fix tests Awni Hannun 2024-10-05 15:32:07 -0700
  • 62dbd418d9 fix tests Awni Hannun 2024-10-05 15:24:16 -0700
  • 60c9794618 fix tests Awni Hannun 2024-10-05 15:19:25 -0700
  • 5f52882e32 nit in chat Awni Hannun 2024-10-05 14:53:11 -0700
  • 782f5a71b7 reorg + fixes to caching, unify prompt caching across types and use cases for e.g. caching during a chat Awni Hannun 2024-10-05 14:49:39 -0700
  • 52d6ca0ad0 Merge branch 'ml-explore:main' into adding-support-for-mamba2 Gökdeniz Gülmez 2024-10-04 22:25:31 +0200
  • ed060a7c5c fix rotating kv cache for chat use case Awni Hannun 2024-10-04 07:43:13 -0700
  • 266f99a1e7 fix doc to match arg rename Anthony Wu 2024-10-03 15:53:13 -1000
  • 7b818c0d39 fix bugs, add test coverage Anthony Wu 2024-10-03 15:45:48 -1000
  • b6435dc9cc refactored to stdin - arg, and output-name template Anthony Wu 2024-10-03 15:03:31 -1000
  • 0db4543859 Make T5 work with official models without conversions Cheng 2024-10-04 08:33:09 +0900
  • bb5d7db5d7 add support for audio and input name from stdin Anthony Wu 2024-10-03 00:34:47 -1000
  • 9bc53fc210 convert (#1006) Awni Hannun 2024-10-02 13:13:33 -0700
  • 36c1d8e8dc Server: support function calling (#1003) madroid 2024-10-03 03:36:07 +0800
  • 264ba43707 update trainer/lora.py and adding DepthWiseConv1d because mlx 0.18.0 doesnt axepts groups parameter Goekdeniz-Guelmez 2024-10-02 19:19:32 +0200
  • 409ddc427e updating ACKNOWLEDGMENTS.md file Goekdeniz-Guelmez 2024-10-02 18:21:25 +0200
  • 49b9fc1a4c Create mamba2.py Gökdeniz Gülmez 2024-10-02 12:48:15 +0200
  • bfc60460d4 mlx_lm.server --model mlx-community/Qwen2.5-7B-Instruct-8bit --trust-remote-code --port 8722 chenguangjian.jk 2024-10-02 13:08:46 +0800
  • c4e0f04b90 mlx_lm.server --model mlx-community/Qwen2.5-14B-Instruct-8bit --trust-remote-code --port 8722 chenguangjian.jk 2024-10-02 04:20:20 +0800
  • b49d2add61 convert Awni Hannun 2024-10-01 06:29:39 -0700
  • c7dd1641bd Merge branch 'ml-explore:main' into main chenguangjian 2024-10-01 01:36:38 +0800
  • 0866e23a67 repetiton_penalty and logits_bias just using logits_processors (#1004) nathan 2024-09-30 17:49:03 +0200
  • 1209d4357d nits Awni Hannun 2024-09-30 08:31:09 -0700