Commit Graph

  • 2384a5696a up. Vaibhav Srivastav 2024-01-08 13:25:24 +0530
  • 9affcbdeb8 Add HF Hub upload option. Vaibhav Srivastav 2024-01-08 13:19:09 +0530
  • ddf3f09f6e refactor: make the phi2 example can be directly load the model from hf without convert needed anchen 2024-01-07 22:03:30 -0800
  • 0f6ef682fd Update README.md (#248) Nino Risteski 2024-01-08 05:13:58 +0100
  • 9742ad0f51 Update README.md (#248) Nino Risteski 2024-01-08 05:13:58 +0100
  • 0644926afb quantize linear (#250) Awni Hannun 2024-01-07 18:48:59 -0800
  • 485fb9ac0f quantize linear (#250) Awni Hannun 2024-01-07 18:48:59 -0800
  • 649add3451 Merge branch 'ml-explore:main' into main Chime Ogbuji 2024-01-07 17:06:42 -0500
  • bde70c9e49 Update README.md (#251) Ikko Eltociear Ashimine 2024-01-08 04:35:39 +0900
  • 737b4c81a3 Update README.md (#251) Ikko Eltociear Ashimine 2024-01-08 04:35:39 +0900
  • 774449a128 Update README.md Ikko Eltociear Ashimine 2024-01-08 04:34:52 +0900
  • 94d7bd2ac3 quantize linear Awni Hannun 2024-01-07 11:31:12 -0800
  • 052d9c3bd8 Update README.md Nino Risteski 2024-01-07 20:08:56 +0100
  • 3d0152f4f6 [Whisper] Add word timestamps and confidence scores (#201) bofeng huang 2024-01-07 19:01:29 +0100
  • bf9926489e [Whisper] Add word timestamps and confidence scores (#201) bofeng huang 2024-01-07 19:01:29 +0100
  • d7f5bd4751 nit Awni Hannun 2024-01-07 10:00:50 -0800
  • 05f4c3e5b0 format + readme Awni Hannun 2024-01-07 09:57:20 -0800
  • a90c60ab8d Fix typo in lora convert.py (#245) mc0ps 2024-01-07 06:30:30 -0500
  • 25ebd36112 Fix typo in lora convert.py (#245) mc0ps 2024-01-07 06:30:30 -0500
  • 1a0514c6c6 added --color argument Leon Ericsson 2024-01-07 12:02:00 +0100
  • 692dd6c27c draft token coloring Leon Ericsson 2024-01-07 11:58:37 +0100
  • c8c2be0a77 fix typo in args.torch_path mc0ps 2024-01-07 02:56:48 -0500
  • 2bb89da7b7 Merge branch 'ml-explore:main' into main Chime Ogbuji 2024-01-06 17:21:40 -0500
  • 37c5555616 nits Leon Ericsson 2024-01-06 23:16:17 +0100
  • 59d8feb424 Update README.md (#243) Nino Risteski 2024-01-06 20:44:49 +0100
  • b152d12d7b Update README.md (#243) Nino Risteski 2024-01-06 20:44:49 +0100
  • 5a458f0c85 Update README.md Nino Risteski 2024-01-06 20:30:12 +0100
  • 32a7449c36 Update CODE_OF_CONDUCT.md Nino Risteski 2024-01-06 20:27:56 +0100
  • bfbcb5d55b Add test for word-level timestamps and confidence scores bofenghuang 2024-01-06 18:59:53 +0100
  • 9cdf6388c7 Cast qk to fp32 bofenghuang 2024-01-06 18:59:10 +0100
  • e0a81d9283 Merge branch 'main' into main Chime Ogbuji 2024-01-06 12:52:00 -0500
  • 57111100a2 Save alignment_heads bofenghuang 2024-01-06 17:47:46 +0100
  • 441494b11a Move multiple ops from np to mlx, clean comments bofenghuang 2024-01-06 17:27:06 +0100
  • 5512f8e6f0 Create a separate forward_with_cross_qk function bofenghuang 2024-01-06 17:25:41 +0100
  • a9641628de Add word timestamps and confidence scores bofenghuang 2023-12-28 20:01:10 +0100
  • 396cf5654a refactor: merge deepseek coder example into hf_llm example (#234) Anchen 2024-01-06 07:53:46 -0800
  • 758f05c09a refactor: merge deepseek coder example into hf_llm example (#234) Anchen 2024-01-06 07:53:46 -0800
  • 7baf6618b5 Rename requirements.txt for windows file name support Jonas Örnfelt 2024-01-06 16:38:21 +0100
  • 947e6c2281 chore: fix lint anchen 2024-01-06 07:27:43 -0800
  • a30273f606 Update llms/hf_llm/models.py Anchen 2024-01-07 02:24:54 +1100
  • 568c9739a7 update README Leon Ericsson 2024-01-06 14:20:15 +0100
  • 79dfdd6c31 fix to generate_draft, --color removed for now Leon Ericsson 2024-01-06 14:07:24 +0100
  • 35dcab90ef Merge branch 'chore/deepseek' of github.com:mzbac/mlx-examples into chore/deepseek anchen 2024-01-05 23:41:45 -0800
  • fa6ff4e517 chore: remove default rope_scaling dict and use get to access type and factor to avoid key error anchen 2024-01-05 23:39:49 -0800
  • a95e8f1587 force fp16 for quantized models (#240) Awni Hannun 2024-01-05 21:29:15 -0800
  • cf0ad26a89 force fp16 for quantized models (#240) Awni Hannun 2024-01-05 21:29:15 -0800
  • 4f2483edc7 force fp16 for quantized models Awni Hannun 2024-01-05 21:21:09 -0800
  • 91017ac223 add numpy as a requirement to run lora.py (#238) Lawrence Wu 2024-01-05 16:16:28 -0800
  • 37856f70a8 add numpy as a requirement to run lora.py (#238) Lawrence Wu 2024-01-05 16:16:28 -0800
  • aa79e8ec3c removed unused imports Lawrence Wu 2024-01-05 15:59:33 -0800
  • 521ff0ba4b add numpy as a requirement to run lora.py Lawrence Wu 2024-01-05 15:53:42 -0800
  • fbf3956cdc merge lookup_decoding under speculative example Leon Ericsson 2024-01-05 22:46:03 +0100
  • d67aa16661 Updates from pre-commit run --all-files Chime Ogbuji 2024-01-05 15:13:34 -0500
  • 119cadbd09 Minor fixes Fixed import path, iteration calculation, and creation of configuration namespace from YAML. Chime Ogbuji 2024-01-05 12:22:00 -0500
  • 0dbc78ca65 Additional comment about YAML file format Chime Ogbuji 2024-01-05 11:59:37 -0500
  • 4f7023ce19 Initial commit of configurable supervized lora training framework. Will need to sync with #213 when merged into main Chime Ogbuji 2024-01-05 11:55:14 -0500
  • cab36602e9 chore: fix format in readme Anchen 2024-01-05 23:59:45 +1100
  • c270881a85 remove deepseek example anchen 2024-01-05 03:18:42 -0800
  • 632391284d refactor: merge deepseek coder example into hf_llm example anchen 2024-01-05 02:12:14 -0800
  • 6b71e18a0a Qlora (#219) Awni Hannun 2024-01-04 21:05:59 -0800
  • 37b41cec60 Qlora (#219) Awni Hannun 2024-01-04 21:05:59 -0800
  • f4ccff9a89 Handle receiving 0 tokens gracefully (#231) Christian Bieniak 2024-01-05 14:14:13 +1100
  • 4fa659acbd Handle receiving 0 tokens gracefully (#231) Christian Bieniak 2024-01-05 14:14:13 +1100
  • 53ee382cf8 Move no token check to statistics section Christian Bieniak 2024-01-05 14:01:18 +1100
  • 192782b27c Formatting Christian Bieniak 2024-01-05 13:23:26 +1100
  • a1a873109a handle 0 tokens gracefully Christian Bieniak 2024-01-05 13:20:42 +1100
  • 972c83a49d one more fix Awni Hannun 2024-01-04 16:34:50 -0800
  • 8b960fc3e6 fix loading adapters Awni Hannun 2024-01-04 11:54:49 -0800
  • 0c140c9019 comments Awni Hannun 2024-01-03 16:37:10 -0800
  • 7102738f46 include hf in readme Awni Hannun 2024-01-03 14:30:36 -0800
  • d8c711cfdd more helpful error message Awni Hannun 2024-01-03 14:27:26 -0800
  • b693e1b18d typo Awni Hannun 2024-01-03 13:53:04 -0800
  • d58f68ab1b section on quantizing for memory reduction Awni Hannun 2024-01-03 13:50:42 -0800
  • 139fbf39bc update main readme Awni Hannun 2024-01-03 13:45:46 -0800
  • e81cab43e4 updates on 0.0.7 Awni Hannun 2024-01-03 13:38:45 -0800
  • 837fcc2097 start of qlora Awni Hannun 2023-12-22 13:52:09 -0800
  • 497adf5aba Update README.md to fix --hf-model param call. (#229) Andy Peatling 2024-01-04 11:53:51 -0800
  • 12c9bafbf5 Update README.md to fix --hf-model param call. (#229) Andy Peatling 2024-01-04 11:53:51 -0800
  • 39feb176c8 Update README.md to fix --hf-model param call. Andy Peatling 2024-01-04 11:26:37 -0800
  • 7e7b817ddf fix to use actual prompt (#227) Awni Hannun 2024-01-04 11:12:05 -0800
  • e14afb3e77 fix to use actual prompt (#227) Awni Hannun 2024-01-04 11:12:05 -0800
  • caf314e703 fix to use actual prompt Awni Hannun 2024-01-04 11:07:36 -0800
  • a9ee883bf7 Fix upload to hub for HF LLMs conversion script. (#221) Vaibhav Srivastav 2024-01-04 19:36:05 +0530
  • f95cf30a31 Fix upload to hub for HF LLMs conversion script. (#221) Vaibhav Srivastav 2024-01-04 19:36:05 +0530
  • b2703f5406 reverting last commit. Vaibhav Srivastav 2024-01-04 15:55:48 +0530
  • 24d74842a6 Weights -> model. Vaibhav Srivastav 2024-01-04 15:08:24 +0530
  • af957bff3f Fix upload to hub snippet. Vaibhav Srivastav 2024-01-04 13:34:56 +0530
  • fc9dde34c9 Support Hugging Face models (#215) Awni Hannun 2024-01-03 15:13:26 -0800
  • a5d6d0436c Support Hugging Face models (#215) Awni Hannun 2024-01-03 15:13:26 -0800
  • f0aaab7d91 comment Awni Hannun 2024-01-03 15:01:02 -0800
  • 99581115a0 format with later version of black Awni Hannun 2024-01-03 14:59:45 -0800
  • d097652adc model card Awni Hannun 2024-01-03 13:59:28 -0800
  • 918973decf nit Awni Hannun 2024-01-03 12:13:20 -0800
  • 76c8bdbb2a readme nits Awni Hannun 2024-01-03 11:58:27 -0800
  • 7b3573113c nits Awni Hannun 2024-01-03 11:54:02 -0800
  • f124d627f4 hugs Awni Hannun 2024-01-03 11:52:15 -0800
  • 2995a67486 comments + readme updates Awni Hannun 2024-01-03 11:50:34 -0800
  • 384ad5792e address comments Awni Hannun 2024-01-03 11:33:41 -0800
  • 9bb3b4bd77 upload to hub Awni Hannun 2024-01-03 09:42:43 -0800
  • 1d80a686f8 work with hf repos Awni Hannun 2024-01-03 09:35:31 -0800