Pedro Cuenca
|
b7f742ef56
|
Mixed quant recipes (#1300)
* Mixed 3/6 and 2/6 recipes based on Alex Barron's
* format / nits
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
|
2025-02-26 11:32:36 -08:00 |
|
Awni Hannun
|
66e7bcb886
|
override dtype with quant (#1062)
|
2024-10-22 09:56:45 -07:00 |
|
Phúc H. Lê Khắc
|
35206806ac
|
Create executables for generate, lora, server, merge, convert (#682)
* feat: create executables mlx_lm.<cmd>
* nits in docs
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2024-04-16 16:08:49 -07:00 |
|
Alwin Arrasyid
|
6c3d4c8ba2
|
add dequantize option to mlx_lm/convert.py (#547)
|
2024-03-19 19:50:08 -07:00 |
|
Awni Hannun
|
95f82e67a2
|
Fix import warning (#479)
* fix import warning
* fix version import
* remove api, move convert to utils
* also update circle to run external PRs
|
2024-02-27 08:47:56 -08:00 |
|
Angelos Katharopoulos
|
dc4f2e0a6b
|
Lazy loading models for faster convert and merge (#462)
|
2024-02-20 13:36:55 -08:00 |
|
Anchen
|
362e88a744
|
feat: move lora into mlx-lm (#337)
* feat: Add lora and qlora training to mlx-lm
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2024-01-23 08:44:37 -08:00 |
|
Anchen
|
527cea4027
|
chore: fix the convert.py script for weights are not sanitized and support quant for non-32 dimensions (#340)
* chore: fix convert script for weights not sanitized and support quant for non 32 dim
* Update llms/mlx_lm/utils.py
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
* chore: fix typo
---------
Co-authored-by: Awni Hannun <awni.hannun@gmail.com>
|
2024-01-19 21:07:21 -08:00 |
|
Anchen
|
195bec2fa3
|
feat(mlx_lm): add mixtral support in mlx_lm (#318)
* feat: add mixtral support in mlx_lm
* chore: update doc
|
2024-01-15 07:18:14 -08:00 |
|
Pedro Cuenca
|
ef93979973
|
Update model card uploaded with converted models (#309)
|
2024-01-12 13:03:52 -08:00 |
|
Awni Hannun
|
c6440416a2
|
Mlx llm package (#301)
* fix converter
* add recursive files
* remove gitignore
* remove gitignore
* add packages properly
* read me update
* remove dup readme
* relative
* fix convert
* fix community name
* fix url
* version
|
2024-01-12 10:25:56 -08:00 |
|