.. |
examples
|
Configurable LR schedulers (#604)
|
2024-03-29 13:41:10 -07:00 |
models
|
Quantize embedding / Update quantize API (#680)
|
2024-04-18 18:16:10 -07:00 |
tuner
|
fix dequantization (#693)
|
2024-04-19 10:46:59 -07:00 |
__init__.py
|
Fix import warning (#479)
|
2024-02-27 08:47:56 -08:00 |
convert.py
|
Create executables for generate, lora, server, merge, convert (#682)
|
2024-04-16 16:08:49 -07:00 |
fuse.py
|
Save lora config (#636)
|
2024-04-02 13:52:53 -07:00 |
generate.py
|
Create executables for generate, lora, server, merge, convert (#682)
|
2024-04-16 16:08:49 -07:00 |
gguf.py
|
fix(mlx-lm): type hints in gguf.py (#621)
|
2024-03-26 07:56:01 -07:00 |
LORA.md
|
Create executables for generate, lora, server, merge, convert (#682)
|
2024-04-16 16:08:49 -07:00 |
lora.py
|
Create executables for generate, lora, server, merge, convert (#682)
|
2024-04-16 16:08:49 -07:00 |
MERGE.md
|
Create executables for generate, lora, server, merge, convert (#682)
|
2024-04-16 16:08:49 -07:00 |
merge.py
|
Create executables for generate, lora, server, merge, convert (#682)
|
2024-04-16 16:08:49 -07:00 |
py.typed
|
Add py.typed to support PEP-561 (type-hinting) (#389)
|
2024-01-30 21:17:38 -08:00 |
README.md
|
feat: move lora into mlx-lm (#337)
|
2024-01-23 08:44:37 -08:00 |
requirements.txt
|
Quantize embedding / Update quantize API (#680)
|
2024-04-18 18:16:10 -07:00 |
sample_utils.py
|
Use async eval (#670)
|
2024-04-11 13:18:23 -07:00 |
SERVER.md
|
Create executables for generate, lora, server, merge, convert (#682)
|
2024-04-16 16:08:49 -07:00 |
server.py
|
use logging in mlx server (#705)
|
2024-04-22 07:50:06 -07:00 |
tokenizer_utils.py
|
Quantize embedding / Update quantize API (#680)
|
2024-04-18 18:16:10 -07:00 |
UPLOAD.md
|
Mlx llm package (#301)
|
2024-01-12 10:25:56 -08:00 |
utils.py
|
Add support for logit bias (#697)
|
2024-04-21 06:53:56 -07:00 |
version.py
|
Quantize embedding / Update quantize API (#680)
|
2024-04-18 18:16:10 -07:00 |