Awni Hannun
6731254e76
Use fast rope (#945)
* use fast rope
* fix llama
* use fast rope for llama3.1
* requires unreleased mlx
* fix su
* fix deepseek v2
* only one of base or freqs
* nit
* fix
* hard code freqs
2024-08-23 13:18:51 -07:00
dmdaksh
7d7e236061
- Removed unused Python imports (#683)
- bert/model.py:10: tree_unflatten
- bert/model.py:2: dataclass
- bert/model.py:8: numpy
- cifar/resnet.py:6: Any
- clip/model.py:15: tree_flatten
- clip/model.py:9: Union
- gcn/main.py:8: download_cora
- gcn/main.py:9: cross_entropy
- llms/gguf_llm/models.py:12: tree_flatten, tree_unflatten
- llms/gguf_llm/models.py:9: numpy
- llms/mixtral/mixtral.py:12: tree_map
- llms/mlx_lm/models/dbrx.py:2: Dict, Union
- llms/mlx_lm/tuner/trainer.py:5: partial
- llms/speculative_decoding/decoder.py:1: dataclass, field
- llms/speculative_decoding/decoder.py:2: Optional
- llms/speculative_decoding/decoder.py:5: mlx.nn
- llms/speculative_decoding/decoder.py:6: numpy
- llms/speculative_decoding/main.py:2: glob
- llms/speculative_decoding/main.py:3: json
- llms/speculative_decoding/main.py:5: Path
- llms/speculative_decoding/main.py:8: mlx.nn
- llms/speculative_decoding/model.py:6: tree_unflatten
- llms/speculative_decoding/model.py:7: AutoTokenizer
- llms/tests/test_lora.py:13: yaml_loader
- lora/lora.py:14: tree_unflatten
- lora/models.py:11: numpy
- lora/models.py:3: glob
- speechcommands/kwt.py:1: Any
- speechcommands/main.py:7: mlx.data
- stable_diffusion/stable_diffusion/model_io.py:4: partial
- whisper/benchmark.py:5: sys
- whisper/test.py:5: subprocess
- whisper/whisper/audio.py:6: Optional
- whisper/whisper/decoding.py:8: mlx.nn
2024-04-16 07:50:32 -07:00
Angelos Katharopoulos
eff6690952
Fix CFG for SDXL (#667)
2024-04-09 06:06:41 -07:00
devonthomas35
fe5edee360
Fix image2image for SDXL (#563)
---------
Co-authored-by: Angelos Katharopoulos <katharas@gmail.com>
2024-03-11 12:18:47 -07:00
zweifisch
d0fa6cfcae
feat: stable-diffusion t2i add --seed (#558)
2024-03-10 06:12:54 -07:00
Angelos Katharopoulos
3a9e6c3f70
Stable diffusion XL (#516)
2024-03-08 10:24:19 -08:00
Awni Hannun
06ddb8414d
Fix Qwen2 and SD (#441)
* fix qwen2
* version bump
* fix list shape
2024-02-14 13:43:12 -08:00
Nripesh Niketan
f1ef378a58
Feat: update pre-commit rev (#432)
2024-02-11 07:23:27 -08:00
Awni Hannun
ec14583c2a
work with tuple shape (#393)
2024-02-01 13:03:47 -08:00
AtomicVar
2ba5d3db14
Refactor activation function and loss calculation (#325)
2024-01-16 13:42:56 -08:00
Awni Hannun
a5d6d0436c
Support Hugging Face models (#215)
* support hf direct models
2024-01-03 15:13:26 -08:00
Angelos Katharopoulos
37fd2464dc
Add an image2image example in the stable diffusion (#198)
2023-12-28 18:31:45 -08:00
Awni Hannun
27c0a8c002
Add llms subdir + update README (#145)
* add llms subdir + update README
* nits
* use same pre-commit as mlx
* update readmes a bit
* format
2023-12-20 10:22:25 -08:00
Pawel Kowalski
fc1495abaa
Stable diffusion - check model weights shape and support int for "attention_head_dim" (#85)
* Allow integer as attention_head_dim
* Reshape downloaded weights to match model if there is a mismatch
2023-12-15 13:01:02 -08:00
Awni Hannun
a99e9d551e
hf correction
2023-12-12 17:08:04 -08:00
Robert McCraith
4ed942d7f5
fix: typo in variable name
2023-12-07 21:30:04 +00:00
Awni Hannun
1900564f59
format
2023-11-30 11:52:47 -08:00
Awni Hannun
31bc57c4ff
add copyright in source
2023-11-30 11:08:53 -08:00
Angelos Katharopoulos
b364cc56cd
Add the Llama and Stable Diffusion examples
2023-11-29 10:38:20 -08:00