Pawel Kowalski
|
4c88163941
|
Stable diffusion - check model weights shape and support int for "attention_head_dim" (#85)
* Allow integer as attention_head_dim
* Reshape downloaded weights to match model if there is a mismatch
|
2023-12-15 13:01:02 -08:00 |
|
Awni Hannun
|
fc13e96e6c
|
Merge pull request #116 from idoru/fix-phi-2-temp-arg
phi-2: fix --temp/--seed arguments.
|
2023-12-15 12:29:19 -08:00 |
|
Sam Coward
|
a5bacfd04f
|
Pass along temp argument to generate()
|
2023-12-15 15:16:41 -05:00 |
|
Awni Hannun
|
7d4a41ace8
|
Merge pull request #112 from ml-explore/fix_mixtral
[Bugfix] Fix RoPE base bug in Mixtral example
|
2023-12-15 08:39:02 -08:00 |
|
Awni Hannun
|
737db11152
|
Merge pull request #108 from devonthomas35/phi2_eos
Phi-2: Stop generating at eos token
|
2023-12-15 07:34:11 -08:00 |
|
Awni Hannun
|
001b5803ce
|
fix RoPE bug + minor updates
|
2023-12-14 21:45:25 -08:00 |
|
devonthomas35
|
f6ac70c736
|
Refactor EOS check
|
2023-12-14 21:11:23 -08:00 |
|
Awni Hannun
|
12a5597ac3
|
Merge pull request #107 from ml-explore/hf_mixtral
Use official HF for mixtral
|
2023-12-14 16:57:19 -08:00 |
|
Awni Hannun
|
7cf66dc88c
|
format
|
2023-12-14 16:56:50 -08:00 |
|
devonthomas35
|
7f992db5bc
|
Remove unnecessary return
|
2023-12-14 15:52:22 -08:00 |
|
devonthomas35
|
8d496ba61a
|
Stop generating at eos token
|
2023-12-14 15:50:59 -08:00 |
|
Awni Hannun
|
6249f46215
|
incude instruct option
|
2023-12-14 15:40:38 -08:00 |
|
Awni Hannun
|
449f7a694b
|
use official HF for mixtral
|
2023-12-14 15:30:32 -08:00 |
|
Awni Hannun
|
95a1d50318
|
Merge pull request #106 from fahnub/main
minor dependency fix in phi-2
|
2023-12-14 14:15:19 -08:00 |
|
Fahad Nadeem
|
330e8e8bc9
|
minor dep fix in phi
|
2023-12-15 03:09:33 +05:00 |
|
Awni Hannun
|
53e58795c2
|
Merge pull request #77 from SarthakYadav/main
Added CIFAR-10 + ResNet example
|
2023-12-14 12:19:40 -08:00 |
|
Awni Hannun
|
e12e4d5825
|
typo / nits
|
2023-12-14 12:14:01 -08:00 |
|
Awni Hannun
|
5673716daa
|
updates + format
|
2023-12-14 12:09:10 -08:00 |
|
Awni Hannun
|
4cac181917
|
Merge pull request #103 from arpitingle/patch-1
added phi in readme
|
2023-12-14 10:19:40 -08:00 |
|
arpit
|
541265b74d
|
Update README.md
|
2023-12-14 23:40:50 +05:30 |
|
Awni Hannun
|
f4745d8576
|
Merge pull request #97 from jbarrow/main
Phi-2
|
2023-12-14 09:21:26 -08:00 |
|
Awni Hannun
|
fa9e34b041
|
cleanup conversion to use single qkv matrix
|
2023-12-14 09:19:44 -08:00 |
|
Awni Hannun
|
45c1800fc6
|
update readme
|
2023-12-14 08:37:34 -08:00 |
|
Awni Hannun
|
c2eb435697
|
change file name for consistency, update readme.
|
2023-12-14 08:34:24 -08:00 |
|
Awni Hannun
|
5822639f23
|
don't drop last tokens
|
2023-12-14 08:27:44 -08:00 |
|
Awni Hannun
|
c26eafc125
|
fix args, update README, remove extra files
|
2023-12-14 08:18:01 -08:00 |
|
Awni Hannun
|
05c82ddf5f
|
fix fp16 + nits
|
2023-12-14 08:08:28 -08:00 |
|
Sarthak Yadav
|
879a576fb6
|
updated header
|
2023-12-14 16:28:00 +01:00 |
|
Awni Hannun
|
bb44222a86
|
Merge pull request #98 from finnless/patch-1
Fix typo in stable_diffusion README
|
2023-12-14 07:13:19 -08:00 |
|
Awni Hannun
|
a2aadb24bd
|
Merge pull request #102 from burakbudanur/main
Corrected the typo in 'ffn_dim_multiplier' in and added 'rope_theta' …
|
2023-12-14 07:12:20 -08:00 |
|
Burak Budanur
|
f603d53bef
|
Corrected the typo in 'ffn_dim_multiplier' in and added 'rope_theta' to the list unused. Without these, llama examples did not run.
|
2023-12-14 14:02:11 +01:00 |
|
Sarthak Yadav
|
a3c0343b31
|
simplified ResNet, expanded README with throughput and performance
|
2023-12-14 09:05:04 +01:00 |
|
Awni Hannun
|
0301cbd88b
|
add cache + generation, clean up some stuff
|
2023-12-13 22:26:33 -08:00 |
|
Nolan
|
d526c19680
|
Fix typo in stable_diffusion README
|
2023-12-13 20:51:39 -08:00 |
|
Joe Barrow
|
1fe230910b
|
phi-2 draft
|
2023-12-13 22:23:38 -05:00 |
|
Awni Hannun
|
f6e24ea7aa
|
Merge pull request #96 from Stv-X/typo-fix
Typo fix in whisper/README
|
2023-12-13 16:28:03 -08:00 |
|
Stv.X
|
7bd67985e9
|
Corrected spelling of terms in whisper/README.md
|
2023-12-14 08:15:26 +08:00 |
|
Awni Hannun
|
5e8f8386ed
|
Merge pull request #51 from jbarrow/main
Update BERT to take advantage of bias param in MultiHeadAttention
|
2023-12-13 15:20:29 -08:00 |
|
Joe Barrow
|
bab982fced
|
Update to mlx>=0.0.5
|
2023-12-13 17:48:07 -05:00 |
|
Awni Hannun
|
0f37169f17
|
Merge pull request #94 from jbax3/patch-1
Update README.md to fix git-lfs command
|
2023-12-13 14:19:14 -08:00 |
|
jbax3
|
791075f91f
|
Update README.md to fix git-lfs command
|
2023-12-13 15:51:27 -06:00 |
|
Awni Hannun
|
15fd2981bc
|
Merge pull request #93 from jbochi/patch-1
Fix convert.py instructions for Bert model
|
2023-12-13 08:47:52 -08:00 |
|
Juarez Bochi
|
37a27c5500
|
Fix convert.py instructions for Bert model
It just adds the missing backslash.
|
2023-12-13 11:37:02 -05:00 |
|
Awni Hannun
|
2f2d5d6c1d
|
Merge pull request #90 from bofenghuang/fix-fp16
Fix whisper fp16 inference
|
2023-12-13 07:29:10 -08:00 |
|
Awni Hannun
|
4c5b9cf8a8
|
Merge pull request #88 from dastrobu/meta-form-url
fix "request access" form url for Llama models
|
2023-12-13 07:20:51 -08:00 |
|
bofenghuang
|
7871f2a0bf
|
Fix fp16
|
2023-12-13 11:07:47 +01:00 |
|
Daniel Strobusch
|
ba4cb95d8e
|
fix "request access" form url for Llama models
|
2023-12-13 10:19:29 +01:00 |
|
Awni Hannun
|
f599546337
|
Merge pull request #76 from bofenghuang/add-whisper-large-v3
Add whisper-large-v3
|
2023-12-12 20:22:31 -08:00 |
|
Awni Hannun
|
1a302bcfaa
|
Merge pull request #82 from ml-explore/llamav2
llama v2 with sharded weights
|
2023-12-12 17:08:24 -08:00 |
|
Awni Hannun
|
a3e413affb
|
hf correction
|
2023-12-12 17:08:04 -08:00 |
|