Commit Graph

130 Commits

Author SHA1 Message Date
Awni Hannun
6249f46215 incude instruct option 2023-12-14 15:40:38 -08:00
Awni Hannun
449f7a694b use official HF for mixtral 2023-12-14 15:30:32 -08:00
Awni Hannun
53e58795c2 Merge pull request #77 from SarthakYadav/main
Added CIFAR-10 + ResNet example
2023-12-14 12:19:40 -08:00
Awni Hannun
e12e4d5825 typo / nits 2023-12-14 12:14:01 -08:00
Awni Hannun
5673716daa updates + format 2023-12-14 12:09:10 -08:00
Awni Hannun
4cac181917 Merge pull request #103 from arpitingle/patch-1
added phi in readme
2023-12-14 10:19:40 -08:00
arpit
541265b74d Update README.md 2023-12-14 23:40:50 +05:30
Awni Hannun
f4745d8576 Merge pull request #97 from jbarrow/main
Phi-2
2023-12-14 09:21:26 -08:00
Awni Hannun
fa9e34b041 cleanup conversion to use single qkv matrix 2023-12-14 09:19:44 -08:00
Awni Hannun
45c1800fc6 update readme 2023-12-14 08:37:34 -08:00
Awni Hannun
c2eb435697 change file name for consistency, update readme. 2023-12-14 08:34:24 -08:00
Awni Hannun
5822639f23 don't drop last tokens 2023-12-14 08:27:44 -08:00
Awni Hannun
c26eafc125 fix args, update README, remove extra files 2023-12-14 08:18:01 -08:00
Awni Hannun
05c82ddf5f fix fp16 + nits 2023-12-14 08:08:28 -08:00
Sarthak Yadav
879a576fb6 updated header 2023-12-14 16:28:00 +01:00
Awni Hannun
bb44222a86 Merge pull request #98 from finnless/patch-1
Fix typo in stable_diffusion README
2023-12-14 07:13:19 -08:00
Awni Hannun
a2aadb24bd Merge pull request #102 from burakbudanur/main
Corrected the typo in 'ffn_dim_multiplier' in and added 'rope_theta' …
2023-12-14 07:12:20 -08:00
Burak Budanur
f603d53bef Corrected the typo in 'ffn_dim_multiplier' in and added 'rope_theta' to the list unused. Without these, llama examples did not run. 2023-12-14 14:02:11 +01:00
Sarthak Yadav
a3c0343b31 simplified ResNet, expanded README with throughput and performance 2023-12-14 09:05:04 +01:00
Awni Hannun
0301cbd88b add cache + generation, clean up some stuff 2023-12-13 22:26:33 -08:00
Nolan
d526c19680 Fix typo in stable_diffusion README 2023-12-13 20:51:39 -08:00
Joe Barrow
1fe230910b phi-2 draft 2023-12-13 22:23:38 -05:00
Awni Hannun
f6e24ea7aa Merge pull request #96 from Stv-X/typo-fix
Typo fix in whisper/README
2023-12-13 16:28:03 -08:00
Stv.X
7bd67985e9 Corrected spelling of terms in whisper/README.md 2023-12-14 08:15:26 +08:00
Awni Hannun
5e8f8386ed Merge pull request #51 from jbarrow/main
Update BERT to take advantage of bias param in MultiHeadAttention
2023-12-13 15:20:29 -08:00
Joe Barrow
bab982fced Update to mlx>=0.0.5 2023-12-13 17:48:07 -05:00
Awni Hannun
0f37169f17 Merge pull request #94 from jbax3/patch-1
Update README.md to fix git-lfs command
2023-12-13 14:19:14 -08:00
jbax3
791075f91f Update README.md to fix git-lfs command 2023-12-13 15:51:27 -06:00
Awni Hannun
15fd2981bc Merge pull request #93 from jbochi/patch-1
Fix convert.py instructions for Bert model
2023-12-13 08:47:52 -08:00
Juarez Bochi
37a27c5500 Fix convert.py instructions for Bert model
It just adds the missing backslash.
2023-12-13 11:37:02 -05:00
Awni Hannun
2f2d5d6c1d Merge pull request #90 from bofenghuang/fix-fp16
Fix whisper fp16 inference
2023-12-13 07:29:10 -08:00
Awni Hannun
4c5b9cf8a8 Merge pull request #88 from dastrobu/meta-form-url
fix "request access" form url for Llama models
2023-12-13 07:20:51 -08:00
bofenghuang
7871f2a0bf Fix fp16 2023-12-13 11:07:47 +01:00
Daniel Strobusch
ba4cb95d8e fix "request access" form url for Llama models 2023-12-13 10:19:29 +01:00
Awni Hannun
f599546337 Merge pull request #76 from bofenghuang/add-whisper-large-v3
Add whisper-large-v3
2023-12-12 20:22:31 -08:00
Awni Hannun
1a302bcfaa Merge pull request #82 from ml-explore/llamav2
llama v2 with sharded weights
2023-12-12 17:08:24 -08:00
Awni Hannun
a3e413affb hf correction 2023-12-12 17:08:04 -08:00
Awni Hannun
3391ccdeb5 Merge pull request #79 from ml-explore/whisper_fp16
Enable FP16 for Whisper
2023-12-12 17:05:21 -08:00
Awni Hannun
07e9360e8c Merge pull request #84 from iammerrick/patch-1
Update convert.py
2023-12-12 17:02:21 -08:00
Awni Hannun
d22313140f Merge pull request #86 from 1-ashraful-islam/patch-2
Update README.md with recently added examples
2023-12-12 17:01:02 -08:00
Ashraful Islam
59f317520a Update README.md
updates readme with recently added examples
2023-12-12 18:26:13 -06:00
Merrick Christensen
d5c4f4e9ca Update convert.py
Docs are right, however, the code has a typo.
2023-12-12 14:33:33 -07:00
Awni Hannun
61f81f2078 llama v1 request 2023-12-12 13:32:05 -08:00
Awni Hannun
c18b990f08 llama v2 with sharded weights 2023-12-12 12:48:15 -08:00
Awni Hannun
03a408fa2e Merge pull request #80 from 805karansaini/main
Typo Fix
2023-12-12 12:20:13 -08:00
805karansaini
28e1384267 Typo Fix 2023-12-13 01:45:50 +05:30
Awni Hannun
8bbb4366d0 Merge pull request #75 from ml-explore/mixtral
Mixtral
2023-12-12 10:41:25 -08:00
Sarthak Yadav
12ad979ac8 fixed doc for ResNet 2023-12-12 19:07:39 +01:00
Sarthak Yadav
ea9a4f878a added CIFAR10 + ResNet example 2023-12-12 19:01:06 +01:00
Awni Hannun
1b6213b3d3 nit 2023-12-12 08:42:32 -08:00