Commit Graph

156 Commits

Author SHA1 Message Date
Awni Hannun
fd00c22224 Citation + contributor acknowledgments section (#136)
* citation + acks section

* nits
2023-12-18 10:12:35 -08:00
Daniel Strobusch
ca5a8ec273 fix renamed arg (#140) 2023-12-18 10:11:51 -08:00
Awni Hannun
17d2efaebe support for tiny llama (#129) 2023-12-18 07:47:55 -08:00
Awni Hannun
6ae68777aa Rope theta to support Coda Llama (#121)
* rope theta for llama model

* llama chat/code

* nit
2023-12-15 19:51:51 -08:00
Awni Hannun
376b273b2f Merge pull request #115 from ml-explore/lora_custom
Customize dataset with lora
2023-12-15 13:54:58 -08:00
Awni Hannun
ad77bc4c3b minimum version 2023-12-15 13:54:31 -08:00
Pawel Kowalski
4c88163941 Stable diffusion - check model weights shape and support int for "attention_head_dim" (#85)
* Allow integer as attention_head_dim
* Reshape downloaded weights to match model if there is a mismatch
2023-12-15 13:01:02 -08:00
Awni Hannun
fc13e96e6c Merge pull request #116 from idoru/fix-phi-2-temp-arg
phi-2: fix --temp/--seed arguments.
2023-12-15 12:29:19 -08:00
Awni Hannun
e709b846ff 32 GB example 2023-12-15 12:20:15 -08:00
Awni Hannun
24a6de2acb 32 GB example 2023-12-15 12:18:29 -08:00
Sam Coward
a5bacfd04f Pass along temp argument to generate() 2023-12-15 15:16:41 -05:00
Awni Hannun
f019d836ee keep base weights in fp16 2023-12-15 10:42:18 -08:00
Awni Hannun
6dc067d30c use lower precision base weights 2023-12-15 10:29:42 -08:00
Awni Hannun
4444c7a4a5 more nits 2023-12-15 10:06:14 -08:00
Awni Hannun
c889e09773 fix readme 2023-12-15 09:59:07 -08:00
Awni Hannun
9550e1de17 custom data with lora 2023-12-15 09:56:10 -08:00
Awni Hannun
7d4a41ace8 Merge pull request #112 from ml-explore/fix_mixtral
[Bugfix] Fix RoPE base bug in Mixtral example
2023-12-15 08:39:02 -08:00
Awni Hannun
737db11152 Merge pull request #108 from devonthomas35/phi2_eos
Phi-2: Stop generating at eos token
2023-12-15 07:34:11 -08:00
Awni Hannun
001b5803ce fix RoPE bug + minor updates 2023-12-14 21:45:25 -08:00
devonthomas35
f6ac70c736 Refactor EOS check 2023-12-14 21:11:23 -08:00
Awni Hannun
12a5597ac3 Merge pull request #107 from ml-explore/hf_mixtral
Use official HF for mixtral
2023-12-14 16:57:19 -08:00
Awni Hannun
7cf66dc88c format 2023-12-14 16:56:50 -08:00
devonthomas35
7f992db5bc Remove unnecessary return 2023-12-14 15:52:22 -08:00
devonthomas35
8d496ba61a Stop generating at eos token 2023-12-14 15:50:59 -08:00
Awni Hannun
6249f46215 incude instruct option 2023-12-14 15:40:38 -08:00
Awni Hannun
449f7a694b use official HF for mixtral 2023-12-14 15:30:32 -08:00
Awni Hannun
95a1d50318 Merge pull request #106 from fahnub/main
minor dependency fix in phi-2
2023-12-14 14:15:19 -08:00
Fahad Nadeem
330e8e8bc9 minor dep fix in phi 2023-12-15 03:09:33 +05:00
Awni Hannun
53e58795c2 Merge pull request #77 from SarthakYadav/main
Added CIFAR-10 + ResNet example
2023-12-14 12:19:40 -08:00
Awni Hannun
e12e4d5825 typo / nits 2023-12-14 12:14:01 -08:00
Awni Hannun
5673716daa updates + format 2023-12-14 12:09:10 -08:00
Awni Hannun
4cac181917 Merge pull request #103 from arpitingle/patch-1
added phi in readme
2023-12-14 10:19:40 -08:00
arpit
541265b74d Update README.md 2023-12-14 23:40:50 +05:30
Awni Hannun
f4745d8576 Merge pull request #97 from jbarrow/main
Phi-2
2023-12-14 09:21:26 -08:00
Awni Hannun
fa9e34b041 cleanup conversion to use single qkv matrix 2023-12-14 09:19:44 -08:00
Awni Hannun
45c1800fc6 update readme 2023-12-14 08:37:34 -08:00
Awni Hannun
c2eb435697 change file name for consistency, update readme. 2023-12-14 08:34:24 -08:00
Awni Hannun
5822639f23 don't drop last tokens 2023-12-14 08:27:44 -08:00
Awni Hannun
c26eafc125 fix args, update README, remove extra files 2023-12-14 08:18:01 -08:00
Awni Hannun
05c82ddf5f fix fp16 + nits 2023-12-14 08:08:28 -08:00
Sarthak Yadav
879a576fb6 updated header 2023-12-14 16:28:00 +01:00
Awni Hannun
bb44222a86 Merge pull request #98 from finnless/patch-1
Fix typo in stable_diffusion README
2023-12-14 07:13:19 -08:00
Awni Hannun
a2aadb24bd Merge pull request #102 from burakbudanur/main
Corrected the typo in 'ffn_dim_multiplier' in and added 'rope_theta' …
2023-12-14 07:12:20 -08:00
Burak Budanur
f603d53bef Corrected the typo in 'ffn_dim_multiplier' in and added 'rope_theta' to the list unused. Without these, llama examples did not run. 2023-12-14 14:02:11 +01:00
Sarthak Yadav
a3c0343b31 simplified ResNet, expanded README with throughput and performance 2023-12-14 09:05:04 +01:00
Awni Hannun
0301cbd88b add cache + generation, clean up some stuff 2023-12-13 22:26:33 -08:00
Nolan
d526c19680 Fix typo in stable_diffusion README 2023-12-13 20:51:39 -08:00
Joe Barrow
1fe230910b phi-2 draft 2023-12-13 22:23:38 -05:00
Awni Hannun
f6e24ea7aa Merge pull request #96 from Stv-X/typo-fix
Typo fix in whisper/README
2023-12-13 16:28:03 -08:00
Stv.X
7bd67985e9 Corrected spelling of terms in whisper/README.md 2023-12-14 08:15:26 +08:00