Awni Hannun
|
08e862336a
|
Rope theta to support Coda Llama (#121)
* rope theta for llama model
* llama chat/code
* nit
|
2023-12-15 19:51:51 -08:00 |
|
Awni Hannun
|
db134d976d
|
Merge pull request #115 from ml-explore/lora_custom
Customize dataset with lora
|
2023-12-15 13:54:58 -08:00 |
|
Awni Hannun
|
8df211869e
|
minimum version
|
2023-12-15 13:54:31 -08:00 |
|
Sam Coward
|
877f88dfea
|
Pass along temp argument to generate()
|
2023-12-15 15:16:41 -05:00 |
|
Awni Hannun
|
71b40c474c
|
Merge pull request #108 from devonthomas35/phi2_eos
Phi-2: Stop generating at eos token
|
2023-12-15 07:34:11 -08:00 |
|
devonthomas35
|
d74d9453dd
|
Refactor EOS check
|
2023-12-14 21:11:23 -08:00 |
|
Awni Hannun
|
a3ecda22fe
|
Merge pull request #107 from ml-explore/hf_mixtral
Use official HF for mixtral
|
2023-12-14 16:57:19 -08:00 |
|
Awni Hannun
|
b863e7cca0
|
format
|
2023-12-14 16:56:50 -08:00 |
|
devonthomas35
|
d7d7aabded
|
Remove unnecessary return
|
2023-12-14 15:52:22 -08:00 |
|
devonthomas35
|
4549dcbbd0
|
Stop generating at eos token
|
2023-12-14 15:50:59 -08:00 |
|
Fahad Nadeem
|
9b887cef08
|
minor dep fix in phi
|
2023-12-15 03:09:33 +05:00 |
|
Awni Hannun
|
8f60d60814
|
cleanup conversion to use single qkv matrix
|
2023-12-14 09:19:44 -08:00 |
|
Awni Hannun
|
0c1c500714
|
update readme
|
2023-12-14 08:37:34 -08:00 |
|
Awni Hannun
|
3d2a23184a
|
change file name for consistency, update readme.
|
2023-12-14 08:34:24 -08:00 |
|
Awni Hannun
|
840c0c36c2
|
don't drop last tokens
|
2023-12-14 08:27:44 -08:00 |
|
Awni Hannun
|
1613e608a9
|
fix args, update README, remove extra files
|
2023-12-14 08:18:01 -08:00 |
|
Awni Hannun
|
a8d4149147
|
fix fp16 + nits
|
2023-12-14 08:08:28 -08:00 |
|
Awni Hannun
|
88d7b67e6e
|
add cache + generation, clean up some stuff
|
2023-12-13 22:26:33 -08:00 |
|
Joe Barrow
|
a466cc5191
|
phi-2 draft
|
2023-12-13 22:23:38 -05:00 |
|