Awni Hannun
|
6ae68777aa
|
Rope theta to support Coda Llama (#121)
* rope theta for llama model
* llama chat/code
* nit
|
2023-12-15 19:51:51 -08:00 |
|
Awni Hannun
|
376b273b2f
|
Merge pull request #115 from ml-explore/lora_custom
Customize dataset with lora
|
2023-12-15 13:54:58 -08:00 |
|
Awni Hannun
|
ad77bc4c3b
|
minimum version
|
2023-12-15 13:54:31 -08:00 |
|
Sam Coward
|
a5bacfd04f
|
Pass along temp argument to generate()
|
2023-12-15 15:16:41 -05:00 |
|
Awni Hannun
|
737db11152
|
Merge pull request #108 from devonthomas35/phi2_eos
Phi-2: Stop generating at eos token
|
2023-12-15 07:34:11 -08:00 |
|
devonthomas35
|
f6ac70c736
|
Refactor EOS check
|
2023-12-14 21:11:23 -08:00 |
|
Awni Hannun
|
12a5597ac3
|
Merge pull request #107 from ml-explore/hf_mixtral
Use official HF for mixtral
|
2023-12-14 16:57:19 -08:00 |
|
Awni Hannun
|
7cf66dc88c
|
format
|
2023-12-14 16:56:50 -08:00 |
|
devonthomas35
|
7f992db5bc
|
Remove unnecessary return
|
2023-12-14 15:52:22 -08:00 |
|
devonthomas35
|
8d496ba61a
|
Stop generating at eos token
|
2023-12-14 15:50:59 -08:00 |
|
Fahad Nadeem
|
330e8e8bc9
|
minor dep fix in phi
|
2023-12-15 03:09:33 +05:00 |
|
Awni Hannun
|
fa9e34b041
|
cleanup conversion to use single qkv matrix
|
2023-12-14 09:19:44 -08:00 |
|
Awni Hannun
|
45c1800fc6
|
update readme
|
2023-12-14 08:37:34 -08:00 |
|
Awni Hannun
|
c2eb435697
|
change file name for consistency, update readme.
|
2023-12-14 08:34:24 -08:00 |
|
Awni Hannun
|
5822639f23
|
don't drop last tokens
|
2023-12-14 08:27:44 -08:00 |
|
Awni Hannun
|
c26eafc125
|
fix args, update README, remove extra files
|
2023-12-14 08:18:01 -08:00 |
|
Awni Hannun
|
05c82ddf5f
|
fix fp16 + nits
|
2023-12-14 08:08:28 -08:00 |
|
Awni Hannun
|
0301cbd88b
|
add cache + generation, clean up some stuff
|
2023-12-13 22:26:33 -08:00 |
|
Joe Barrow
|
1fe230910b
|
phi-2 draft
|
2023-12-13 22:23:38 -05:00 |
|