Commit Graph

153 Commits

Author SHA1 Message Date
jbax3
791075f91f Update README.md to fix git-lfs command 2023-12-13 15:51:27 -06:00
Awni Hannun
15fd2981bc Merge pull request #93 from jbochi/patch-1
Fix convert.py instructions for Bert model
2023-12-13 08:47:52 -08:00
Juarez Bochi
37a27c5500 Fix convert.py instructions for Bert model
It just adds the missing backslash.
2023-12-13 11:37:02 -05:00
Awni Hannun
2f2d5d6c1d Merge pull request #90 from bofenghuang/fix-fp16
Fix whisper fp16 inference
2023-12-13 07:29:10 -08:00
Awni Hannun
4c5b9cf8a8 Merge pull request #88 from dastrobu/meta-form-url
fix "request access" form url for Llama models
2023-12-13 07:20:51 -08:00
bofenghuang
7871f2a0bf Fix fp16 2023-12-13 11:07:47 +01:00
Daniel Strobusch
ba4cb95d8e fix "request access" form url for Llama models 2023-12-13 10:19:29 +01:00
Awni Hannun
f599546337 Merge pull request #76 from bofenghuang/add-whisper-large-v3
Add whisper-large-v3
2023-12-12 20:22:31 -08:00
Awni Hannun
1a302bcfaa Merge pull request #82 from ml-explore/llamav2
llama v2 with sharded weights
2023-12-12 17:08:24 -08:00
Awni Hannun
a3e413affb hf correction 2023-12-12 17:08:04 -08:00
Awni Hannun
3391ccdeb5 Merge pull request #79 from ml-explore/whisper_fp16
Enable FP16 for Whisper
2023-12-12 17:05:21 -08:00
Awni Hannun
07e9360e8c Merge pull request #84 from iammerrick/patch-1
Update convert.py
2023-12-12 17:02:21 -08:00
Awni Hannun
d22313140f Merge pull request #86 from 1-ashraful-islam/patch-2
Update README.md with recently added examples
2023-12-12 17:01:02 -08:00
Ashraful Islam
59f317520a Update README.md
updates readme with recently added examples
2023-12-12 18:26:13 -06:00
Merrick Christensen
d5c4f4e9ca Update convert.py
Docs are right, however, the code has a typo.
2023-12-12 14:33:33 -07:00
Awni Hannun
61f81f2078 llama v1 request 2023-12-12 13:32:05 -08:00
Awni Hannun
c18b990f08 llama v2 with sharded weights 2023-12-12 12:48:15 -08:00
Awni Hannun
03a408fa2e Merge pull request #80 from 805karansaini/main
Typo Fix
2023-12-12 12:20:13 -08:00
805karansaini
28e1384267 Typo Fix 2023-12-13 01:45:50 +05:30
Awni Hannun
8bbb4366d0 Merge pull request #75 from ml-explore/mixtral
Mixtral
2023-12-12 10:41:25 -08:00
Sarthak Yadav
12ad979ac8 fixed doc for ResNet 2023-12-12 19:07:39 +01:00
Sarthak Yadav
ea9a4f878a added CIFAR10 + ResNet example 2023-12-12 19:01:06 +01:00
Awni Hannun
1b6213b3d3 nit 2023-12-12 08:42:32 -08:00
Awni Hannun
5af8ad3edc typos in readme 2023-12-12 08:41:28 -08:00
Awni Hannun
86b0c6d91d mixtral runs a bit faster 2023-12-12 08:36:40 -08:00
bofenghuang
b34d2fd04a Add large v3 2023-12-12 17:26:52 +01:00
Awni Hannun
8fe8fb13f1 initial mixtral 2023-12-12 07:44:23 -08:00
Awni Hannun
25bac93cdf whisper default in fp16 2023-12-12 07:37:35 -08:00
Awni Hannun
9ec14f7d47 Merge pull request #73 from jj701/mnist-requirements-txt
Adding Requirements.txt to the Mnist Example
2023-12-11 19:37:53 -08:00
jj701
40ead95e1b Adding Requirements.txt 2023-12-11 20:45:39 -06:00
Awni Hannun
7738f8e352 Merge pull request #69 from TristanBilot/main
Add Graph Convolutional Network example
2023-12-11 14:22:47 -08:00
Tristan Bilot
8aac12b54e fix comments before merge 2023-12-11 23:10:46 +01:00
Tristan Bilot
62484a49a3 use tree_flatten within L2 regularization 2023-12-11 20:15:11 +01:00
Tristan Bilot
d936ab5129 add GCN implementation 2023-12-11 17:48:07 +01:00
Awni Hannun
57cf34b186 Merge pull request #66 from Haixing-Hu/fix-issue-54
fix: fix issue #54, use CPU device to load the Torch model
2023-12-10 18:57:51 -08:00
Haixing Hu
60da4f8b24 fix: fix issue #54, use CPU device to load the Torch model 2023-12-11 10:54:55 +08:00
Awni Hannun
cba703d1f1 fix conversion 2023-12-10 16:56:41 -08:00
Awni Hannun
8e801fcc57 Merge pull request #52 from ricardo-larosa/mistral_batch_size
Mistral: Pass argument --tokens_per_eval for token generation
2023-12-10 11:25:23 -08:00
ricardo-larosa
e17beb0877 Add arg tokens_per_eval for token generation 2023-12-10 11:09:13 +01:00
Joe Barrow
ac942055aa Cleaner masking code 2023-12-09 21:21:24 -05:00
Awni Hannun
dc1924edf1 Merge pull request #53 from ml-explore/mistral_lora
Generalize lora finetuning to Mistral
2023-12-09 15:05:29 -08:00
Awni Hannun
5bb1506ae9 revert accidental change 2023-12-09 14:58:45 -08:00
Awni Hannun
00e21705c2 few more nits 2023-12-09 14:20:19 -08:00
Awni Hannun
dfe79a46f2 black format 2023-12-09 14:15:25 -08:00
Awni Hannun
8094503a68 generalize lora finetuning for llama and mistral 2023-12-09 14:13:55 -08:00
Joe Barrow
29208830ae Updating BERT model to take advantage of bias param in MultiHeadAttention 2023-12-09 12:07:33 -05:00
Awni Hannun
07cdcef452 Merge pull request #43 from jbarrow/main
BERT implementation
2023-12-09 09:03:49 -08:00
Joe Barrow
c5733b48fd Updating README for current example, making python>=3.8 compatibile, and fixing code type 2023-12-09 12:01:58 -05:00
Joe Barrow
187798967c Requirements for running BERT 2023-12-09 10:52:55 -05:00
Joe Barrow
04350eb0a6 Updating README 2023-12-09 10:48:34 -05:00