dmdaksh
|
7d7e236061
|
- Removed unused Python imports (#683)
- bert/model.py:10: tree_unflatten
- bert/model.py:2: dataclass
- bert/model.py:8: numpy
- cifar/resnet.py:6: Any
- clip/model.py:15: tree_flatten
- clip/model.py:9: Union
- gcn/main.py:8: download_cora
- gcn/main.py:9: cross_entropy
- llms/gguf_llm/models.py:12: tree_flatten, tree_unflatten
- llms/gguf_llm/models.py:9: numpy
- llms/mixtral/mixtral.py:12: tree_map
- llms/mlx_lm/models/dbrx.py:2: Dict, Union
- llms/mlx_lm/tuner/trainer.py:5: partial
- llms/speculative_decoding/decoder.py:1: dataclass, field
- llms/speculative_decoding/decoder.py:2: Optional
- llms/speculative_decoding/decoder.py:5: mlx.nn
- llms/speculative_decoding/decoder.py:6: numpy
- llms/speculative_decoding/main.py:2: glob
- llms/speculative_decoding/main.py:3: json
- llms/speculative_decoding/main.py:5: Path
- llms/speculative_decoding/main.py:8: mlx.nn
- llms/speculative_decoding/model.py:6: tree_unflatten
- llms/speculative_decoding/model.py:7: AutoTokenizer
- llms/tests/test_lora.py:13: yaml_loader
- lora/lora.py:14: tree_unflatten
- lora/models.py:11: numpy
- lora/models.py:3: glob
- speechcommands/kwt.py:1: Any
- speechcommands/main.py:7: mlx.data
- stable_diffusion/stable_diffusion/model_io.py:4: partial
- whisper/benchmark.py:5: sys
- whisper/test.py:5: subprocess
- whisper/whisper/audio.py:6: Optional
- whisper/whisper/decoding.py:8: mlx.nn
|
2024-04-16 07:50:32 -07:00 |
|
yzimmermann
|
4680ef4413
|
Enable more BERT models (#580)
* Update convert.py
* Update model.py
* Update test.py
* Update model.py
* Update convert.py
* Add files via upload
* Update convert.py
* format
* nit
* nit
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2024-03-19 17:21:33 -07:00 |
|
Race
|
376bb9cc44
|
bert encoder inherits from nn.Module now (#571)
|
2024-03-13 10:24:21 -07:00 |
|
Yiğit Ö. Ünver
|
0f19237fb8
|
docs: added missing imports (#375)
* add: missing import
* add: missing import
|
2024-01-25 10:44:53 -08:00 |
|
Angelos Katharopoulos
|
1fa40067fe
|
Change tuple type definitions to use Tuple (#308)
|
2024-01-12 11:15:09 -08:00 |
|
Awni Hannun
|
bbd7172eef
|
Some fixes / cleanup for BERT example (#269)
* some fixes/cleaning for bert + test
* nit
|
2024-01-09 08:44:51 -08:00 |
|
Awni Hannun
|
27c0a8c002
|
Add llms subdir + update README (#145)
* add llms subdir + update README
* nits
* use same pre-commit as mlx
* update readmes a bit
* format
|
2023-12-20 10:22:25 -08:00 |
|
Awni Hannun
|
9c7e996ff0
|
Merge pull request #51 from jbarrow/main
Update BERT to take advantage of bias param in MultiHeadAttention
|
2023-12-13 15:20:29 -08:00 |
|
Joe Barrow
|
9f4e63acbf
|
Update to mlx>=0.0.5
|
2023-12-13 17:48:07 -05:00 |
|
Juarez Bochi
|
03fe6896de
|
Fix convert.py instructions for Bert model
It just adds the missing backslash.
|
2023-12-13 11:37:02 -05:00 |
|
Awni Hannun
|
a99e9d551e
|
hf correction
|
2023-12-12 17:08:04 -08:00 |
|
Joe Barrow
|
a577abc313
|
Cleaner masking code
|
2023-12-09 21:21:24 -05:00 |
|
Awni Hannun
|
5a5decf767
|
revert accidental change
|
2023-12-09 14:58:45 -08:00 |
|
Awni Hannun
|
98f4346c81
|
black format
|
2023-12-09 14:15:25 -08:00 |
|
Joe Barrow
|
5d4838b02e
|
Updating BERT model to take advantage of bias param in MultiHeadAttention
|
2023-12-09 12:07:33 -05:00 |
|
Joe Barrow
|
d873e10dfe
|
Updating README for current example, making python>=3.8 compatibile, and fixing code type
|
2023-12-09 12:01:58 -05:00 |
|
Joe Barrow
|
20d920a7eb
|
Requirements for running BERT
|
2023-12-09 10:52:55 -05:00 |
|
Joe Barrow
|
45ca4ed3f6
|
Updating README
|
2023-12-09 10:48:34 -05:00 |
|
Joe Barrow
|
7320456226
|
Cleaning implementation for merge
|
2023-12-09 10:41:15 -05:00 |
|
Joe Barrow
|
e05ee57bab
|
Update README for mlx-examples repo
|
2023-12-08 10:20:50 -05:00 |
|
Joe Barrow
|
4e5b8ceafe
|
BERT implementation
|
2023-12-08 05:14:11 -05:00 |
|