# BERT

An implementation of BERT [(Devlin, et al., 2019)](https://aclanthology.org/N19-1423/) in MLX.

## Setup

Install the requirements:

```
pip install -r requirements.txt
```

Then convert the weights with:

```
python convert.py \
    --bert-model bert-base-uncased \
    --mlx-model weights/bert-base-uncased.npz
```
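
If you want a quick sanity check that the conversion worked, one option is to
load the `.npz` file back with MLX and inspect a few parameter names and
shapes (a minimal sketch, assuming the output path above):

```python
import mlx.core as mx

# Loading an .npz file returns a dict mapping parameter names to arrays.
weights = mx.load("weights/bert-base-uncased.npz")

# Print a handful of parameter names and shapes.
for name in sorted(weights)[:5]:
    print(name, weights[name].shape)
```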

## Usage

To use the `Bert` model in your own code, you can load it with:

```python
import mlx.core as mx
from model import Bert, load_model

model, tokenizer = load_model(
    "bert-base-uncased",
    "weights/bert-base-uncased.npz")

batch = ["This is an example of BERT working on MLX."]
tokens = tokenizer(batch, return_tensors="np", padding=True)
tokens = {key: mx.array(v) for key, v in tokens.items()}

output, pooled = model(**tokens)
```

The `output` contains a `Batch x Tokens x Dims` tensor, representing a vector
for every input token. If you want to train anything at the **token level**,
use this.
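
For example, a token-level head is just a projection applied to each position
of `output`. Here is a minimal sketch; the linear head, `num_tags`, and the
768 hidden size of `bert-base-uncased` are illustrative assumptions, not part
of this repo:

```python
import mlx.nn as nn

num_tags = 9  # hypothetical tag-set size, e.g. for NER
head = nn.Linear(768, num_tags)  # 768 is the hidden size of bert-base-uncased

# `output` comes from the snippet above and has shape Batch x Tokens x Dims;
# the head maps every token vector to per-token logits.
token_logits = head(output)  # Batch x Tokens x num_tags
```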
					
						

The `pooled` contains a `Batch x Dims` tensor, which is the pooled
representation for each input. If you want to train a **classification**
model, use this.
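
For instance, a sentence classifier can be a single linear layer over
`pooled` (again a minimal sketch; `num_classes` and the layer itself are
illustrative assumptions):

```python
import mlx.nn as nn

num_classes = 2  # hypothetical number of classes
classifier = nn.Linear(768, num_classes)

# `pooled` comes from the snippet above and has shape Batch x Dims.
logits = classifier(pooled)  # Batch x num_classes
```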
					
						

## Test

You can check that the output of the default model (`bert-base-uncased`)
matches the Hugging Face version with:

```
python test.py
```