mlx-examples/llms/mixtral/params.json

2 lines
193 B
JSON
Raw Normal View History

2023-12-15 07:30:32 +08:00
{"dim": 4096, "n_layers": 32, "head_dim": 128, "hidden_dim": 14336, "n_heads": 32, "n_kv_heads": 8, "norm_eps": 1e-05, "vocab_size": 32000, "moe": {"num_experts_per_tok": 2, "num_experts": 8}}