mlx-examples/llms/mlx_lm/models at 2146bcd7ee76ce7ac46f585801648087726ad904 - mlx-examples - Gitea for Geophysics

zhangyiss/mlx-examples

Mirror of https://github.com/ml-explore/mlx-examples.git, synced 2025-12-16 02:08:55 +08:00

Awni Hannun 2146bcd7ee Quantize embedding / Update quantize API (#680)

* more async eval

* quantize embedding / update quantize api

* more updates for quantize

* update for quantize embeddings

* update sd quant API

* update sdxl quants

* error for datasets < batch_size

* async

* fix config loading

* fix quant

* fix tests

* fix req

* remove lm head if tie weights is true

* fix test

2024-04-18 18:16:10 -07:00

__init__.py

Mlx llm package (#301)

2024-01-12 10:25:56 -08:00

base.py

Mlx llm package (#301)

2024-01-12 10:25:56 -08:00

cohere.py

Quantize embedding / Update quantize API (#680)

2024-04-18 18:16:10 -07:00

dbrx.py

- Removed unused Python imports (#683)

2024-04-16 07:50:32 -07:00

gemma.py

Quantize embedding / Update quantize API (#680)

2024-04-18 18:16:10 -07:00

llama.py

Switch to fast RMS/LN Norm (#603)

2024-03-23 07:13:51 -07:00

mixtral.py

Fix argpartition call in Mixtral and other MOES (#676)

2024-04-12 11:00:56 -07:00

olmo.py

Quantize embedding / Update quantize API (#680)

2024-04-18 18:16:10 -07:00

phi.py

Switch to fast RMS/LN Norm (#603)

2024-03-23 07:13:51 -07:00

phixtral.py

Fix argpartition call in Mixtral and other MOES (#676)

2024-04-12 11:00:56 -07:00

plamo.py

Configurable LR schedulers (#604)

2024-03-29 13:41:10 -07:00

qwen2_moe.py

Add support for qwen2moe (#640)

2024-04-02 11:33:29 -07:00

qwen2.py

Quantize embedding / Update quantize API (#680)

2024-04-18 18:16:10 -07:00

qwen.py

Switch to fast RMS/LN Norm (#603)

2024-03-23 07:13:51 -07:00

stablelm.py

Stable lm 2 (#666)

2024-04-08 14:18:55 -07:00

starcoder2.py

Quantize embedding / Update quantize API (#680)

2024-04-18 18:16:10 -07:00