mlx-examples

zhangyiss/mlx-examples

Fork 0

mirror of https://github.com/ml-explore/mlx-examples.git synced 2025-06-28 12:13:25 +08:00

Commit Graph

Author	SHA1	Message	Date
Jaward Sesay	7c0962f4e2	Add Supported Quantized Phi-3-mini-4k-instruct gguf Weight (#717 ) * support for phi-3 4bits quantized gguf weights * Added link to 4 bits quantized model * removed some prints * Added correct comment * Added correct comment * removed print Since last condition already prints warning for when quantization is None	2024-04-29 20:11:32 -07:00
Juarez Bochi	f5b80c95fb	Example reading directly from gguf file (#222 ) * Draft of tiny llama from gguf * Transpose all * No transposition with new layout * Read config from gguf * Create tokenizer from gguf * move gguf and update to be similar to hf_llm * change model to HF style + updates to REAMDE * nits in REAMDE * nit readme * only use mlx for metadata * fix eos/bos tokenizer * fix tokenization * quantization runs * 8-bit works * tokenizer fix * bump mlx version --------- Co-authored-by: Juarez Bochi <juarez.bochi@grammarly.com> Co-authored-by: Awni Hannun <awni@apple.com>	2024-01-23 15:41:54 -08:00

Author

SHA1

Message

Date

Jaward Sesay

7c0962f4e2

Add Supported Quantized Phi-3-mini-4k-instruct gguf Weight (#717 )

* support for phi-3 4bits quantized gguf weights

* Added link to 4 bits quantized model

* removed some prints

* Added correct comment

* Added correct comment

* removed print

Since last condition already prints warning for when quantization is None

2024-04-29 20:11:32 -07:00

Juarez Bochi

f5b80c95fb

Example reading directly from gguf file (#222 )

* Draft of tiny llama from gguf

* Transpose all

* No transposition with new layout

* Read config from gguf

* Create tokenizer from gguf

* move gguf and update to be similar to hf_llm

* change model to HF style + updates to REAMDE

* nits in REAMDE

* nit readme

* only use mlx for metadata

* fix eos/bos tokenizer

* fix tokenization

* quantization runs

* 8-bit works

* tokenizer fix

* bump mlx version

---------

Co-authored-by: Juarez Bochi <juarez.bochi@grammarly.com>
Co-authored-by: Awni Hannun <awni@apple.com>

2024-01-23 15:41:54 -08:00

2 Commits