Awni Hannun
|
e03f0372b1
|
More shape type (#1705)
* more shape type
* fix
|
2024-12-19 08:08:20 -08:00 |
|
Awni Hannun
|
afc9c0ec1b
|
dtype is copy assignable (#1436)
|
2024-09-25 12:07:13 -07:00 |
|
Awni Hannun
|
df124e018a
|
fix gguf (#1273)
* fix gguf
* comment
|
2024-07-18 07:35:35 -07:00 |
|
Awni Hannun
|
8b1906abd0
|
Add compiler flags to disable safetensors and gguf (#1098)
* with docs
* nit
|
2024-05-09 17:39:44 -07:00 |
|
Awni Hannun
|
ed83908931
|
fix gguf loading quants (#1014)
* fix gguf loading quants
* fix nanobind install
* actual fix
|
2024-04-19 12:24:07 -07:00 |
|
Awni Hannun
|
741eb28443
|
fix a couple bugs (#952)
|
2024-04-02 12:07:41 -07:00 |
|
Cheng
|
46caf0bef0
|
Remove unnecessary string copies (#891)
1. Use string_view instead of string when there is no need for copy.
2. Otherwise move string when possible.
|
2024-03-28 13:14:59 -07:00 |
|
Jack Mousseau
|
0925af43b0
|
Remove unused variables (#706)
|
2024-02-18 12:50:10 -08:00 |
|
Juarez Bochi
|
4fe2fa2a64
|
GGUF: Avoid dequantization when format is compatible (#426)
* GGUF: Don't dequantize q4_1
* Fix weight order. First in low bits
* Add unpacking for q4_0
* Don't dequantize q8_0
* rebase quants and split file
* don't quantize every weight
* reapply patch
* error handling
---------
Co-authored-by: Awni Hannun <awni@apple.com>
|
2024-01-23 15:43:57 -08:00 |
|