Author | Commit | Message | Date
Juarez Bochi | 55d6267c31 | Fix q4_1 dequantization | 2024-01-10 10:17:13 -05:00
Juarez Bochi | dc69c608df | Add support for q4_0 and q4_1 quantizations | 2024-01-09 18:04:18 -05:00
antirez | eec3dc9f54 | F16 output for dequantization. | 2024-01-09 18:46:26 +01:00
antirez | 26e3a59233 | Rename gguf_init/end to more obvious names. | 2024-01-09 16:35:40 +01:00
antirez | 6eb4aeb2fb | gguf_create(): take flags to be able to overwrite files. Fixes #7. | 2024-01-09 16:32:10 +01:00
antirez | 419d4706f6 | Q2_K dequantization. | 2024-01-05 23:38:47 +01:00
Juarez Bochi | 5d10eaac8d | Add tensor type features for int types | 2024-01-03 16:33:47 -05:00
antirez | b1f32c4088 | Quantization functions refactoring. | 2024-01-03 21:02:47 +01:00
Salvatore Sanfilippo | b4e7da4ceb | Merge pull request #1 from jbochi/typos: Fix some typos | 2024-01-03 14:54:30 +01:00
Juarez Bochi | 463fd63cf2 | Check remap when appending kv/info/data | 2024-01-03 08:01:00 -05:00
Juarez Bochi | e5cdcec626 | Fix some typos | 2024-01-03 07:34:12 -05:00
antirez | c8469c4a27 | Q6_K quantization implemented. | 2023-12-31 14:06:49 +01:00
antirez | a4858afb4d | Implement f16/f32 in gguf_tensor_to_float(). | 2023-12-30 17:23:27 +01:00
antirez | e2062eea2c | Q4_K dequantization. | 2023-12-28 12:31:35 +01:00
antirez | c25ccfa02a | Q8_0 dequantization. | 2023-12-27 21:22:33 +01:00
antirez | a77a4d061c | Mixtral experts extraction test. | 2023-12-26 17:23:47 +01:00
antirez | 7e9c2bd6a7 | Better explain the tensor total size math. | 2023-12-26 09:20:54 +01:00
antirez | 3081d69b8e | split-mixtral: copying of keys + APIs needed. | 2023-12-26 09:14:50 +01:00
antirez | 96e7eb2d4c | gguf-tools: accept subcommands. | 2023-12-26 00:07:56 +01:00
antirez | 53fb176b3b | Initial API to create new GGUF files. Also added a few libraries that will be needed soon. The CLI was renamed with the final name of gguf-tools. | 2023-12-25 22:10:07 +01:00
antirez | 3eb30c1872 | API to remap/rewind + mapping in write mode. | 2023-12-25 10:45:38 +01:00
antirez | b3092d3860 | Compute tensor size in bytes. | 2023-12-24 23:44:24 +01:00
antirez | d54409bc9c | Some library layout. | 2023-12-24 18:31:03 +01:00