Justine Tunney
ede59bb742
Add BF16 support and fix warnings
This change updates the data type definitions to match the latest
source code. The bfloat16 data type is now supported, although the
IQ quantization formats cannot be interpreted yet. Compiler warnings
and other nits have been cleaned up, but behavioral changes have
been avoided and no new features have been added yet.
2024-05-25 22:58:50 -07:00
Jack Mousseau
c2cef3d1d8
Prevent memory leak when tensor type is invalid
2024-02-18 07:24:44 -08:00
Juarez Bochi
55d6267c31
Fix q4_1 dequantization
2024-01-10 10:17:13 -05:00
Juarez Bochi
dc69c608df
Add support for q4_0 and q4_1 quantizations
2024-01-09 18:04:18 -05:00
antirez
eec3dc9f54
F16 output for dequantization.
2024-01-09 18:46:26 +01:00
antirez
26e3a59233
Rename gguf_init/end to more obvious names.
2024-01-09 16:35:40 +01:00
antirez
6eb4aeb2fb
gguf_create(): take flags to be able to overwrite files. Fixes #7.
2024-01-09 16:32:10 +01:00
antirez
419d4706f6
Q2_K dequantization.
2024-01-05 23:38:47 +01:00
Juarez Bochi
5d10eaac8d
Add tensor type features for int types
2024-01-03 16:33:47 -05:00
antirez
b1f32c4088
Quantization functions refactoring.
2024-01-03 21:02:47 +01:00
Salvatore Sanfilippo
b4e7da4ceb
Merge pull request #1 from jbochi/typos
Fix some typos
2024-01-03 14:54:30 +01:00
Juarez Bochi
463fd63cf2
Check remap when appending kv/info/data
2024-01-03 08:01:00 -05:00
Juarez Bochi
e5cdcec626
Fix some typos
2024-01-03 07:34:12 -05:00
antirez
c8469c4a27
Q6_K quantization implemented.
2023-12-31 14:06:49 +01:00
antirez
a4858afb4d
Implement f16/f32 in gguf_tensor_to_float().
2023-12-30 17:23:27 +01:00
antirez
e2062eea2c
Q4_K dequantization.
2023-12-28 12:31:35 +01:00
antirez
c25ccfa02a
Q8_0 dequantization.
2023-12-27 21:22:33 +01:00
antirez
a77a4d061c
Mixtral experts extraction test.
2023-12-26 17:23:47 +01:00
antirez
7e9c2bd6a7
Better explain the tensor total size math.
2023-12-26 09:20:54 +01:00
antirez
3081d69b8e
split-mixtral: copying of keys + APIs needed.
2023-12-26 09:14:50 +01:00
antirez
96e7eb2d4c
gguf-tools: accept subcommands.
2023-12-26 00:07:56 +01:00
antirez
53fb176b3b
Initial API to create new GGUF files.
Also added a few libraries that will be needed soon.
The CLI was renamed with the final name of gguf-tools.
2023-12-25 22:10:07 +01:00
antirez
3eb30c1872
API to remap/rewind + mapping in write mode.
2023-12-25 10:45:38 +01:00
antirez
b3092d3860
Compute tensor size in bytes.
2023-12-24 23:44:24 +01:00
antirez
d54409bc9c
Some library layout.
2023-12-24 18:31:03 +01:00