25 Commits

Author SHA1 Message Date
Justine Tunney
ede59bb742 Add BF16 support and fix warnings
This change updates the data type definitions to be the same as the
latest source code. Support for the bfloat16 data type is available
however it can't interpret the IQ quantization formats yet. Cleanup
of compiler warnings and other nits have been fixed, but behavioral
changes have been avoided, and no new features are as of yet added.
2024-05-25 22:58:50 -07:00
Jack Mousseau
c2cef3d1d8 Prevent memory leak when tensor type is invalid 2024-02-18 07:24:44 -08:00
Juarez Bochi
55d6267c31 Fix q4_1 dequantization 2024-01-10 10:17:13 -05:00
Juarez Bochi
dc69c608df Add support for q4_0 and q4_1 quantizations 2024-01-09 18:04:18 -05:00
antirez
eec3dc9f54 F16 output for dequantization. 2024-01-09 18:46:26 +01:00
antirez
26e3a59233 Rename gguf_init/end to more obvious names. 2024-01-09 16:35:40 +01:00
antirez
6eb4aeb2fb gguf_create(): take flags to be able to overwrite files. Fixes #7. 2024-01-09 16:32:10 +01:00
antirez
419d4706f6 Q2_K dequantization. 2024-01-05 23:38:47 +01:00
Juarez Bochi
5d10eaac8d Add tensor type features for int types 2024-01-03 16:33:47 -05:00
antirez
b1f32c4088 Quantization functions refactoring. 2024-01-03 21:02:47 +01:00
Salvatore Sanfilippo
b4e7da4ceb Merge pull request #1 from jbochi/typos
Fix some typos
2024-01-03 14:54:30 +01:00
Juarez Bochi
463fd63cf2 Check remap when appending kv/info/data 2024-01-03 08:01:00 -05:00
Juarez Bochi
e5cdcec626 Fix some typos 2024-01-03 07:34:12 -05:00
antirez
c8469c4a27 Q6_K quantization implemented. 2023-12-31 14:06:49 +01:00
antirez
a4858afb4d Implement f16/f32 in gguf_tensor_to_float(). 2023-12-30 17:23:27 +01:00
antirez
e2062eea2c Q4_K dequantization. 2023-12-28 12:31:35 +01:00
antirez
c25ccfa02a Q8_0 dequantization. 2023-12-27 21:22:33 +01:00
antirez
a77a4d061c Mixtral experts extraction test. 2023-12-26 17:23:47 +01:00
antirez
7e9c2bd6a7 Better explain the tensor total size math. 2023-12-26 09:20:54 +01:00
antirez
3081d69b8e split-mixtral: copying of keys + APIs needed. 2023-12-26 09:14:50 +01:00
antirez
96e7eb2d4c gguf-tools: accept subcommands. 2023-12-26 00:07:56 +01:00
antirez
53fb176b3b Initial API to create new GGUF files.
Also added a few libraries that will be needed soon.
The CLI was renamed with the final name of gguf-tools.
2023-12-25 22:10:07 +01:00
antirez
3eb30c1872 API to remap/rewind + mapping in write mode. 2023-12-25 10:45:38 +01:00
antirez
b3092d3860 Compute tensor size in bytes. 2023-12-24 23:44:24 +01:00
antirez
d54409bc9c Some library layout. 2023-12-24 18:31:03 +01:00