Commit Graph

23 Commits

Author SHA1 Message Date
Juarez Bochi
55d6267c31 Fix q4_1 dequantization 2024-01-10 10:17:13 -05:00
Juarez Bochi
dc69c608df Add support for q4_0 and q4_1 quantizations 2024-01-09 18:04:18 -05:00
antirez
eec3dc9f54 F16 output for dequantization. 2024-01-09 18:46:26 +01:00
antirez
26e3a59233 Rename gguf_init/end to more obvious names. 2024-01-09 16:35:40 +01:00
antirez
6eb4aeb2fb gguf_create(): take flags to be able to overwrite files. Fixes #7. 2024-01-09 16:32:10 +01:00
antirez
419d4706f6 Q2_K dequantization. 2024-01-05 23:38:47 +01:00
Juarez Bochi
5d10eaac8d Add tensor type features for int types 2024-01-03 16:33:47 -05:00
antirez
b1f32c4088 Quantization functions refactoring. 2024-01-03 21:02:47 +01:00
Salvatore Sanfilippo
b4e7da4ceb Merge pull request #1 from jbochi/typos
Fix some typos
2024-01-03 14:54:30 +01:00
Juarez Bochi
463fd63cf2 Check remap when appending kv/info/data 2024-01-03 08:01:00 -05:00
Juarez Bochi
e5cdcec626 Fix some typos 2024-01-03 07:34:12 -05:00
antirez
c8469c4a27 Q6_K quantization implemented. 2023-12-31 14:06:49 +01:00
antirez
a4858afb4d Implement f16/f32 in gguf_tensor_to_float(). 2023-12-30 17:23:27 +01:00
antirez
e2062eea2c Q4_K dequantization. 2023-12-28 12:31:35 +01:00
antirez
c25ccfa02a Q8_0 dequantization. 2023-12-27 21:22:33 +01:00
antirez
a77a4d061c Mixtral experts extraction test. 2023-12-26 17:23:47 +01:00
antirez
7e9c2bd6a7 Better explain the tensor total size math. 2023-12-26 09:20:54 +01:00
antirez
3081d69b8e split-mixtral: copying of keys + APIs needed. 2023-12-26 09:14:50 +01:00
antirez
96e7eb2d4c gguf-tools: accept subcommands. 2023-12-26 00:07:56 +01:00
antirez
53fb176b3b Initial API to create new GGUF files.
Also added a few libraries that will be needed soon.
The CLI was renamed with the final name of gguf-tools.
2023-12-25 22:10:07 +01:00
antirez
3eb30c1872 API to remap/rewind + mapping in write mode. 2023-12-25 10:45:38 +01:00
antirez
b3092d3860 Compute tensor size in bytes. 2023-12-24 23:44:24 +01:00
antirez
d54409bc9c Some library layout. 2023-12-24 18:31:03 +01:00