Commit Graph

20 Commits

Author SHA1 Message Date
Justine Tunney
6deab767f9 Introduce --diffable flag
Sometimes it's useful to get an overview of how tensors change when
using different quantization formats. For example:

  diff -u <(gguf-tools show --diffable ggml-model-bf16.gguf) \
          <(gguf-tools show --diffable ggml-model-Q6_K.gguf) | less

This now produces clean, readable output. Without this change, every
line would have differed because of the file offsets and byte sizes,
which means `diff -u` would produce one gigantic unreadable chunk.
2024-05-26 00:23:41 -07:00
Justine Tunney
ede59bb742 Add BF16 support and fix warnings
This change updates the data type definitions to match the latest
source code. Support for the bfloat16 data type is now available,
but the tool can't interpret the IQ quantization formats yet.
Compiler warnings and other nits have been cleaned up, but behavioral
changes have been avoided, and no new features have been added yet.
2024-05-25 22:58:50 -07:00
antirez
26e3a59233 Rename gguf_init/end to more obvious names. 2024-01-09 16:35:40 +01:00
antirez
6eb4aeb2fb gguf_create(): take flags to be able to overwrite files. Fixes #7. 2024-01-09 16:32:10 +01:00
Juarez Bochi
50e79b9ec0 Print tensor with correct strides 2024-01-05 09:59:59 -05:00
Salvatore Sanfilippo
e48ca317ea Merge pull request #5 from jbochi/inspect_shape
Inspect tensor taking dims into consideration
2024-01-04 20:32:19 +01:00
Juarez Bochi
58a0479bb4 Inspect tensor taking dims into consideration 2024-01-04 11:44:13 -05:00
Juarez Bochi
a7e99574e2 Print tensor dimensions 2024-01-03 17:41:33 -05:00
antirez
951ce0e3c4 Compare subcommand: report difference as %. 2023-12-30 15:43:44 +01:00
antirez
3663d73c22 Compare subcommand: just skip tensors we can't yet dequantize. 2023-12-30 10:13:38 +01:00
antirez
400f60b75b --verbose and README updated. 2023-12-29 22:50:41 +01:00
antirez
54946cbf14 Compare subcommand. 2023-12-28 17:24:05 +01:00
antirez
2a599dc5d0 Show subcommand: print total parameters. 2023-12-28 16:07:16 +01:00
antirez
e2062eea2c Q4_K dequantization. 2023-12-28 12:31:35 +01:00
antirez
c25ccfa02a Q8_0 dequantization. 2023-12-27 21:22:33 +01:00
antirez
bd4ecbda94 FP16 added. Split-mixtral improved. 2023-12-27 15:25:18 +01:00
antirez
a77a4d061c Mixtral experts extraction test. 2023-12-26 17:23:47 +01:00
antirez
3081d69b8e split-mixtral: copying of keys + APIs needed. 2023-12-26 09:14:50 +01:00
antirez
96e7eb2d4c gguf-tools: accept subcommands. 2023-12-26 00:07:56 +01:00
antirez
53fb176b3b Initial API to create new GGUF files.
Also added a few libraries that will be needed soon.
The CLI was renamed with the final name of gguf-tools.
2023-12-25 22:10:07 +01:00
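
As a rough illustration of the bfloat16 data type mentioned in commit ede59bb742: a bfloat16 value is the upper 16 bits of an IEEE 754 float32, so it can be decoded by widening the bit pattern. The sketch below is not taken from the repository. The function names `bf16_to_float` and `float_to_bf16_truncate` are hypothetical, and the encoder simply truncates rather than rounding, which a real implementation may handle differently.

```python
import struct


def bf16_to_float(bits: int) -> float:
    """Decode a 16-bit bfloat16 bit pattern (hypothetical helper)."""
    # bfloat16 keeps the sign, the 8 exponent bits, and the top 7
    # mantissa bits of a float32, so shifting left by 16 yields a
    # valid float32 bit pattern with the low mantissa bits zeroed.
    widened = (bits & 0xFFFF) << 16
    return struct.unpack("<f", struct.pack("<I", widened))[0]


def float_to_bf16_truncate(value: float) -> int:
    """Encode a float as bfloat16 by dropping the low 16 bits.

    Assumption: plain truncation, with no rounding and no special
    NaN handling, purely to keep the sketch short.
    """
    as_u32 = struct.unpack("<I", struct.pack("<f", value))[0]
    return as_u32 >> 16


if __name__ == "__main__":
    for pattern in (0x3F80, 0xC000, 0x4049):
        print(f"0x{pattern:04X} -> {bf16_to_float(pattern)}")
```

Truncation loses precision: encoding 3.14159 and decoding it again gives 3.140625, because only 7 mantissa bits survive.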