gguf-tools

mirror of https://github.com/antirez/gguf-tools.git synced 2025-09-17 19:08:07 +08:00

Author	SHA1	Message	Date
Justine Tunney	6deab767f9	Introduce --diffable flag Sometimes it's useful to get an overview of how tensors changes when using different quantization formats. For example: diff -u <(gguf-tools show --diffable ggml-model-bf16.gguf) \ <(gguf-tools show --diffable ggml-model-Q6_K.gguf) \| less Is now able to produces nice clean output. Without this change, every line would have been different due to the file offsets and byte sizes which means `diff -u` would produce one gigantic unreadable chunk.	2024-05-26 00:23:41 -07:00
Justine Tunney	ede59bb742	Add BF16 support and fix warnings This change updates the data type definitions to be the same as the latest source code. Support for the bfloat16 data type is available however it can't interpret the IQ quantization formats yet. Cleanup of compiler warnings and other nits have been fixed, but behavioral changes have been avoided, and no new features are as of yet added.	2024-05-25 22:58:50 -07:00
antirez	26e3a59233	Rename gguf_init/end to more obvious names.	2024-01-09 16:35:40 +01:00
antirez	6eb4aeb2fb	gguf_create(): take flags to be able to overwrite files. Fixes #7 .	2024-01-09 16:32:10 +01:00
Juarez Bochi	50e79b9ec0	Print tensor with correct strides	2024-01-05 09:59:59 -05:00
Salvatore Sanfilippo	e48ca317ea	Merge pull request #5 from jbochi/inspect_shape Inspect tensor taking dims into consideration	2024-01-04 20:32:19 +01:00
Juarez Bochi	58a0479bb4	Inspect tensor taking dims into consideration	2024-01-04 11:44:13 -05:00
Juarez Bochi	a7e99574e2	Print tensor dimensions	2024-01-03 17:41:33 -05:00
antirez	951ce0e3c4	Compare subcommand: report difference as %.	2023-12-30 15:43:44 +01:00
antirez	3663d73c22	Compare subcommand: just skip tensors we can't yet dequantize.	2023-12-30 10:13:38 +01:00
antirez	400f60b75b	--verbose and README updated.	2023-12-29 22:50:41 +01:00
antirez	54946cbf14	Compare subcommand.	2023-12-28 17:24:05 +01:00
antirez	2a599dc5d0	Show subcommand: print total parameters.	2023-12-28 16:07:16 +01:00
antirez	e2062eea2c	Q4_K dequantization.	2023-12-28 12:31:35 +01:00
antirez	c25ccfa02a	Q8_0 dequantization.	2023-12-27 21:22:33 +01:00
antirez	bd4ecbda94	FP16 added. Split-mixtral improved.	2023-12-27 15:25:18 +01:00
antirez	a77a4d061c	Mixtral experts extraction test.	2023-12-26 17:23:47 +01:00
antirez	3081d69b8e	split-mixtral: copying of keys + APIs needed.	2023-12-26 09:14:50 +01:00
antirez	96e7eb2d4c	gguf-tools: accept subcommands.	2023-12-26 00:07:56 +01:00
antirez	53fb176b3b	Initial API to create new GGUF files. Also added a few libraries that will be needed soon. The CLI was renamed with the final name of gguf-tools.	2023-12-25 22:10:07 +01:00

20 Commits