Commit Graph

44 Commits

| Author | SHA1 | Message | Date |
| --- | --- | --- | --- |
| antirez | 6eb4aeb2fb | gguf_create(): take flags to be able to overwrite files. Fixes #7. | 2024-01-09 16:32:10 +01:00 |
| Salvatore Sanfilippo | 81dbf8f8d2 | Merge pull request #6 from jbochi/reverse_stride: Print tensor with correct strides | 2024-01-09 15:48:46 +01:00 |
| antirez | 419d4706f6 | Q2_K dequantization. | 2024-01-05 23:38:47 +01:00 |
| Juarez Bochi | 50e79b9ec0 | Print tensor with correct strides | 2024-01-05 09:59:59 -05:00 |
| Salvatore Sanfilippo | e48ca317ea | Merge pull request #5 from jbochi/inspect_shape: Inspect tensor taking dims into consideration | 2024-01-04 20:32:19 +01:00 |
| Salvatore Sanfilippo | a42344e197 | Merge pull request #4 from jbochi/show_shape: Print tensor dimensions | 2024-01-04 20:31:23 +01:00 |
| Salvatore Sanfilippo | 92e1c67b8b | Merge pull request #3 from jbochi/int_type_features: Add tensor type features for int types | 2024-01-04 20:30:32 +01:00 |
| Juarez Bochi | 58a0479bb4 | Inspect tensor taking dims into consideration | 2024-01-04 11:44:13 -05:00 |
| Juarez Bochi | a7e99574e2 | Print tensor dimensions | 2024-01-03 17:41:33 -05:00 |
| Juarez Bochi | 5d10eaac8d | Add tensor type features for int types | 2024-01-03 16:33:47 -05:00 |
| antirez | b1f32c4088 | Quantization functions refactoring. | 2024-01-03 21:02:47 +01:00 |
| antirez | ff16bc3dcf | Speed: use the right compilation flags to dequantize faster. | 2024-01-03 21:02:47 +01:00 |
| Salvatore Sanfilippo | b4e7da4ceb | Merge pull request #1 from jbochi/typos: Fix some typos | 2024-01-03 14:54:30 +01:00 |
| Salvatore Sanfilippo | 04ec28ed35 | Merge pull request #2 from jbochi/check_remap: Check remap when appending kv/info/data | 2024-01-03 14:53:41 +01:00 |
| Juarez Bochi | 463fd63cf2 | Check remap when appending kv/info/data | 2024-01-03 08:01:00 -05:00 |
| Juarez Bochi | e5cdcec626 | Fix some typos | 2024-01-03 07:34:12 -05:00 |
| antirez | c8469c4a27 | Q6_K quantization implemented. | 2023-12-31 14:06:49 +01:00 |
| antirez | 54b93edecb | README: grammar. | 2023-12-30 18:08:27 +01:00 |
| antirez | 4a5dfdcdad | README: show subcommand example output. | 2023-12-30 18:02:21 +01:00 |
| antirez | 53e7b2b156 | README: grammar. | 2023-12-30 18:00:23 +01:00 |
| antirez | e8b405aac8 | README updated. | 2023-12-30 17:29:44 +01:00 |
| antirez | a4858afb4d | Implement f16/f32 in gguf_tensor_to_float(). | 2023-12-30 17:23:27 +01:00 |
| antirez | 136e04977c | README: add compare example. | 2023-12-30 15:47:52 +01:00 |
| antirez | 951ce0e3c4 | Compare subcommand: report difference as %. | 2023-12-30 15:43:44 +01:00 |
| antirez | 3663d73c22 | Compare subcommand: just skip tensors we can't yet dequantize. | 2023-12-30 10:13:38 +01:00 |
| antirez | 400f60b75b | --verbose and README updated. | 2023-12-29 22:50:41 +01:00 |
| antirez | 54946cbf14 | Compare subcommand. | 2023-12-28 17:24:05 +01:00 |
| antirez | 2a599dc5d0 | Show subcommand: print total parameters. | 2023-12-28 16:07:16 +01:00 |
| antirez | e2062eea2c | Q4_K dequantization. | 2023-12-28 12:31:35 +01:00 |
| antirez | c25ccfa02a | Q8_0 dequantization. | 2023-12-27 21:22:33 +01:00 |
| antirez | 558c7c3c6d | Clarify the need for FP16 implementation. | 2023-12-27 18:54:36 +01:00 |
| antirez | bd4ecbda94 | FP16 added. Split-mixtral improved. | 2023-12-27 15:25:18 +01:00 |
| antirez | a77a4d061c | Mixtral experts extraction test. | 2023-12-26 17:23:47 +01:00 |
| antirez | 7e9c2bd6a7 | Better explain the tensor total size math. | 2023-12-26 09:20:54 +01:00 |
| antirez | 3081d69b8e | split-mixtral: copying of keys + APIs needed. | 2023-12-26 09:14:50 +01:00 |
| antirez | 96e7eb2d4c | gguf-tools: accept subcommands. | 2023-12-26 00:07:56 +01:00 |
| antirez | 53fb176b3b | Initial API to create new GGUF files. Also added a few libraries that will be needed soon. The CLI was renamed with the final name of gguf-tools. | 2023-12-25 22:10:07 +01:00 |
| antirez | 3eb30c1872 | API to remap/rewind + mapping in write mode. | 2023-12-25 10:45:38 +01:00 |
| antirez | f400e8a36f | README added. | 2023-12-24 23:46:46 +01:00 |
| antirez | b3092d3860 | Compute tensor size in bytes. | 2023-12-24 23:44:24 +01:00 |
| antirez | d54409bc9c | Some library layout. | 2023-12-24 18:31:03 +01:00 |
| antirez | 55a15a4230 | Tensors parsing. | 2023-12-24 17:20:04 +01:00 |
| antirez | 4ff25fb178 | Limit array items printed. | 2023-12-24 12:21:41 +01:00 |
| antirez | b47eaca8d1 | GGUF parsing, initial design and functionalities. | 2023-12-24 12:06:32 +01:00 |