6 Commits

Author SHA1 Message Date
Justine Tunney
ede59bb742 Add BF16 support and fix warnings
This change updates the data type definitions to be the same as the
latest source code. Support for the bfloat16 data type is available
however it can't interpret the IQ quantization formats yet. Cleanup
of compiler warnings and other nits have been fixed, but behavioral
changes have been avoided, and no new features are as of yet added.
2024-05-25 22:58:50 -07:00
antirez
ff16bc3dcf Speed: use the right compilation flags to dequantize faster. 2024-01-03 21:02:47 +01:00
antirez
bd4ecbda94 FP16 added. Split-mixtral improved. 2023-12-27 15:25:18 +01:00
antirez
53fb176b3b Initial API to create new GGUF files.
Also added a few libraries that will be needed soon.
The CLI was renamed with the final name of gguf-tools.
2023-12-25 22:10:07 +01:00
antirez
d54409bc9c Some library layout. 2023-12-24 18:31:03 +01:00
antirez
b47eaca8d1 GGUF parsing, initial design and functionalities. 2023-12-24 12:06:32 +01:00