Adds C++ and nn quantization utilities (#230)

* Add C++ de-/quantize ops
* Add quantize functions to the docs and tests
* Add a QuantizedLinear module
Angelos Katharopoulos
2023-12-20 14:17:38 -08:00
committed by GitHub
parent 4912ff3ec2
commit 57fe918cf8
12 changed files with 451 additions and 68 deletions

@@ -26,3 +26,4 @@ Layers
RoPE
MultiHeadAttention
Sequential
+QuantizedLinear

@@ -34,6 +34,7 @@ Operations
conv2d
cos
cosh
+dequantize
divide
equal
erf
@@ -73,6 +74,8 @@ Operations
partition
pad
prod
+quantize
+quantized_matmul
reciprocal
reshape
round