QuantUpperTriangularLinear
Bases: QuantBase
Quantized `UpperTriangularLinear`.
- `out <= in`: top `out` rows of the `in x in` upper-triangular matrix.
- `out > in`: leading `out - in` rows are fully dense; trailing `in` rows form a square upper-triangular block.

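The two cases above can be sketched with a single rule. This is a minimal, dependency-free illustration, not the library's implementation: the helper name `upper_triangular_mask` is hypothetical, `in` and `out` are assumed to mean `in_features` and `out_features`, the mask is assumed to have shape `(out_features, in_features)`, and the diagonal is assumed to be kept.

```python
def upper_triangular_mask(in_features: int, out_features: int) -> list[list[int]]:
    """Build a 0/1 mask with out_features rows and in_features columns.

    Hypothetical helper for illustration; the diagonal is assumed to be kept.
    """
    # Number of leading rows that are fully dense (zero when out <= in).
    dense_rows = max(out_features - in_features, 0)
    return [
        # Row `row` keeps column `col` when col is on or right of the
        # (shifted) diagonal. For dense rows the bound is negative, so
        # every column is kept.
        [1 if col >= row - dense_rows else 0 for col in range(in_features)]
        for row in range(out_features)
    ]


# out <= in: the top 2 rows of a 4 x 4 upper-triangular matrix.
wide = upper_triangular_mask(in_features=4, out_features=2)

# out > in: 2 dense rows followed by a 2 x 2 upper-triangular block.
tall = upper_triangular_mask(in_features=2, out_features=4)

for name, mask in (("wide", wide), ("tall", tall)):
    print(name)
    for row in mask:
        print(" ", row)
```

Folding both cases into the single comparison `col >= row - dense_rows` avoids a branch on the shape; whether the library builds its mask this way is not stated in the source.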
Fake-quantization is applied to weight * mask, so masked-out entries
remain exactly zero through quantization.
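The following sketch illustrates why quantizing `weight * mask` keeps masked-out entries at zero. It is written in plain Python to stay dependency-free and assumes a symmetric, per-tensor, 8-bit scheme with a max-abs scale; the quantizer actually used by this class is not specified here, and `fake_quantize` is a hypothetical helper.

```python
def fake_quantize(values, num_bits=8):
    """Quantize to a signed integer grid, then map back to floats.

    Assumed scheme: symmetric, per-tensor, scale taken from the max-abs value.
    """
    qmax = 2 ** (num_bits - 1) - 1
    max_abs = max((abs(v) for row in values for v in row), default=0.0)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    return [
        # round -> clamp to [-qmax, qmax] -> rescale back to float
        [max(-qmax, min(qmax, round(v / scale))) * scale for v in row]
        for row in values
    ]


weight = [[0.5, -1.2, 0.3], [0.7, 0.9, -0.4], [-0.6, 0.2, 1.1]]
mask = [[1, 1, 1], [0, 1, 1], [0, 0, 1]]  # square upper-triangular mask

# Mask first, then quantize: zero maps to integer level 0 and back to 0.0.
masked = [
    [w * m for w, m in zip(w_row, m_row)]
    for w_row, m_row in zip(weight, mask)
]
quantized = fake_quantize(masked)

for row in quantized:
    print(row)
```

Masking before quantization also means the scale is computed only from entries that are actually used, so masked-out values cannot widen the quantization range.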
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `in_features` | `int` | Number of input features. | required |
| `out_features` | `int` | Number of output features. | required |
| `bias` | `bool` | Whether to include an additive bias. | `True` |
| `act_func` | `str \| None` | Either | `None` |
| `ema_constant` | `float` | EMA smoothing factor for observers. | `0.01` |
| `device` | `str` | Torch device for the parameters. | `'cpu'` |