Quantizes float data to the Q4_0, Q4_1, Q5_0, Q5_1, or Q8_0 format, with an optional importance matrix.
quantize_q4_0(src_data, n_rows, n_per_row, imatrix = NULL)
quantize_q4_1(src_data, n_rows, n_per_row, imatrix = NULL)
quantize_q5_0(src_data, n_rows, n_per_row, imatrix = NULL)
quantize_q5_1(src_data, n_rows, n_per_row, imatrix = NULL)
quantize_q8_0(src_data, n_rows, n_per_row, imatrix = NULL)
Returns a raw vector of quantized data.
src_data: Numeric vector of float values to quantize
n_rows: Number of rows
n_per_row: Number of elements per row
imatrix: Optional importance matrix (numeric vector or NULL)
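A minimal usage sketch follows, using only the signature shown above. It assumes the package that exports quantize_q4_0() is already attached (the guard skips the call otherwise), that src_data is laid out as n_rows * n_per_row values, and that n_per_row should be a multiple of the Q4_0 block size (32 in ggml). None of these constraints are stated on this page, so treat them as assumptions.

```r
# Sketch only: assumes quantize_q4_0() is available in the session.
set.seed(1)

n_rows    <- 2L
n_per_row <- 64L                         # assumed: a multiple of 32
src_data  <- rnorm(n_rows * n_per_row)   # assumed: n_rows * n_per_row values

if (exists("quantize_q4_0", mode = "function")) {
  # imatrix is left at its default (NULL), so no importance weighting is used.
  q <- quantize_q4_0(src_data, n_rows, n_per_row)

  # The documented return value is a raw vector.
  stopifnot(is.raw(q))
  cat("input floats:", length(src_data), "| output bytes:", length(q), "\n")
}
```

The other functions on this page (quantize_q4_1(), quantize_q5_0(), quantize_q5_1(), quantize_q8_0()) take the same arguments, so the same call pattern should apply with a different function name.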
Other quantization:
dequantize_row_iq2_xxs(),
dequantize_row_mxfp4(),
dequantize_row_q2_K(),
dequantize_row_q4_0(),
dequantize_row_tq1_0(),
ggml_quant_block_info(),
iq2xs_free_impl(),
iq2xs_init_impl(),
iq3xs_free_impl(),
iq3xs_init_impl(),
quantize_iq2_xxs(),
quantize_mxfp4(),
quantize_q2_K(),
quantize_row_iq3_xxs_ref(),
quantize_row_mxfp4_ref(),
quantize_row_q2_K_ref(),
quantize_row_q4_0_ref(),
quantize_row_tq1_0_ref(),
quantize_tq1_0()