Quantizes float data to one of the K-quant formats (Q2_K, Q3_K, Q4_K, Q5_K, Q6_K), optionally weighted by an importance matrix. K-quants generally give a better quality/size tradeoff than basic quants such as Q4_0.
quantize_q2_K(src_data, n_rows, n_per_row, imatrix = NULL)
quantize_q3_K(src_data, n_rows, n_per_row, imatrix = NULL)
quantize_q4_K(src_data, n_rows, n_per_row, imatrix = NULL)
quantize_q5_K(src_data, n_rows, n_per_row, imatrix = NULL)
quantize_q6_K(src_data, n_rows, n_per_row, imatrix = NULL)
Value: raw vector of quantized data
Arguments:
src_data: numeric vector of float values to quantize
n_rows: number of rows
n_per_row: number of elements per row
imatrix: optional importance matrix (numeric vector or NULL)
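As a rough illustration of how the arguments fit together, the sketch below estimates the length of the returned raw vector and then calls `quantize_q4_K()` only if it is available in the session. It rests on several assumptions that this page does not state: that the package follows upstream ggml, where K-quants pack 256 values per super-block (so `n_per_row` should be a multiple of 256); that the bytes-per-block figures match upstream ggml's block structs; that `src_data` holds `n_rows * n_per_row` values; and that `imatrix`, when given, has one weight per column. The helper `expected_k_quant_bytes()` is hypothetical and not part of the package.

```r
# Bytes per 256-element super-block, taken from upstream ggml's block structs.
# Assumption: this package uses the same layouts.
QK_K <- 256L
k_quant_block_bytes <- c(q2_K = 84, q3_K = 110, q4_K = 144,
                         q5_K = 176, q6_K = 210)

# Hypothetical helper: expected size in bytes of the quantized output.
expected_k_quant_bytes <- function(type, n_rows, n_per_row) {
  stopifnot(type %in% names(k_quant_block_bytes),
            n_per_row %% QK_K == 0)
  unname(k_quant_block_bytes[type]) * (n_per_row / QK_K) * n_rows
}

set.seed(1)
n_rows    <- 4L
n_per_row <- 512L                       # two super-blocks per row
src_data  <- rnorm(n_rows * n_per_row)  # values to quantize

expected <- expected_k_quant_bytes("q4_K", n_rows, n_per_row)
bits_per_weight <- 8 * expected / length(src_data)

# Only call into the package if it is loaded in this session.
if (exists("quantize_q4_K")) {
  packed <- quantize_q4_K(src_data, n_rows, n_per_row)
  message("returned ", length(packed), " bytes; estimate was ", expected)

  # Assumption: imatrix carries one importance weight per column.
  # Uniform weights are used here purely as a placeholder.
  packed_w <- quantize_q4_K(src_data, n_rows, n_per_row,
                            imatrix = rep(1, n_per_row))
}
```

If the estimate and the actual length disagree, the block-size table above is the first thing to check against `ggml_quant_block_info()`.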
Other quantization:
dequantize_row_iq2_xxs(),
dequantize_row_mxfp4(),
dequantize_row_q2_K(),
dequantize_row_q4_0(),
dequantize_row_tq1_0(),
ggml_quant_block_info(),
iq2xs_free_impl(),
iq2xs_init_impl(),
iq3xs_free_impl(),
iq3xs_init_impl(),
quantize_iq2_xxs(),
quantize_mxfp4(),
quantize_q4_0(),
quantize_row_iq3_xxs_ref(),
quantize_row_mxfp4_ref(),
quantize_row_q2_K_ref(),
quantize_row_q4_0_ref(),
quantize_row_tq1_0_ref(),
quantize_tq1_0()