ggmlR (version 0.6.1)

ggml_layer_embedding: Add Embedding Layer

Description

Looks up dense vectors for integer token indices. The input must be an integer matrix of 0-based indices in [0, vocab_size - 1] (use ggml_input(shape, dtype = "int32") in Functional mode).
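Because R indexing is 1-based, token ids produced by common R idioms (e.g. factor codes) must be shifted down by one before they are fed to this layer. A minimal sketch (the token data below is illustrative):

```r
# R factor codes are 1-based, but this layer expects 0-based indices,
# so subtract 1L before feeding the ids to the model.
words  <- c("the", "cat", "sat", "the", "mat")
ids_1b <- as.integer(factor(words))   # 1-based codes
ids_0b <- ids_1b - 1L                 # 0-based, in [0, vocab_size - 1]
```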

Usage

ggml_layer_embedding(model, vocab_size, dim, name = NULL, trainable = TRUE)

Value

The model with the embedding layer appended (when `model` is a ggml_sequential_model), or a new ggml_tensor_node (when called on a tensor node in Functional mode).

Arguments

model

A ggml_sequential_model or ggml_tensor_node.

vocab_size

Number of distinct tokens (vocabulary size).

dim

Embedding dimension (vector length per token).

name

Optional layer name.

trainable

Logical; whether embedding weights are updated during training.

Axis order (ggml vs Keras)

ggml stores tensors in column-major order, so the output shape per sample is [dim, seq_len] (ggml convention) rather than [seq_len, dim] as in Keras. When you call ggml_layer_flatten() after embedding, the flattened vector is the same under either convention, but if you access raw output tensors directly, be aware of this transposition.
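The equivalence under flattening can be checked in base R: reading a [dim, seq_len] matrix column by column visits the same values, in the same order, as reading its [seq_len, dim] transpose row by row.

```r
# Two tokens, dim = 3; rows of `keras_style` are per-token embedding
# vectors, i.e. the Keras [seq_len, dim] layout.
keras_style <- matrix(c(1, 2, 3,
                        4, 5, 6), nrow = 2, byrow = TRUE)
# The ggml layout is the transpose: [dim, seq_len].
ggml_style <- t(keras_style)
# Keras flattens row-major (token by token) ...
keras_flat <- as.vector(t(keras_style))
# ... while ggml flattens column-major, which also walks token by token.
ggml_flat <- as.vector(ggml_style)
identical(keras_flat, ggml_flat)  # TRUE
```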

Index validation

Indices must be in [0, vocab_size - 1]. Out-of-range values cause undefined behaviour inside the ggml kernel (no bounds check is performed at the R level).
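Since no bounds check happens at the R level, a small guard before feeding indices to the model is cheap insurance. A sketch (`check_indices` is a hypothetical helper, not part of ggmlR):

```r
# Hypothetical guard: validate 0-based integer indices before they
# reach the ggml kernel, where out-of-range values are undefined.
check_indices <- function(x, vocab_size) {
  stopifnot(is.integer(x), all(x >= 0L), all(x < vocab_size))
  invisible(x)
}
tokens <- matrix(sample.int(1000L, 10L) - 1L, nrow = 1L)  # ids in [0, 999]
check_indices(tokens, vocab_size = 1000L)
```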

Examples

# \donttest{
inp <- ggml_input(shape = 10L, dtype = "int32")
out <- inp |>
  ggml_layer_embedding(vocab_size = 1000L, dim = 32L) |>
  ggml_layer_flatten() |>
  ggml_layer_dense(10L, activation = "softmax")
model <- ggml_model(inputs = inp, outputs = out)
# }