- mode
A single character string for the type of model. Possible values
are "unknown", "regression", or "classification".
- engine
A single character string specifying what computational engine
to use for fitting. Currently only "kindling" is supported.
- hidden_neurons
An integer vector for the number of units in each hidden
layer. Can be tuned.
- activations
A character vector of activation function names for each
hidden layer (e.g., "relu", "tanh", "sigmoid"). Can be tuned.
- output_activation
A character string for the output activation function.
Can be tuned.
- bias
Logical for whether to include bias terms. Can be tuned.
- epochs
An integer for the number of training iterations. Can be tuned.
- batch_size
An integer for the batch size during training. Can be tuned.
- penalty
A number for the regularization penalty (lambda). Default 0
(no regularization). Higher values increase regularization strength. Can be tuned.
- mixture
A number between 0 and 1 for the elastic net mixing parameter.
Default 0 (pure L2/Ridge regularization).
0: Pure L2 regularization (Ridge)
1: Pure L1 regularization (Lasso)
0 < mixture < 1: Elastic net (combination of L1 and L2)
Only relevant when penalty > 0. Can be tuned.
- learn_rate
A number for the learning rate. Can be tuned.
- optimizer
A character string for the optimizer type ("adam", "sgd",
"rmsprop"). Can be tuned.
- optimizer_args
A named list of additional arguments passed to the optimizer.
Cannot be tuned.
- loss
A character string for the loss function ("mse", "mae",
"cross_entropy", "bce"). Cannot be tuned.
- validation_split
A number between 0 and 1 for the proportion of data
used for validation. Can be tuned.
- device
A character string for the device to use ("cpu", "cuda", "mps").
If NULL, auto-detects available GPU. Cannot be tuned.
- verbose
Logical for whether to print training progress. Default FALSE.
Cannot be tuned.