optimizer_adam

Adam optimizer as described in [Adam - A Method for Stochastic
Optimization](https://arxiv.org/abs/1412.6980v8).

Implements additional models, simulation tools, and interfaces as extensions to 'SPOT'.
It provides tools for hyperparameter tuning via 'keras/tensorflow', interfacing 'mlr', for performing Markov chain simulations,
and for sensitivity analysis based on sequential bifurcation methods as described in Bettonvil and Kleijnen (1996).
Furthermore, additional plotting functions for output from 'SPOT' runs are implemented.
Bartz-Beielstein T, Lasarczyk C W G, Preuss M (2005) <doi:10.1109/CEC.2005.1554761>.
Bartz-Beielstein T, Zaefferer M, Rehbach F (2021) <arXiv:1712.04076>.
Bartz-Beielstein T, Rehbach F, Sen A, Zaefferer M <arXiv:2105.14625>.
Bettonvil, B, Kleijnen JPC (1996) <doi:10.1016/S0377-2217(96)00156-7>.

Thomas Bartz-Beielstein

SPOTMisc

Misc Extensions for the 'SPOT' Package

Martin Zaefferer

Frederik Rehbach

optimizer_adam function

<dl><dt>learning_rate</dt>
<dd>float &gt;= 0. Learning rate.</dd>
<dt>beta_1</dt>
<dd>The exponential decay rate for the 1st moment estimates. float,
0 &lt; beta &lt; 1. Generally close to 1.</dd>
<dt>beta_2</dt>
<dd>The exponential decay rate for the 2nd moment estimates. float,
0 &lt; beta &lt; 1. Generally close to 1.</dd>
<dt>epsilon</dt>
<dd>float &gt;= 0. Fuzz factor. If `NULL`, defaults to `k_epsilon()`.</dd>
<dt>decay</dt>
<dd>float &gt;= 0. Learning rate decay over each update.</dd>
<dt>amsgrad</dt>
<dd>Whether to apply the AMSGrad variant of this algorithm from
the paper "On the Convergence of Adam and Beyond".</dd>
<dt>clipnorm</dt>
<dd>Gradients will be clipped when their L2 norm exceeds this
value.</dd>
<dt>clipvalue</dt>
<dd>Gradients will be clipped when their absolute value exceeds
this value.</dd>
<dt>...</dt>
<dd>Unused, present only for backwards compatability</dd></dl>

Arguments

Adam optimizer — optimizer_adam

<dl>

<dt>learning_rate</dt>
<dd>float &gt;= 0. Learning rate.</dd>


<dt>beta_1</dt>
<dd>The exponential decay rate for the 1st moment estimates. float,
0 &lt; beta &lt; 1. Generally close to 1.</dd>


<dt>beta_2</dt>
<dd>The exponential decay rate for the 2nd moment estimates. float,
0 &lt; beta &lt; 1. Generally close to 1.</dd>


<dt>epsilon</dt>
<dd>float &gt;= 0. Fuzz factor. If `NULL`, defaults to `k_epsilon()`.</dd>


<dt>decay</dt>
<dd>float &gt;= 0. Learning rate decay over each update.</dd>


<dt>amsgrad</dt>
<dd>Whether to apply the AMSGrad variant of this algorithm from
the paper "On the Convergence of Adam and Beyond".</dd>


<dt>clipnorm</dt>
<dd>Gradients will be clipped when their L2 norm exceeds this
value.</dd>


<dt>clipvalue</dt>
<dd>Gradients will be clipped when their absolute value exceeds
this value.</dd>


<dt>...</dt>
<dd>Unused, present only for backwards compatability</dd>

</dl>

optimizer_adam: Adam optimizer

Description

Usage

Arguments

References

See Also