powered by
DiffGrad
DiffGrad(betas = c(0.9, 0.999), eps = 1e-08, weight_decay = 0)
Anonymous function that returns optimizer when called.
betas
eps
weight_decay