
Last chance! 50% off unlimited learning
Sale ends in
Creates a criterion that uses a squared term if the absolute
element-wise error falls below 1 and an L1 term otherwise.
It is less sensitive to outliers than the MSELoss
and in some cases
prevents exploding gradients (e.g. see Fast R-CNN
paper by Ross Girshick).
Also known as the Huber loss:
nn_smooth_l1_loss(reduction = "mean")
(string, optional): Specifies the reduction to apply to the output:
'none'
| 'mean'
| 'sum'
. 'none'
: no reduction will be applied,
'mean'
: the sum of the output will be divided by the number of
elements in the output, 'sum'
: the output will be summed. Note: size_average
and reduce
are in the process of being deprecated, and in the meantime,
specifying either of those two args will override reduction
. Default: 'mean'
Input:
Target:
Output: scalar. If reduction
is 'none'
, then
where
reduction = 'sum'
.