Wrapper around tensorflow layer_norm function. From tensorflow documentation:
Adds a Layer Normalization layer. Based on the paper:
https://arxiv.org/abs/1607.06450.
Note: begin_norm_axis
: The first normalization dimension:
normalization will be performed along dimensions (begin_norm_axis :
rank(inputs) )
begin_params_axis
: The first parameter (beta, gamma) dimension: scale
and centering parameters will have dimensions (begin_params_axis :
rank(inputs) ) and will be broadcast with the normalized inputs accordingly.