whisper_decoder_layer: Whisper Decoder
Description
Transformer decoder with cross-attention to encoder outputs.
Decoder Layer
Pre-norm transformer decoder layer with self-attention and cross-attention.
Usage
whisper_decoder_layer(n_state, n_head)
Arguments
- n_state
Hidden dimension
- n_head
Number of attention heads