powered by
Transformer encoder for processing mel spectrograms. Multi-Head Self-Attention
whisper_attention(n_state, n_head)
Hidden dimension
Number of attention heads