17. Dig into the components
In this chapter, we dig into the core components of the Transformer model. In particular, we analyze the limitations of the original Transformer and explore key improvements introduced by subsequent research.
Reference
- A Survey of Transformers (20.Oct.2022)
Part Contents
17.1. Multi-Head Attention
17.2. Positional Encoding
17.3. Position-wise Feed-Forward Networks (FFN)