Transition to Transformers
Encoder
Decoder
h0
h1
h2
h3
h4
h5
I
enjoyed
the
movie
transformers
Naan
transfarmar
padaththai
s0=h5
s1
s2
s3
Naan
transfarmar
padaththai
rasiththen
Transition to Transformers
Encoder
Decoder
Naan
transfarmar
padaththai
s1
s2
s3
Naan
transfarmar
padaththai
rasiththen
h1
h2
h3
h4
h5
α11
α12
α13
α14
α15
Transition to Transformers
Encoder
Decoder
s1
s2
s3
Naan
transfarmar
padaththai
rasiththen
h1
h2
h3
h4
h5
Feed Forward Networks
Encoder-Decoder Attention
Self-Attention
Feed Forward Networks
Self-Attention
s4
s5
z1
z2
z3
z4
z5
Word Embedding
I
enjoyed
the
movie
transformers
We will see each of these components (and a few more) one at a time in detail and connect them together to synthesize the final architecture
Transition to Transformers
Encoder
Self-Attention
I
enjoyed
the
movie
transformers
| 1 | 0 | 0 |
|---|
| 1 | 0 | 1 |
|---|
| 1 | 1 | 0 |
|---|
| 1 | 1 | 1 |
|---|
| 0 | 1 | 0 |
|---|
Transition to Transformers
By Sshubam Verma
Transition to Transformers
- 13