Research Paper Replica

Attention Is All You Need (Transformer)

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, Illia Polosukhin

The Transformer is a sequence transduction architecture built solely on attention: it replaces recurrence and convolution while improving translation quality and training efficiency.

Jun 2017, 14 pages, 16 min read

Topics

Transformer, Self-Attention, Sequence Modeling