Status已發表Published
Modeling Localness for Self-Attention Networks
Yang, B.; Tu, Z.; Wong, F.; Meng, F.; Chao, L.; Zhang, T.
2018-10-31
Source PublicationProceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Pages4449-4458
Publication PlaceBrussels
PublisherAssociation for Computational Linguistics
AbstractSelf-attention networks have proven to be of profound value for its strength of capturing global dependencies. In this work, we propose to model localness for self-attention networks, which enhances the ability of capturing useful local context. We cast localness modeling as a learnable Gaussian bias, which indicates the central and scope of the local region to be paid more attention. The bias is then incorporated into the original attention distribution to form a revised distribution. To maintain the strength of capturing long distance dependencies and enhance the ability of capturing short-range dependencies, we only apply localness modeling to lower layers of self-attention networks. Quantitative and qualitative analyses on Chinese⇒English and English⇒German translation tasks demonstrate the effectiveness and universality of the proposed approach.
KeywordNeural Machine Translation Transformer Self-Attention Models Local Context
Language英語English
The Source to ArticlePB_Publication
PUB ID42494
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Corresponding AuthorWong, F.
Recommended Citation
GB/T 7714
Yang, B.,Tu, Z.,Wong, F.,et al. Modeling Localness for Self-Attention Networks[C], Brussels:Association for Computational Linguistics, 2018, 4449-4458.
APA Yang, B.., Tu, Z.., Wong, F.., Meng, F.., Chao, L.., & Zhang, T. (2018). Modeling Localness for Self-Attention Networks. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 4449-4458.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Yang, B.]'s Articles
[Tu, Z.]'s Articles
[Wong, F.]'s Articles
Baidu academic
Similar articles in Baidu academic
[Yang, B.]'s Articles
[Tu, Z.]'s Articles
[Wong, F.]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yang, B.]'s Articles
[Tu, Z.]'s Articles
[Wong, F.]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.