Residential College | false |
Status | 已發表Published |
Revisiting Commonsense Reasoning in Machine Translation: Training, Evaluation and Challenge | |
Xuebo Liu2![]() ![]() ![]() ![]() | |
2023-07 | |
Conference Name | The 61st Annual Meeting of the Association for Computational Linguistics |
Source Publication | Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics
![]() |
Volume | 1: Long Papers |
Pages | 15536-15550 |
Conference Date | JULY 09-14, 2023 |
Conference Place | Toronto |
Country | Candada |
Author of Source | Anna Rogers ; Jordan Boyd-Graber ; Naoaki Okazaki |
Publisher | Association for Computational Linguistics (ACL) |
Abstract | The ability of commonsense reasoning (CR) decides whether a neural machine translation (NMT) model can move beyond pattern recognition. Despite the rapid advancement of NMT and the use of pretraining to enhance NMT models, research on CR in NMT is still in its infancy, leaving much to be explored in terms of effectively training NMT models with high CR abilities and devising accurate automatic evaluation metrics. This paper presents a comprehensive study aimed at expanding the understanding of CR in NMT.For the training, we confirm the effectiveness of incorporating pretrained knowledge into NMT models and subsequently utilizing these models as robust testbeds for investigating CR in NMT. For the evaluation, we propose a novel entity-aware evaluation method that takes into account both the NMT candidate and important entities in the candidate, which is more aligned with human judgement. Based on the strong testbed and evaluation methods, we identify challenges in training NMT models with high CR abilities and suggest directions for further unlabeled data utilization and model design. We hope that our methods and findings will contribute to advancing the research of CR in NMT. Source data, code and scripts are freely available at https://github.com/YutongWang1216/CR-NMT. |
DOI | 10.18653/v1/2023.acl-long.866 |
URL | View the original |
Indexed By | CPCI-S |
WOS Research Area | Computer Science |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Information Systems ; Computer Science, Theory & Methods |
WOS ID | WOS:001190962507019 |
Scopus ID | 2-s2.0-85174407342 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Corresponding Author | Xuebo Liu; Derek F. Wong |
Affiliation | 1.Department of Computer and Information Science, University of Macau 2.Institute of Computing and Intelligence, Harbin Institute of Technology, Shenzhen |
Corresponding Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Xuebo Liu,Yutong Wang,Derek F. Wong,et al. Revisiting Commonsense Reasoning in Machine Translation: Training, Evaluation and Challenge[C]. Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki:Association for Computational Linguistics (ACL), 2023, 15536-15550. |
APA | Xuebo Liu., Yutong Wang., Derek F. Wong., Runzhe Zhan., Liangxuan Yu., & Min Zhang (2023). Revisiting Commonsense Reasoning in Machine Translation: Training, Evaluation and Challenge. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, 1: Long Papers, 15536-15550. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment