Residential College | false |
Status | 已發表Published |
Domain Adaptation for Medical Text Translation using Web Resources | |
Yi Lu; Longyue Wang; Derek F. Wong; Lidia S. Chao; Yiming Wang; Francisco Oliveira | |
2014 | |
Conference Name | Proceedings of the Ninth Workshop on Statistical Machine Translation |
Source Publication | the Ninth Workshop on Statistical Machine Translation |
Pages | 233–238 |
Conference Date | June 26–27, 2014 |
Conference Place | Baltimore, Maryland USA |
Abstract | This paper describes adapting statistical machine translation (SMT) systems to medical domain using in-domain and general-domain data as well as webcrawled in-domain resources. In order to complement the limited in-domain corpora, we apply domain focused webcrawling approaches to acquire indomain monolingual data and bilingual lexicon from the Internet. The collected data is used for adapting the language model and translation model to boost the overall translation quality. Besides, we propose an alternative filtering approach to clean the crawled data and to further optimize the domain-specific SMT system. We attend the medical summary sentence unconstrained translation task of the Ninth Workshop on Statistical Machine Translation (WMT2014). Our systems achieve the second best BLEU scores for Czech-English, fourth for French-English, English-French language pairs and the third best results for reminding pairs. |
DOI | 10.3115/v1/W14-3328 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-84981699321 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | Faculty of Science and Technology DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | Natural Language Processing & Portuguese-Chinese Machine Translation Laboratory, Department of Computer and Information Science, University of Macau, Macau, China |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Yi Lu,Longyue Wang,Derek F. Wong,et al. Domain Adaptation for Medical Text Translation using Web Resources[C], 2014, 233–238. |
APA | Yi Lu., Longyue Wang., Derek F. Wong., Lidia S. Chao., Yiming Wang., & Francisco Oliveira (2014). Domain Adaptation for Medical Text Translation using Web Resources. the Ninth Workshop on Statistical Machine Translation, 233–238. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment