Status: Published
Domain Adaptation for Medical Text Translation using Web Resources
Yi Lu; Longyue Wang; Derek F. Wong; Lidia S. Chao; Yiming Wang; Francisco Oliveira
2014
Conference Name: Proceedings of the Ninth Workshop on Statistical Machine Translation
Source Publication: the Ninth Workshop on Statistical Machine Translation
Pages: 233–238
Conference Date: June 26–27, 2014
Conference Place: Baltimore, Maryland, USA
Abstract

This paper describes the adaptation of statistical machine translation (SMT) systems to the medical domain using in-domain and general-domain data as well as web-crawled in-domain resources. To complement the limited in-domain corpora, we apply domain-focused web-crawling approaches to acquire in-domain monolingual data and a bilingual lexicon from the Internet. The collected data is used to adapt the language model and the translation model in order to improve overall translation quality. In addition, we propose an alternative filtering approach to clean the crawled data and further optimize the domain-specific SMT system. We participated in the unconstrained medical summary sentence translation task of the Ninth Workshop on Statistical Machine Translation (WMT2014). Our systems achieved the second-best BLEU score for Czech-English, the fourth-best for French-English and English-French, and the third-best for the remaining language pairs.

DOI: 10.3115/v1/W14-3328
Language: English
Scopus ID: 2-s2.0-84981699321
Document Type: Conference paper
Collection: Faculty of Science and Technology
DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
Affiliation: Natural Language Processing & Portuguese-Chinese Machine Translation Laboratory, Department of Computer and Information Science, University of Macau, Macau, China
First Author Affiliation: University of Macau
Recommended Citation
GB/T 7714: Yi Lu, Longyue Wang, Derek F. Wong, et al. Domain Adaptation for Medical Text Translation using Web Resources[C], 2014, 233–238.
APA: Lu, Y., Wang, L., Wong, D. F., Chao, L. S., Wang, Y., & Oliveira, F. (2014). Domain Adaptation for Medical Text Translation using Web Resources. Proceedings of the Ninth Workshop on Statistical Machine Translation, 233–238.