Residential College | false |
Status | Published |
Augmented parsing of unknown word by graph-based semi-supervised learning | |
Huang Q.; Wong D.F.; Chao L.S.; Zeng X.; He L. | |
2013 | |
Conference Name | the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC 27) |
Source Publication | Proceedings of the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC 27) |
Pages | 474-482 |
Conference Date | 2013 November |
Conference Place | Taipei, Taiwan |
Abstract | This paper presents a novel method that uses graph-based semi-supervised learning (SSL) to improve the syntactic parsing of unknown words. Unlike conventional approaches that use hand-crafted rules, rich morphological features, or a character-based model to handle unknown words, this method is based on a graph-based label propagation technique. It yields greater improvement for grammars trained on a smaller amount of labeled data and a large amount of unlabeled data. A transductive graph-based SSL method is employed to propagate POS tags and derive emission distributions from labeled data to unlabeled data. The derived distributions are incorporated into the parsing process. The proposed method effectively augments the original supervised parsing model, yielding absolute improvements of 2.28% and 1.72% in the accuracy of POS tagging and syntactic parsing, respectively, on the Penn Chinese Treebank. |
Language | English |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | Universidade de Macau |
First Author Affiliation | University of Macau |
Recommended Citation GB/T 7714 | Huang Q.,Wong D.F.,Chao L.S.,et al. Augmented parsing of unknown word by graph-based semi-supervised learning[C], 2013, 474-482. |
APA | Huang Q., Wong D.F., Chao L.S., Zeng X., & He L. (2013). Augmented parsing of unknown word by graph-based semi-supervised learning. Proceedings of the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC 27), 474-482. |
Files in This Item: | There are no files associated with this item. |
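
The abstract describes propagating POS information from labeled to unlabeled words over a graph. The following is a minimal sketch of generic iterative label propagation with clamped seed nodes, not the authors' implementation. The function name `propagate_labels`, the toy similarity weights, and the English example words are all illustrative assumptions; the paper's actual graph construction and features are not given in this record.

```python
def propagate_labels(weights, seeds, tags, iterations=50):
    """Generic label propagation sketch (hypothetical helper, not from the paper).

    weights: dict mapping node -> {neighbor: similarity}, assumed symmetric
    seeds:   dict mapping labeled node -> {tag: probability}
    tags:    list of all POS tags
    Returns a dict mapping every node to a tag distribution.
    """
    uniform = {t: 1.0 / len(tags) for t in tags}
    # Labeled nodes start from their seed distribution, others from uniform.
    dist = {}
    for node in weights:
        if node in seeds:
            dist[node] = {t: seeds[node].get(t, 0.0) for t in tags}
        else:
            dist[node] = dict(uniform)

    for _ in range(iterations):
        updated = {}
        for node, neighbors in weights.items():
            total = sum(neighbors.values())
            if node in seeds or total == 0:
                # Seed nodes stay clamped; isolated nodes keep their distribution.
                updated[node] = dict(dist[node])
                continue
            # Each unlabeled node takes the weighted average of its neighbors.
            updated[node] = {
                t: sum(w * dist[m][t] for m, w in neighbors.items()) / total
                for t in tags
            }
        dist = updated
    return dist


# Toy graph: "jog" is the unknown word, linked mostly to the verb "run".
toy_weights = {
    "run": {"jog": 0.9},
    "dog": {"jog": 0.1},
    "jog": {"run": 0.9, "dog": 0.1},
}
toy_seeds = {"run": {"VV": 1.0}, "dog": {"NN": 1.0}}
result = propagate_labels(toy_weights, toy_seeds, ["NN", "VV"])
print(result["jog"])
```

In this toy setting the unknown word inherits most of its probability mass from its most similar labeled neighbor. A distribution derived this way is the kind of quantity the abstract says is incorporated into the parsing process.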
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.