Status | 已發表Published |
A Universal Phrase Tagset for Multilingual Treebanks | |
Han, A.![]() ![]() ![]() ![]() ![]() | |
2014-10-01 | |
Source Publication | Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data
![]() |
Pages | 247-258 |
Publisher | Springer |
Abstract | Many syntactic treebanks and parser toolkits are developed in the past twenty years, including dependency structure parsers and phrase structure parsers. For the phrase structure parsers, they usually utilize different phrase tagsets for different languages, which results in an inconvenience when conducting the multilingual research. This paper designs a refined universal phrase tagset that contains 9 commonly used phrase categories. Furthermore, the mapping covers 25 constituent treebanks and 21 languages. The experiments show that the universal phrase tagset can generally reduce the costs in the parsing models and even improve the parsing accuracy. |
Keyword | Universal phrase tagset Phrase tagset mapping Multilingual treebanks Parsing |
Language | 英語English |
The Source to Article | PB_Publication |
PUB ID | 24993 |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Recommended Citation GB/T 7714 | Han, A.,Wong, F.,Chao, S.,et al. A Universal Phrase Tagset for Multilingual Treebanks[C]:Springer, 2014, 247-258. |
APA | Han, A.., Wong, F.., Chao, S.., Lu, Y.., He, L.., & Tian, L. (2014). A Universal Phrase Tagset for Multilingual Treebanks. Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 247-258. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment