Residential College | false |
Status | 已發表Published |
Expert Knowledge-Guided Length-Variant Hierarchical Label Generation for Proposal Classification | |
Xiao, Meng; Qiao, Ziyue; Fu, Yanjie; Du, Yi; Wang, Pengyang; Zhou, Yuanchun | |
2021-12 | |
Conference Name | IEEE International Conference on Data Mining, |
Source Publication | Proceedings - IEEE International Conference on Data Mining, ICDM |
Volume | 2021-December |
Pages | 757-766 |
Conference Date | 2021-12-07 |
Conference Place | Auckland |
Country | New Zealand |
Abstract | To advance the development of science and technology, research proposals are submitted to open-court competitive programs developed by government agencies (e.g., NSF). Proposal classification is one of the most important tasks to achieve effective and fair review assignment. Proposal classification aims to classify a proposal into a length-variant sequence of labels. In this paper, we formulate the proposal classification problem into a hierarchical multi-label classification task. Although there are certain prior studies, proposal classification exhibit unique features: 1) the classification result of a proposal is in a hierarchical discipline structure with different levels of granularity; 2) proposals contain multiple types of documents; 3) domain experts can empirically provide partial labels that can be leveraged to improve task performances. In this paper, we focus on developing a new deep proposal classification framework to jointly model the three features. We design a deep transformer-based encoder-decoder framework. In this framework, we use a two-level (word-level and document-level) Transformer structure as an encoder to learn the embedding feature vectors of proposals. The decoder generates labels from the starting coarse-grained level to a certain fine-grained level to form the hierarchical discipline tree. In particular, to sequentially generate labels, we leverage previously-generated labels to predict the label of next level; to integrate partial labels from experts, we use the embedding of these empirical partial labels to initialize the state of neural networks. Our model can automatically identify the best length of label sequence to stop next label prediction. Finally, we present extensive results to demonstrate that our method can jointly model partial labels, textual information, and semantic dependencies in label sequences and, thus, achieve advanced performances. |
Document Type | Conference paper |
Collection | THE STATE KEY LABORATORY OF INTERNET OF THINGS FOR SMART CITY (UNIVERSITY OF MACAU) |
Affiliation | 1.Computer Network Information Center, Chinese Academy of Sciences 2.Computer Network Information Center, Chinese Academy of Sciences 3.Department of Computer Science, University of Central Florida 4.Computer Network Information Center, Chinese Academy of Sciences 5.University of Macau 6.Computer Network Information Center, Chinese Academy of Sciences |
Recommended Citation GB/T 7714 | Xiao, Meng,Qiao, Ziyue,Fu, Yanjie,et al. Expert Knowledge-Guided Length-Variant Hierarchical Label Generation for Proposal Classification[C], 2021, 757-766. |
APA | Xiao, Meng., Qiao, Ziyue., Fu, Yanjie., Du, Yi., Wang, Pengyang., & Zhou, Yuanchun (2021). Expert Knowledge-Guided Length-Variant Hierarchical Label Generation for Proposal Classification. Proceedings - IEEE International Conference on Data Mining, ICDM, 2021-December, 757-766. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment