Residential College | false |
Status | 已發表Published |
Study of Data Imbalanced Problem in Protein-peptide Binding Prediction | |
Gao, Lu; Siu, Shirley W.I. | |
2020-07-10 | |
Conference Name | 2020 12th International Conference on Bioinformatics and Biomedical Technology |
Source Publication | ICBBT 2020: Proceedings of the 2020 12th International Conference on Bioinformatics and Biomedical Technology |
Pages | 61-66 |
Conference Date | 2020/05/22-2020/05/24 |
Conference Place | Xi'an |
Publisher | ICST |
Abstract | Peptide-binding proteins are excessive in living cells and proteinpeptide interactions mediate a wide range of cellular functions. Prediction of protein-peptide binding residues has been vital and popular in the past decades and machine learning methods have gained more attention in recent years. However, the data imbalance problem has not been dealt with effectively. On this matter, we study the effects of sampling methods and degrees of imbalance on data classes on construction of prediction model. We first developed the NearMiss under-sampling method (NMUS) as a way to screen out a given number of quality data samples from majority class to balance the data sets. The remarkable sensitivity (SEN) with 0.818 shows the advantage of NMUS in handling class imbalance problem. This research carried on valuable analysis on data imbalance problem and achieved a better prediction of protein-peptide binding interaction. |
Keyword | Protein-peptide Binding Residues Data Imbalance Nearmiss Under-sampling |
DOI | 10.1145/3405758.3405764 |
URL | View the original |
Language | 英語English |
Scopus ID | 2-s2.0-85092625815 |
Fulltext Access | |
Citation statistics | |
Document Type | Conference paper |
Collection | DEPARTMENT OF COMPUTER AND INFORMATION SCIENCE |
Affiliation | University of Macau, Macau, China, Avenida da Universidade, Taipa, Macau, China |
First Author Affilication | University of Macau |
Recommended Citation GB/T 7714 | Gao, Lu,Siu, Shirley W.I.. Study of Data Imbalanced Problem in Protein-peptide Binding Prediction[C]:ICST, 2020, 61-66. |
APA | Gao, Lu., & Siu, Shirley W.I. (2020). Study of Data Imbalanced Problem in Protein-peptide Binding Prediction. ICBBT 2020: Proceedings of the 2020 12th International Conference on Bioinformatics and Biomedical Technology, 61-66. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment