Residential Collegefalse
Status已發表Published
Study of Data Imbalanced Problem in Protein-peptide Binding Prediction
Gao, Lu; Siu, Shirley W.I.
2020-07-10
Conference Name2020 12th International Conference on Bioinformatics and Biomedical Technology
Source PublicationICBBT 2020: Proceedings of the 2020 12th International Conference on Bioinformatics and Biomedical Technology
Pages61-66
Conference Date2020/05/22-2020/05/24
Conference PlaceXi'an
PublisherICST
Abstract

Peptide-binding proteins are excessive in living cells and proteinpeptide interactions mediate a wide range of cellular functions. Prediction of protein-peptide binding residues has been vital and popular in the past decades and machine learning methods have gained more attention in recent years. However, the data imbalance problem has not been dealt with effectively. On this matter, we study the effects of sampling methods and degrees of imbalance on data classes on construction of prediction model. We first developed the NearMiss under-sampling method (NMUS) as a way to screen out a given number of quality data samples from majority class to balance the data sets. The remarkable sensitivity (SEN) with 0.818 shows the advantage of NMUS in handling class imbalance problem. This research carried on valuable analysis on data imbalance problem and achieved a better prediction of protein-peptide binding interaction.

KeywordProtein-peptide Binding Residues Data Imbalance Nearmiss Under-sampling
DOI10.1145/3405758.3405764
URLView the original
Language英語English
Scopus ID2-s2.0-85092625815
Fulltext Access
Citation statistics
Document TypeConference paper
CollectionDEPARTMENT OF COMPUTER AND INFORMATION SCIENCE
AffiliationUniversity of Macau, Macau, China, Avenida da Universidade, Taipa, Macau, China
First Author AffilicationUniversity of Macau
Recommended Citation
GB/T 7714
Gao, Lu,Siu, Shirley W.I.. Study of Data Imbalanced Problem in Protein-peptide Binding Prediction[C]:ICST, 2020, 61-66.
APA Gao, Lu., & Siu, Shirley W.I. (2020). Study of Data Imbalanced Problem in Protein-peptide Binding Prediction. ICBBT 2020: Proceedings of the 2020 12th International Conference on Bioinformatics and Biomedical Technology, 61-66.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Gao, Lu]'s Articles
[Siu, Shirley W.I.]'s Articles
Baidu academic
Similar articles in Baidu academic
[Gao, Lu]'s Articles
[Siu, Shirley W.I.]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Gao, Lu]'s Articles
[Siu, Shirley W.I.]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.