Residential Collegefalse
Status已發表Published
Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms
Ye, Zhuyifan; Ouyang, Defang
2021-12-11
Source PublicationJournal of Cheminformatics
ISSN1758-2946
Volume13Issue:1Pages:98
Abstract

Rapid solvent selection is of great significance in chemistry. However, solubility prediction remains a crucial challenge. This study aimed to develop machine learning models that can accurately predict compound solubility in organic solvents. A dataset containing 5081 experimental temperature and solubility data of compounds in organic solvents was extracted and standardized. Molecular fingerprints were selected to characterize structural features. lightGBM was compared with deep learning and traditional machine learning (PLS, Ridge regression, kNN, DT, ET, RF, SVM) to develop models for predicting solubility in organic solvents at different temperatures. Compared to other models, lightGBM exhibited significantly better overall generalization (logS ± 0.20). For unseen solutes, our model gave a prediction accuracy (logS ± 0.59) close to the expected noise level of experimental solubility data. lightGBM revealed the physicochemical relationship between solubility and structural features. Our method enables rapid solvent screening in chemistry and may be applied to solubility prediction in other solvents.

KeywordDeep Learning Lightgbm Machine Learning Organic Solvents Qspr Solubility Prediction
DOI10.1186/s13321-021-00575-3
URLView the original
Indexed BySCIE
Language英語English
WOS Research AreaChemistry ; Computer Science
WOS SubjectChemistry, Multidisciplinary ; Computer Science, Information Systems ; Computer Science, Interdisciplinary Applications
WOS IDWOS:000729235100002
Scopus ID2-s2.0-85121035802
Fulltext Access
Citation statistics
Document TypeJournal article
CollectionTHE STATE KEY LABORATORY OF QUALITY RESEARCH IN CHINESE MEDICINE (UNIVERSITY OF MACAU)
Corresponding AuthorOuyang, Defang
AffiliationState Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macao
First Author AffilicationInstitute of Chinese Medical Sciences
Corresponding Author AffilicationInstitute of Chinese Medical Sciences
Recommended Citation
GB/T 7714
Ye, Zhuyifan,Ouyang, Defang. Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms[J]. Journal of Cheminformatics, 2021, 13(1), 98.
APA Ye, Zhuyifan., & Ouyang, Defang (2021). Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms. Journal of Cheminformatics, 13(1), 98.
MLA Ye, Zhuyifan,et al."Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms".Journal of Cheminformatics 13.1(2021):98.
Files in This Item:
There are no files associated with this item.
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Ye, Zhuyifan]'s Articles
[Ouyang, Defang]'s Articles
Baidu academic
Similar articles in Baidu academic
[Ye, Zhuyifan]'s Articles
[Ouyang, Defang]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Ye, Zhuyifan]'s Articles
[Ouyang, Defang]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.