Residential College | false |
Status | 已發表Published |
Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms | |
Ye, Zhuyifan; Ouyang, Defang | |
2021-12-11 | |
Source Publication | Journal of Cheminformatics |
ISSN | 1758-2946 |
Volume | 13Issue:1Pages:98 |
Abstract | Rapid solvent selection is of great significance in chemistry. However, solubility prediction remains a crucial challenge. This study aimed to develop machine learning models that can accurately predict compound solubility in organic solvents. A dataset containing 5081 experimental temperature and solubility data of compounds in organic solvents was extracted and standardized. Molecular fingerprints were selected to characterize structural features. lightGBM was compared with deep learning and traditional machine learning (PLS, Ridge regression, kNN, DT, ET, RF, SVM) to develop models for predicting solubility in organic solvents at different temperatures. Compared to other models, lightGBM exhibited significantly better overall generalization (logS ± 0.20). For unseen solutes, our model gave a prediction accuracy (logS ± 0.59) close to the expected noise level of experimental solubility data. lightGBM revealed the physicochemical relationship between solubility and structural features. Our method enables rapid solvent screening in chemistry and may be applied to solubility prediction in other solvents. |
Keyword | Deep Learning Lightgbm Machine Learning Organic Solvents Qspr Solubility Prediction |
DOI | 10.1186/s13321-021-00575-3 |
URL | View the original |
Indexed By | SCIE |
Language | 英語English |
WOS Research Area | Chemistry ; Computer Science |
WOS Subject | Chemistry, Multidisciplinary ; Computer Science, Information Systems ; Computer Science, Interdisciplinary Applications |
WOS ID | WOS:000729235100002 |
Scopus ID | 2-s2.0-85121035802 |
Fulltext Access | |
Citation statistics | |
Document Type | Journal article |
Collection | THE STATE KEY LABORATORY OF QUALITY RESEARCH IN CHINESE MEDICINE (UNIVERSITY OF MACAU) |
Corresponding Author | Ouyang, Defang |
Affiliation | State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macao |
First Author Affilication | Institute of Chinese Medical Sciences |
Corresponding Author Affilication | Institute of Chinese Medical Sciences |
Recommended Citation GB/T 7714 | Ye, Zhuyifan,Ouyang, Defang. Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms[J]. Journal of Cheminformatics, 2021, 13(1), 98. |
APA | Ye, Zhuyifan., & Ouyang, Defang (2021). Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms. Journal of Cheminformatics, 13(1), 98. |
MLA | Ye, Zhuyifan,et al."Prediction of small-molecule compound solubility in organic solvents by machine learning algorithms".Journal of Cheminformatics 13.1(2021):98. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment