Abstract
Structure-based drug design is critically dependent on accuracy of molecular docking scoring functions, and there is of significant interest to advance scoring functions with machine learning approaches. In this work, by judiciously expanding the training set, exploring new features related to explicit mediating water molecules as well as ligand conformation stability, and applying extreme gradient boosting (XGBoost) with Δ-Vina parametrization, we have improved robustness and applicability of machine-learning scoring functions. The new scoring function Δ vinaXGB can not only perform consistently among the top compared to classical scoring functions for the CASF-2016 benchmark but also achieves significantly better prediction accuracy in different types of structures that mimic real docking applications.
Original language | English (US) |
---|---|
Pages (from-to) | 4540-4549 |
Number of pages | 10 |
Journal | Journal of Chemical Information and Modeling |
Volume | 59 |
Issue number | 11 |
DOIs | |
State | Published - Nov 25 2019 |
Keywords
- Databases, Protein
- Drug Design
- Humans
- Ligands
- Machine Learning
- Molecular Conformation
- Molecular Docking Simulation
- Protein Binding
- Proteins/chemistry
- Small Molecule Libraries/chemistry
- Water/chemistry
ASJC Scopus subject areas
- Chemical Engineering(all)
- Chemistry(all)
- Library and Information Sciences
- Computer Science Applications