Abstract
The development of ultracoarse-grained models for large biomolecules needs to derive the optimal number of coarse-grained (CG) sites to represent the targets. In this work, we propose to use the statistical internal cluster validation indexes to determine the optimal number of CG sites that are optimized based on the essential dynamics coarse-graining method. The calculated curves of Calinski-Harabasz and Silhouette Coefficient indexes exhibit the extrema corresponding to the similar CG numbers. The calculated ratios of the optimal CG numbers to the residue numbers of fine-grained models are in the range from 4 to 2. The comparison of the stability of index results indicates that Calinski-Harabasz index is the better choice to determine the optimal CG representation in coarse-graining.
Original language | English (US) |
---|---|
Pages (from-to) | 14-20 |
Number of pages | 7 |
Journal | Journal of Computational Chemistry |
Volume | 41 |
Issue number | 1 |
DOIs | |
State | Published - Jan 5 2020 |
Keywords
- CH index
- SC index
- coarse-graining
- internal cluster validation index
- optimal CG sites
ASJC Scopus subject areas
- General Chemistry
- Computational Mathematics