TY - JOUR
T1 - CovBinderInPDB
T2 - A Structure-Based Covalent Binder Database
AU - Guo, Xiao Kang
AU - Zhang, Yingkai
N1 - Funding Information:
This work was supported by the U.S. National Institutes of Health (R35-GM127040). The authors thank NYU-ITS for providing computational resources.
Publisher Copyright:
© 2022 American Chemical Society. All rights reserved.
PY - 2022/12/12
Y1 - 2022/12/12
N2 - Covalent inhibition has emerged as a promising orthogonal approach for drug discovery, despite the significant challenge in achieving target specificity. To facilitate the structure-based rational design of target-specific covalent modulators, we developed an integrated computational protocol to curate covalent binders from the RCSB Protein Data Bank (PDB). Starting from the macromolecular crystallographic information files (mmCIF) in the PDB archive, covalent bond records, which indicate the side chain modification of an amino acid residue by a covalent binder, were collected and cleaned. Then, residue-binder adducts, which are products of chemical reactions between targeted residues and covalent binders, were recovered with the help of the Chemical Component Dictionary in the PDB. Finally, several strategies were employed to curate the pre-reaction forms of covalent binders from the adducts. Our curated CovBinderInPDB database contains 7375 covalent modifications in which 2189 unique covalent binders target nine types of amino acid residues (Cys, Lys, Ser, Asp, Glu, His, Met, Thr, and Tyr) from 3555 complex structures of 1170 unique protein chains. This database would set a solid foundation for developing and benchmarking computational strategies for covalent modulator design and is freely accessible at https://yzhang.hpc.nyu.edu/CovBinderInPDB.
UR - http://www.scopus.com/inward/record.url?scp=85143397814&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85143397814&partnerID=8YFLogxK
U2 - 10.1021/acs.jcim.2c01216
DO - 10.1021/acs.jcim.2c01216
M3 - Article
C2 - 36453831
AN - SCOPUS:85143397814
VL - 62
SP - 6057
EP - 6068
JO - Journal of Chemical Information and Modeling
JF - Journal of Chemical Information and Modeling
SN - 1549-9596
IS - 23
ER -