TY - JOUR
T1 - An ensemble 3D deep-learning model to predict protein metal-binding site
AU - Mohamadi, Ahmad
AU - Cheng, Tianfan
AU - Jin, Lijian
AU - Wang, Junwen
AU - Sun, Hongzhe
AU - Koohi-Moghadam, Mohamad
N1 - Funding Information:
The authors gratefully acknowledge support of the University of Hong Kong for seed funding for new staff to M.K.-M. and Research Grants Council of Hong Kong ( 17308921 , 2122-7S04 , and 17318322 ) to H.S.
Funding Information:
The authors gratefully acknowledge support of the University of Hong Kong for seed funding for new staff to M.K.-M. and Research Grants Council of Hong Kong (17308921, 2122-7S04, and 17318322) to H.S. For the work described herein, M.K.-M. H.S. and A.M. conceived the idea. A.M. implemented the codes and analyzed the results. A.M. and T.C. prepared and validated the dataset. A.M. and M.K.-M. performed result validation. A.M. M.K.-M. and T.C. wrote the paper. L.J. and J.W. commented on and edited the manuscript. M.K.-M. and H.S. provided overall project leadership. The authors declare no competing interests.
Publisher Copyright:
© 2022 The Author(s)
PY - 2022/9/21
Y1 - 2022/9/21
N2 - Predicting metal-binding sites in proteins is critical for understanding the protein's biological function. Here, we develop an ensemble deep convolutional neural network (CNN) method for predicting metal-binding sites based on their three-dimensional (3D) structure. We build multi-channel 3D voxels based on biophysical characteristics obtained from raw atom coordinates of each protein-binding pocket. Then, we use these 3D voxels as the input of an ensemble 3D CNN model. We train and evaluate the model using a curated dataset of 3D protein structures. Our proposed model shows high performance in predicting metal-binding sites for Zn, Fe, Mg, Mn, Ca, and Na. Our approach offers a framework to use 3D spatial features to train 3D-CNN, which may be used to predict complicated metal-binding sites directly from their biophysical characteristics. The source code and webserver of the model are publicly available.
AB - Predicting metal-binding sites in proteins is critical for understanding the protein's biological function. Here, we develop an ensemble deep convolutional neural network (CNN) method for predicting metal-binding sites based on their three-dimensional (3D) structure. We build multi-channel 3D voxels based on biophysical characteristics obtained from raw atom coordinates of each protein-binding pocket. Then, we use these 3D voxels as the input of an ensemble 3D CNN model. We train and evaluate the model using a curated dataset of 3D protein structures. Our proposed model shows high performance in predicting metal-binding sites for Zn, Fe, Mg, Mn, Ca, and Na. Our approach offers a framework to use 3D spatial features to train 3D-CNN, which may be used to predict complicated metal-binding sites directly from their biophysical characteristics. The source code and webserver of the model are publicly available.
KW - 3D voxels
KW - ensemble 3D deep learning
KW - metal-binding sites
KW - metalloprotein
KW - spatial features
UR - http://www.scopus.com/inward/record.url?scp=85138187995&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85138187995&partnerID=8YFLogxK
U2 - 10.1016/j.xcrp.2022.101046
DO - 10.1016/j.xcrp.2022.101046
M3 - Article
AN - SCOPUS:85138187995
VL - 3
JO - Cell Reports Physical Science
JF - Cell Reports Physical Science
SN - 2666-3864
IS - 9
M1 - 101046
ER -