DELPHI: accurate deep ensemble model for protein interaction sites prediction.

2020
Proteins usually perform their functions by interacting with other proteins, which is why accurately predicting protein-protein interaction (PPI) binding sites is a fundamental problem. Experimental methods are slow and expensive, so great effort is being made to improve the performance of computational methods.

We propose DELPHI (DEep Learning Prediction of Highly probable protein Interaction sites), a new sequence-based deep learning suite for PPI binding site prediction. DELPHI has an ensemble structure that combines a CNN component and an RNN component with a fine-tuning technique. Three novel features (HSP, position information, and ProtVec) are used in addition to nine existing ones. We comprehensively compare DELPHI to nine state-of-the-art programs on five datasets, and DELPHI outperforms the competing methods in all metrics even though its training dataset shares the least similarity with the testing datasets. In the most important metrics, AUPRC and MCC, it surpasses the second-best programs by as much as 18.5% and 27.7%, respectively. We also demonstrate that the improvement is essentially due to the ensemble model and, especially, the three new features. Using DELPHI, we show a strong correlation between protein-binding residues (PBRs) and sites with strong evolutionary conservation. In addition, DELPHI's predicted PBR sites closely match known data from Pfam. DELPHI is available as open-source standalone software and as a web server.

The DELPHI web server can be found at www.csd.uwo.ca/~yli922/index.php, with all datasets and results in this study. The trained models, the DELPHI standalone source code, and the feature computation pipeline are freely available at github.com/lucian-ilie/DELPHI.

Supplementary data are available at Bioinformatics online.
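
For readers unfamiliar with the two headline metrics named in the abstract, AUPRC and MCC, the following is a minimal sketch of how they can be computed from per-residue predictions. It is not taken from the DELPHI code base: the function names, the toy labels and scores, and the 0.5 decision threshold are all assumptions made for illustration, and AUPRC is approximated here by average precision.

```python
import math


def mcc(y_true, y_pred):
    """Matthews correlation coefficient for binary labels (0 or 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal count is zero.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def auprc(y_true, scores):
    """Area under the precision-recall curve, computed as average precision.

    Residues are ranked by descending score; precision is accumulated at
    each rank where a true binding residue is found. Tied scores are not
    grouped, so this sketch assumes distinct scores.
    """
    n_pos = sum(y_true)
    if n_pos == 0:
        raise ValueError("at least one positive label is required")
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    hits = 0
    total = 0.0
    for rank, i in enumerate(order, start=1):
        if y_true[i] == 1:
            hits += 1
            total += hits / rank
    return total / n_pos


# Hypothetical toy data: 1 marks a protein-binding residue, 0 a non-binding one.
labels = [1, 0, 1, 0, 0, 1, 0, 0]
# Hypothetical per-residue binding propensities from some predictor.
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.6, 0.2, 0.1]
# Assumed decision threshold of 0.5 to obtain binary predictions for MCC.
preds = [1 if s >= 0.5 else 0 for s in scores]

print(f"AUPRC = {auprc(labels, scores):.3f}")
print(f"MCC   = {mcc(labels, preds):.3f}")
```

AUPRC is threshold-free and depends only on how the scores rank the residues, whereas MCC requires a cutoff to turn scores into binary calls. Both account for class imbalance, which matters here because binding residues are typically a minority of a protein's residues.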
Reference Key li2020delphibioinformatics
Authors Li, Yiwei; Golding, G. Brian; Ilie, Lucian
Journal Bioinformatics
Year 2020
DOI btaa750
