ISSEC: inferring contacts among protein secondary structure elements using deep object detection

Abstract Background The formation of contacts among protein secondary structure elements (SSEs) is an important step in protein folding as it determines topology of protein tertiary structure; hence, inferring inter-SSE contacts is crucial to protein structure prediction. One of the existing strategies infers inter-SSE contacts directly from the predicted possibilities of inter-residue contacts without any preprocessing, and thus suffers from the excessive noises existing in the predicted inter-residue contacts. Another strategy defines SSEs based on protein secondary structure prediction first, and then judges whether each candidate SSE pair could form contact or not. However, it is difficult to accurately determine boundary of SSEs due to the errors in secondary structure prediction. The incorrectly-deduced SSEs definitely hinder subsequent prediction of the contacts among them. Results We here report an accurate approach to infer the inter-SSE contacts (thus called as ISSEC) using the deep object detection technique. The design of ISSEC is based on the observation that, in the inter-residue contact map, the contacting SSEs usually form rectangle regions with characteristic patterns. Therefore, ISSEC infers inter-SSE contacts through detecting such rectangle regions. Unlike the existing approach directly using the predicted probabilities of inter-residue contact, ISSEC applies the deep convolution technique to extract high-level features from the inter-residue contacts. More importantly, ISSEC does not rely on the pre-defined SSEs. Instead, ISSEC enumerates multiple candidate rectangle regions in the predicted inter-residue contact map, and for each region, ISSEC calculates a confidence score to measure whether it has characteristic patterns or not. ISSEC employs greedy strategy to select non-overlapping regions with high confidence score, and finally infers inter-SSE contacts according to these regions. Conclusions Comprehensive experimental results suggested that ISSEC outperformed the state-of-the-art approaches in predicting inter-SSE contacts. We further demonstrated the successful applications of ISSEC to improve prediction of both inter-residue contacts and tertiary structure as well.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.6084/m9.figshare.c.5198889
PID https://www.doi.org/10.6084/m9.figshare.c.5198889.v1
URL http://dx.doi.org/10.6084/m9.figshare.c.5198889.v1
URL http://dx.doi.org/10.6084/m9.figshare.c.5198889
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right not available
Attribution

Description: Authorships and contributors

Field Value
Author Zhang, Qi
Author Jianwei Zhu
Author Fusong Ju
Author Lupeng Kong
Author Shiwei Sun
Author Wei-Mou Zheng
Author Bu, Dongbo, 0000-0003-4119-4238
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Datacite
Hosted By figshare
Publication Date 2020-01-01
Publisher figshare
Additional Info
Field Value
Language UNKNOWN
Resource Type Collection
keyword FOS: Mathematics
keyword FOS: Physical sciences
keyword FOS: Sociology
keyword FOS: Biological sciences
keyword FOS: Computer and information sciences
system:type other
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/other?orpId=dedup_wf_001::d02f8e52ed6c05c911b6d48b4d6bccbe
Author jsonws_user
Last Updated 20 December 2020, 03:28 (CET)
Created 20 December 2020, 03:28 (CET)