A fast exact sequential algorithm for the partial digest problem

Background Restriction site analysis involves determining the locations of restriction sites after the process of digestion by reconstructing their positions based on the lengths of the cut DNA. Using different reaction times with a single enzyme to cut DNA is a technique known as a partial digestion. Determining the exact locations of restriction sites following a partial digestion is challenging due to the computational time required even with the best known practical algorithm. Results In this paper, we introduce an efficient algorithm to find the exact solution for the partial digest problem. The algorithm is able to find all possible solutions for the input and works by traversing the solution tree with a breadth-first search in two stages and deleting all repeated subproblems. Two types of simulated data, random and Zhang, are used to measure the efficiency of the algorithm. We also apply the algorithm to real data for the Luciferase gene and the E. coli K12 genome. Conclusion Our algorithm is a fast tool to find the exact solution for the partial digest problem. The percentage of improvement is more than 75% over the best known practical algorithm for the worst case. For large numbers of inputs, our algorithm is able to solve the problem in a suitable time, while the best known practical algorithm is unable. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1365-2) contains supplementary material, which is available to authorized users.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.1186/s12859-016-1365-2
PID pmid:28155644
PID pmc:PMC5259970
URL http://link.springer.com/content/pdf/10.1186/s12859-016-1365-2.pdf
URL https://link.springer.com/article/10.1186%2Fs12859-016-1365-2
URL https://dx.doi.org/10.1186/s12859-016-1365-2
URL http://europepmc.org/articles/PMC5259970
URL https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1365-2
URL https://qfrd.pure.elsevier.com/en/publications/a-fast-exact-sequential-algorithm-for-the-partial-digest-problem
URL https://doi.org/10.1186/s12859-016-1365-2
URL https://academic.microsoft.com/#/detail/2562002492
URL http://dx.doi.org/10.1186/s12859-016-1365-2
URL https://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/s12859-016-1365-2
URL https://dblp.uni-trier.de/db/journals/bmcbi/bmcbi17S.html#AbbasB16
URL https://core.ac.uk/display/81855525
URL http://europepmc.org/abstract/MED/28155644
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Mostafa M. Abbas
Author Hazem M. Bahig
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Europe PubMed Central; PubMed Central; Datacite; UnpayWall; Crossref; Microsoft Academic Graph; CORE (RIOXX-UK Aggregator)
Hosted By Europe PubMed Central; SpringerOpen; BMC Bioinformatics
Publication Date 2016-12-22
Publisher Springer Science and Business Media LLC
Additional Info
Field Value
Language UNKNOWN
Resource Type Other literature type; Article; UNKNOWN
system:type publication
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/publication?articleId=dedup_wf_001::2f2ecd7a3f9ee9aaaaf7a1c3656479c5
Author jsonws_user
Last Updated 25 December 2020, 02:04 (CET)
Created 25 December 2020, 02:04 (CET)