Je, a versatile suite to handle multiplexed NGS libraries with unique molecular identifiers

Background: The yield obtained from next generation sequencers has increased almost exponentially in recent years, making sample multiplexing common practice. While barcodes (known sequences of fixed length) primarily encode the sample identity of sequenced DNA fragments, barcodes made of random sequences (Unique Molecular Identifiers, or UMIs) are often used to tell PCR duplicates apart from genuinely abundant transcripts in, for example, single-cell RNA sequencing (scRNA-seq). In paired-end sequencing, different barcodes can be inserted at each fragment end, either to increase the number of multiplexed samples in the library or to use one of the barcodes as a UMI. Alternatively, UMIs can be combined with the sample barcodes into composite barcodes, or with standard Illumina® indexing. Subsequent analysis must take read duplicates and sample identity into account by identifying UMIs.

Results: Existing tools do not support these complex barcoding configurations, and custom code development is frequently required. Here, we present Je, a suite of tools that accommodates complex barcoding strategies, extracts UMIs and filters read duplicates taking UMIs into account. Using Je on publicly available scRNA-seq and iCLIP data containing UMIs, the number of unique reads increased by up to 36 % compared to when UMIs are ignored.

Conclusions: Je is implemented in Java and uses the Picard API. Code, executables and documentation are freely available at http://gbcs.embl.de/Je. Je can also be easily installed in Galaxy through the Galaxy toolshed.

Electronic supplementary material: The online version of this article (doi:10.1186/s12859-016-1284-2) contains supplementary material, which is available to authorized users.
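To make the idea of UMI-aware duplicate filtering concrete, the following is a minimal Python sketch. It is not Je's implementation (Je is written in Java on top of the Picard API), and every name in it (`Read`, `hamming`, `count_unique`, `max_mismatches`) is hypothetical and introduced only for illustration. It assumes that reads mapping to the same chromosome, position and strand are duplicate candidates, and that candidates with different UMIs come from distinct molecules.

```python
from collections import defaultdict
from typing import Iterable, NamedTuple


class Read(NamedTuple):
    """Hypothetical, simplified aligned read carrying an already extracted UMI."""
    name: str
    chrom: str
    pos: int
    strand: str
    umi: str


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("UMIs must have the same length")
    return sum(x != y for x, y in zip(a, b))


def count_unique(reads: Iterable[Read], use_umi: bool = True,
                 max_mismatches: int = 0) -> int:
    """Count reads that remain after duplicate filtering.

    Reads sharing (chrom, pos, strand) are duplicate candidates. Without
    UMIs, each such group collapses to one read. With UMIs, a read counts
    as a duplicate only if its UMI is within `max_mismatches` of a UMI
    already kept in the same group (a greedy, order-dependent
    simplification chosen for brevity).
    """
    groups = defaultdict(list)
    for read in reads:
        groups[(read.chrom, read.pos, read.strand)].append(read)

    unique = 0
    for group in groups.values():
        if not use_umi:
            unique += 1
            continue
        kept_umis = []
        for read in group:
            if not any(hamming(read.umi, u) <= max_mismatches for u in kept_umis):
                kept_umis.append(read.umi)
        unique += len(kept_umis)
    return unique


# Toy data (invented for illustration, not taken from the paper).
reads = [
    Read("r1", "chr1", 100, "+", "ACGT"),
    Read("r2", "chr1", 100, "+", "ACGT"),  # same position, same UMI
    Read("r3", "chr1", 100, "+", "TTGA"),  # same position, different UMI
    Read("r4", "chr1", 100, "+", "ACGA"),  # one mismatch away from ACGT
    Read("r5", "chr1", 250, "-", "GGCC"),
]

print(count_unique(reads, use_umi=False))
print(count_unique(reads, use_umi=True))
print(count_unique(reads, use_umi=True, max_mismatches=1))
```

On this toy input, ignoring UMIs keeps one read per position, while using UMIs retains additional reads whose UMIs differ. That is the effect behind the increase in unique reads reported in the abstract. Allowing a mismatch treats near-identical UMIs as possible sequencing errors of the same molecule; whether and how to do this is a design choice of this sketch, not a statement about Je.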

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.1186/s12859-016-1284-2
PID pmc:PMC5055726
PID pmid:27717304
URL https://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/s12859-016-1284-2
URL http://europepmc.org/abstract/MED/27717304
URL https://dx.doi.org/10.1186/s12859-016-1284-2
URL https://academic.microsoft.com/#/detail/2529694725
URL http://dx.doi.org/10.1186/s12859-016-1284-2
URL http://link.springer.com/content/pdf/10.1186/s12859-016-1284-2.pdf
URL https://link.springer.com/content/pdf/10.1186%2Fs12859-016-1284-2.pdf
URL https://core.ac.uk/display/81858342
URL https://doi.org/10.1186/s12859-016-1284-2
URL https://dblp.uni-trier.de/db/journals/bmcbi/bmcbi17.html#GirardotSSSF16
URL https://link.springer.com/article/10.1186/s12859-016-1284-2
URL https://paperity.org/p/78014160/je-a-versatile-suite-to-handle-multiplexed-ngs-libraries-with-unique-molecular
URL http://europepmc.org/articles/PMC5055726
URL https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5055726/
URL https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1284-2

Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access

Attribution

Description: Authorships and contributors

Field Value
Author Girardot, Charles, 0000-0003-4301-3920
Author Scholtalbers, Jelle
Author Sauer, Sajoscha
Author Su, Shu-Yi
Author Furlong, Eileen E.M., 0000-0002-9544-8339

Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Europe PubMed Central; PubMed Central; ORCID; Datacite; UnpayWall; Crossref; Microsoft Academic Graph; CORE (RIOXX-UK Aggregator)
Hosted By Europe PubMed Central; SpringerOpen; BMC Bioinformatics
Publication Date 2016-10-01
Publisher Springer Science and Business Media LLC

Additional Info
Field Value
Language UNKNOWN
Resource Type Other literature type; Article; UNKNOWN
system:type publication

Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/publication?articleId=dedup_wf_001::d88aa4246038eba8187a5dd0bfbc405e
Author jsonws_user
Last Updated 25 December 2020, 23:51 (CET)
Created 25 December 2020, 23:51 (CET)