MultiDataSet: an R package for encapsulating multiple data sets with application to omic data integration

BACKGROUND: Reduction in the cost of genomic assays has generated large amounts of biomedical-related data. As a result, current studies perform multiple experiments in the same subjects. While Bioconductor's methods and classes implemented in different packages manage individual experiments, there is not a standard class to properly manage different omic datasets from the same subjects. In addition, most R/Bioconductor packages that have been designed to integrate and visualize biological data often use basic data structures with no clear general methods, such as subsetting or selecting samples. RESULTS: To cover this need, we have developed MultiDataSet, a new R class based on Bioconductor standards, designed to encapsulate multiple data sets. MultiDataSet deals with the usual difficulties of managing multiple and non-complete data sets while offering a simple and general way of subsetting features and selecting samples. We illustrate the use of MultiDataSet in three common situations: 1) performing integration analysis with third party packages; 2) creating new methods and functions for omic data integration; 3) encapsulating new unimplemented data from any biological experiment.CONCLUSIONS: MultiDataSet is a suitable class for data integration under R and Bioconductor framework.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.1186/s12859-016-1455-1
PID pmid:28095799
PID pmc:PMC5240259
URL http://hdl.handle.net/10230/34978
URL https://repositori.upf.edu/handle/10230/34978
URL http://dx.doi.org/10.1186/s12859-016-1455-1
URL http://hdl.handle.net/2445/106469
URL http://europepmc.org/articles/PMC5240259
URL https://paperity.org/p/79125187/multidataset-an-r-package-for-encapsulating-multiple-data-sets-with-application-to-omic
URL https://doi.org/10.1186/s12859-016-1455-1
URL https://dblp.uni-trier.de/db/journals/bmcbi/bmcbi18.html#Hernandez-Ferrer17
URL http://diposit.ub.edu/dspace/bitstream/2445/106469/1/hernandez-ferrer2017_2366.pdf
URL https://link.springer.com/article/10.1186/s12859-016-1455-1
URL https://academic.microsoft.com/#/detail/2579218984
URL https://dx.doi.org/10.1186/s12859-016-1455-1
URL http://link.springer.com/content/pdf/10.1186/s12859-016-1455-1.pdf
URL https://europepmc.org/articles/PMC5240259/
URL https://pubmed.ncbi.nlm.nih.gov/28095799/
URL https://bmcbioinformatics.biomedcentral.com/track/pdf/10.1186/s12859-016-1455-1
URL https://core.ac.uk/display/159408653
URL http://diposit.ub.edu/dspace/handle/2445/106469?mode=full
URL https://zenodo.org/record/344065
URL https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-016-1455-1
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Juan R González, 0000-0003-3267-2146
Author Carles Hernandez-Ferrer, 0000-0002-8029-7160
Author Carlos Ruiz-Arenas, 0000-0002-6014-3498
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Europe PubMed Central; ZENODO; PubMed Central; ORCID; Datacite; UnpayWall; UPF Digital Repository; Diposit Digital de la Universitat de Barcelona; Research Repository of Catalonia; Crossref; Microsoft Academic Graph; CORE (RIOXX-UK Aggregator)
Hosted By Europe PubMed Central; SpringerOpen; ZENODO; UPF Digital Repository; Diposit Digital de la Universitat de Barcelona; Research Repository of Catalonia; BMC Bioinformatics
Publication Date 2017-01-17
Additional Info
Field Value
Country Spain
Description This work has been partly funded by the Spanish Ministry of Economy and Competitiveness (MTM2015-68140-R). CH-F was supported by a grant from European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no 308333 – the HELIX project. CR-A was supported by a FI fellowship from Catalan Government (#016FI_B 00272)
Format application/pdf; 7 p.
Language Undetermined
Resource Type Article; UNKNOWN
keyword R
keyword Genòmica
system:type publication
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/publication?articleId=dedup_wf_001::752626ab44a88fb926ef560bda4c9cb6
Author jsonws_user
Last Updated 26 December 2020, 15:25 (CET)
Created 26 December 2020, 15:25 (CET)