-
Drug-target interaction prediction via class imbalance-aware ensemble learning
Background Multiple computational methods for predicting drug-target interactions have been developed to facilitate the drug discovery process. These methods use available data... -
IPED: a highly efficient denoising tool for Illumina MiSeq Paired-end 16S rRN...
Background The development of high-throughput sequencing technologies has revolutionized the field of microbial ecology via the sequencing of phylogenetic marker genes (e.g. 16S... -
PyPedia: using the wiki paradigm as crowd sourcing environment for bioinforma...
Background Today researchers can choose from many bioinformatics protocols for all types of life sciences research, computational environments and coding languages. Although the... -
Evaluation of logistic regression models and effect of covariates for case–co...
Background Next generation sequencing provides a count of RNA molecules in the form of short reads, yielding discrete, often highly non-normally distributed gene expression... -
CytoSpectre: a tool for spectral analysis of oriented structures on cellular ...
Background Orientation and the degree of isotropy are important in many biological systems such as the sarcomeres of cardiomyocytes and other fibrillar structures of the... -
Towards a phenome-wide catalog of human clinical traits impacted by genetic a...
Background Racial/ethnic differences for commonly measured clinical variables are well documented, and it has been postulated that population-specific genetic factors may play a... -
ClonoCalc and ClonoPlot: immune repertoire analysis from raw files to publica...
Background Next generation sequencing (NGS) technologies enable studies and analyses of the diversity of both T and B cell receptors (TCR and BCR) in human and animal systems to... -
SnoReport 2.0: new features and a refined Support Vector Machine to improve s...
Background snoReport uses RNA secondary structure prediction combined with machine learning as the basis to identify the two main classes of small nucleolar RNAs, the box H/ACA... -
AutoSOME: a clustering method for identifying gene expression modules without...
Abstract Background Clustering the information content of large high-dimensional gene expression datasets has widespread application in "omics" biology. Unfortunately, the... -
"CellProfiler Tracer: exploring and validating high-throughput, time-lapse mi...
Background Time-lapse analysis of cellular images is an important and growing need in biology. Algorithms for cell tracking are widely available; what researchers have been... -
An Eigenvalue test for spatial principal component analysis
AbstractBackgroundThe spatial Principal Component Analysis (sPCA, Jombart 2008) is designed to investigate non-random spatial distributions of genetic variation. Unfortunately,... -
MetaDiff: differential isoform expression analysis using random-effects meta-...
Background RNA sequencing (RNA-Seq) allows an unbiased survey of the entire transcriptome in a high-throughput manner. A major application of RNA-Seq is to detect differential... -
Fostering serendipitous knowledge discovery using an adaptive multigraph-base...
Serendipity, the art of making an unsought finding plays also an important role in the emerging field of data science, allowing the discovery of interesting and valuable facts... -
A generic schema and data collection forms applicable to diverse entomologica...
Background Standardized schemas, databases, and public data repositories are needed for the studies of malaria vectors that encompass a remarkably diverse array of designs and... -
Handling missing rows in multi-omics data integration: multiple imputation in...
Background In omics data integration studies, it is common, for a variety of reasons, for some individuals to not be present in all data tables. Missing row values are... -
QUADrATiC: scalable gene expression connectivity mapping for repurposing FDA-...
Background Gene expression connectivity mapping has proven to be a powerful and flexible tool for research. Its application has been shown in a broad range of research topics,... -
Leveraging 3D chemical similarity, target and phenotypic data in the identifi...
Background Drug-target identification is crucial to discover novel applications for existing drugs and provide more insights about mechanisms of biological actions, such as... -
Bridging centrality as an indicator to measure the ‘bridging role’ of actors ...
In the recent past, we can observe growing interest in STI studies in the notion of positioning indicators, shifting emphasis to actors in the innovation process and their... -
Tackling the challenges of matching biomedical ontologies
Background Biomedical ontologies pose several challenges to ontology matching due both to the complexity of the biomedical domain and to the characteristics of the ontologies... -
Badapple: promiscuity patterns from noisy evidence.
Background Bioassay data analysis continues to be an essential, routine, yet challenging task in modern drug discovery and chemical biology research. The challenge is to infer...