dedup_wf_001--2bec4529e1503cfec64c373ca52f5803

This describes the output files for the BioWordlists project. These files are ancillary data for other text mining projects. Each file is a tab-delimited file with one term per line. The first column is a unique ID. The second column is the main name of the term. The third column is a pipe-delimited set of the synonyms for this term (including the main term). terms_genes.tsv: This is a list of all human genes with synonyms. The first column is the HUGO gene ID. It includes an additional fourth column is the Entrez gene ID. Genes are built using the NCBI Gene resource with synonyms from the UMLS Metathesaurus. terms_drugs.tsv: This is a list of all drugs from the WikiData resource. It also includes some more general terms and inhibitors terms for all genes in the gene list. terms_cancers.tsv: This is a list of specific cancer types from the Disease Ontology. General cancer terms have been removed and synonyms added from the UMLS Metathesaurus. terms_variants.tsv: Common mutations, aberrations and other 'omic events that may occur to a gene, especially in the cancer setting. terms_conflicting.tsv: Several common biomedical terms that are easily confused with other useful concepts. An examples is "Cox Regression". This list is used to identify these to reduce ambiguity. terms_proteins.tsv: Human protein names from UniProt with synonyms.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.5281/zenodo.2528593
URL https://figshare.com/articles/BioWordlists/7545605
URL https://zenodo.org/record/2528593
URL http://dx.doi.org/10.5281/zenodo.2528593
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Jake Lever
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Zenodo; figshare; Datacite
Hosted By Zenodo; figshare
Publication Date 2018-12-29
Additional Info
Field Value
Language UNKNOWN
Resource Type Dataset
system:type dataset
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/dataset?datasetId=dedup_wf_001::2bec4529e1503cfec64c373ca52f5803
Author jsonws_user
Last Updated 16 December 2020, 20:50 (CET)
Created 16 December 2020, 20:50 (CET)