Enzymes from the BRENDA and CAZy databases annotated with organism growth temperatures and predicted Topt

This repo is an updated version of repo Gang Li, & Martin KM Engqvist. (2019). Enzymes from the BRENDA database annotated with organism growth temperatures and predicted Topt (Version 1.0) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.2539114.  Experimental as well as predicted organism growth temperatures were used to annotate enzymes from the BRENDA database (doi: 10.1093/nar/gky1048, https://www.brenda-enzymes.org) version 2018.2 (July 2018) and CAZy database (http://www.cazy.org/).   An updated machine learning model was applied to predict the optimal functional temperature of enzymes from BRENDA and CAZy.  There are four files in this repo: 1. 'annotated_brenda.tsv' is a tab-seperated file that contains the annotated enzymes from BRENDA. There are 9 columns in the file: index column; "ec", EC number; "uniprot_id", protein id in Uniprot database; "domain", the domain of life (superkingdom), either Archaea, Bacteria, or Eukarya; "organism", species name; "ogt", optimal growth temperature of the organism; "ogt_note", whether the experimental or predicted ogt is used; "topt", the optimal functional temperature of the enzyme; "topt_note", whether the experimental or predicted topt is used. 2. 'annotated_cazy.tsv' is a tab-seperated file that contains the annotated enzymes from CAZy. There are 12 columns in the file: index column; "family", CAZy family id; "genbank", genbank id; "Protein Name", the protein name from CAZy database; "ec", EC number; "organism", strain name; "uniprot_id", protein id in Uniprot database; "PDB/3D", structure id in PDB database; "ogt", optimal growth temperature of the organism; "ogt_note", whether the experimental or predicted ogt is used; "topt", the optimal functional temperature of the enzyme; "topt_note", whether the experimental or predicted topt is used. 3. 'brenda.sql', which is a SQLite3 database version of 'annotated_brenda.tsv', with an additional column of enzyme sequences. 4. 'cazy.sql', which is a SQLite3 database version of 'annotated_cazy.tsv'', with an additional column of enzyme sequences. The SQLite3 databases are for the Tome tool (https://github.com/EngqvistLab/Tome), version 2.0.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.5281/zenodo.3578467
PID https://www.doi.org/10.5281/zenodo.3578468
URL http://dx.doi.org/10.5281/zenodo.3578468
URL https://zenodo.org/record/3578468
URL http://dx.doi.org/10.5281/zenodo.3578467
URL https://figshare.com/articles/Enzymes_from_the_BRENDA_and_CAZy_databases_annotated_with_organism_growth_temperatures_and_predicted_Topt/11445570
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Li, Gang, 0000-0001-6778-2842
Author Engqvist, Martin KM, 0000-0003-2174-2225
Contributor European Commission
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Zenodo; Datacite; figshare
Hosted By Zenodo; figshare
Publication Date 2019-12-16
Publisher Zenodo
Additional Info
Field Value
Language UNKNOWN
Resource Type Dataset
system:type dataset
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/dataset?datasetId=dedup_wf_001::12265a37e6db71f7fd0c8f1c5b00f7a3
Author jsonws_user
Version 1.0
Last Updated 15 December 2020, 22:39 (CET)
Created 15 December 2020, 22:39 (CET)