Mendeley_annotated_250_of_11k.csv
Manual annotation of a random sample of 250 papers from the 10,555 papers in the study. The manual examination determined whether each paper did indeed generate gene expression...
manuscript source file (knitr)
The source file for running the statistics and inserting the results into the manuscript text. The manuscript does not include final wording revisions made in the peer review...
pubmed_pmc_ratios.csv
The fraction of PubMed records in PMC that are indexed with the MeSH term “gene expression profiling”, by year of publication, 2000-2011, as measured in 2012.
GEO_dataset_attributes.csv
One row for every GEO dataset reuse detected by searching PMC for GEO accession numbers. Columns list GEO accession, gse number, gds number, related submit_pmids, identified...
PLoSONE2011_rawdata.txt
Data from Piwowar HA (2011) Data from: Who shares? Who doesn’t? Factors associated with openly archiving raw research data. Dryad Digital Repository. doi:10.5061/dryad.mf1sd....
pubmed_gse_count.csv
The number of GSE data sets added to NCBI's GEO repository each year, 2000-2011.
preprocess_raw_data.r
Helper R functions used by stats_knitr_.md to preprocess data before running the statistics.
manuscript compiled file, with stats
This file is the result of running stats_knitr_.md through knitr. It does not include changes made during the peer review process.
tracking1k_20111008.csv
Manually annotated instances of citation contexts for papers that created publicly available datasets. This study explores the subset of the dataset related to GEO data: "dataset...