A Twitter Dataset for Spatial Infectious Disease Surveillance

Dengue is a mosquito-borne viral disease which infects millions of people every year, specially in developing countries. Some of the main challenges facing the disease are reporting risk indicators and rapidly detecting outbreaks. Traditional surveillance systems rely on passive reporting from health-care facilities, often ignoring human mobility and locating each individual by their home address. Yet, geolocated data are becoming commonplace in social media, which is widely used as means to discuss a large variety of health topics, including the users' health status. In this dataset paper, we make available two large collections of dengue related labeled Twitter data. One is a set of tweets available through the Streaming API using the keywords dengue and aedes from 2010 to 2016. The other is the set of all geolocated tweets in Brazil during the year of 2015 (available also through the Streaming API). We detail the process of collecting and labeling each tweet containing keywords related to dengue in one of 5 categories: personal experience, information, opinion, campaign, and joke. This dataset can be useful for the development of models for spatial disease surveillance, but also scenarios such as understanding health-related content in a language other than English, and studying human mobility.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.5281/zenodo.2541439
PID https://www.doi.org/10.5281/zenodo.2541440
URL https://figshare.com/articles/A_Twitter_Dataset_for_Spatial_Infectious_Disease_Surveillance/7598927
URL http://dx.doi.org/10.5281/zenodo.2541439
URL https://zenodo.org/record/2541440
URL http://dx.doi.org/10.5281/zenodo.2541440
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author C.S.N.P. Souza, Roberto
Author Horta Ribeiro, Manoel
Author Meira Jr., Wagner
Author M. Assuncao, Renato
Author dos Santos, Walter
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Zenodo; Datacite; figshare
Hosted By Zenodo; figshare
Publication Date 2019-01-15
Publisher Zenodo
Additional Info
Field Value
Language Portuguese
Resource Type Dataset
system:type dataset
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/dataset?datasetId=dedup_wf_001::fdf99c5e27924d12774371e190569e6a
Author jsonws_user
Version 1
Last Updated 31 December 2020, 15:49 (CET)
Created 31 December 2020, 15:49 (CET)