A New Tidy Data Structure to Support Exploration and Modeling of Temporal Data

Mining temporal data for information is often inhibited by a multitude of formats: regular or irregular time intervals, point events that need aggregating, multiple observational units or repeated measurements on multiple individuals, and heterogeneous data types. This work presents a cohesive and conceptual framework for organizing and manipulating temporal data, which in turn flows into visualization, modeling, and forecasting routines. Tidy data principles are extended to temporal data by: (1) mapping the semantics of a dataset into its physical layout; (2) including an explicitly declared “index” variable representing time; (3) incorporating a “key” comprising single or multiple variables to uniquely identify units over time. This tidy data representation most naturally supports thinking of operations on the data as building blocks, forming part of a “data pipeline” in time-based contexts. A sound data pipeline facilitates a fluent workflow for analyzing temporal data. The infrastructure of tidy temporal data has been implemented in the R package, called tsibble. Supplementary materials for this article are available online.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.6084/m9.figshare.10770992.v2
PID https://www.doi.org/10.6084/m9.figshare.10770992
PID https://www.doi.org/10.6084/m9.figshare.10770992.v1
URL http://dx.doi.org/10.6084/m9.figshare.10770992.v1
URL http://dx.doi.org/10.6084/m9.figshare.10770992.v2
URL https://dx.doi.org/10.6084/m9.figshare.10770992.v1
URL https://dx.doi.org/10.6084/m9.figshare.10770992
URL http://dx.doi.org/10.6084/m9.figshare.10770992
URL https://figshare.com/articles/A_new_tidy_data_structure_to_support_exploration_and_modeling_of_temporal_data/10770992
URL https://dx.doi.org/10.6084/m9.figshare.10770992.v2
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Earo Wang
Author Cook, Dianne
Author Hyndman, Rob J
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Datacite; figshare
Hosted By figshare
Publication Date 2019-11-22
Publisher Taylor & Francis
Additional Info
Field Value
Language Undetermined
Resource Type Dataset
keyword FOS: Chemical sciences
keyword FOS: Biological sciences
keyword FOS: Computer and information sciences
system:type dataset
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/dataset?datasetId=dedup_wf_001::3805aea39d973b3b8690517a744b0081
Author jsonws_user
Last Updated 16 December 2020, 22:25 (CET)
Created 16 December 2020, 22:25 (CET)