Shared acoustic codes underlie emotional communication in music and speech—Evidence from deep transfer learning

Music and speech exhibit striking similarities in the communication of emotions in the acoustic domain, in such a way that the communication of specific emotions is achieved, at least to a certain extent, by means of shared acoustic patterns. From an Affective Sciences points of view, determining the degree of overlap between both domains is fundamental to understand the shared mechanisms underlying such phenomenon. From a Machine learning perspective, the overlap between acoustic codes for emotional expression in music and speech opens new possibilities to enlarge the amount of data available to develop music and speech emotion recognition systems. In this article, we investigate time-continuous predictions of emotion (Arousal and Valence) in music and speech, and the Transfer Learning between these domains. We establish a comparative framework including intra- (i.e., models trained and tested on the same modality, either music or speech) and cross-domain experiments (i.e., models trained in one modality and tested on the other). In the cross-domain context, we evaluated two strategies-the direct transfer between domains, and the contribution of Transfer Learning techniques (feature-representation-transfer based on Denoising Auto Encoders) for reducing the gap in the feature space distributions. Our results demonstrate an excellent cross-domain generalisation performance with and without feature representation transfer in both directions. In the case of music, cross-domain approaches outperformed intra-domain models for Valence estimation, whereas for Speech intra-domain models achieve the best performance. This is the first demonstration of shared acoustic codes for emotional expression in music and speech in the time-continuous domain.

Tags
Data and Resources
To access the resources you must log in

This item has no data

Identity

Description: The Identity category includes attributes that support the identification of the resource.

Field Value
PID https://www.doi.org/10.1371/journal.pone.0179289
PID pmc:PMC5489171
PID pmid:28658285
PID pmid:29352291
URL https://opus.bibliothek.uni-augsburg.de/opus4/files/71765/journal.pone.0179289.pdf
URL http://europepmc.org/articles/PMC5489171
URL https://academic.microsoft.com/#/detail/2735654625
URL https://opus.bibliothek.uni-augsburg.de/opus4/frontdoor/index/index/docId/71765
URL https://ui.adsabs.harvard.edu/abs/2018PLoSO..1391754C/abstract
URL https://nbn-resolving.org/urn:nbn:de:bvb:384-opus4-717654
URL http://dx.plos.org/10.1371/journal.pone.0179289
URL https://www.ncbi.nlm.nih.gov/pubmed/28658285
URL https://dx.doi.org/10.1371/journal.pone.0179289
URL https://paperity.org/p/80416506/shared-acoustic-codes-underlie-emotional-communication-in-music-and-speech-evidence-from
URL http://datacat.liverpool.ac.uk/718/
URL http://livrepository.liverpool.ac.uk/3008278/1/journal.pone.0179289.pdf
URL https://doaj.org/article/1caaad9c0e8d470a87c95e27edeeff44
URL https://doi.org/10.1371/journal.pone.0179289
URL https://core.ac.uk/display/131165773
URL https://livrepository.liverpool.ac.uk/3072453/
URL https://doaj.org/toc/1932-6203
URL https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0179289&type=printable
URL https://dx.plos.org/10.1371/journal.pone.0179289
URL http://dx.doi.org/10.1371/journal.pone.0179289
URL https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0179289
URL http://europepmc.org/articles/PMC5489171?pdf=render
Access Modality

Description: The Access Modality category includes attributes that report the modality of exploitation of the resource.

Field Value
Access Right Open Access
Attribution

Description: Authorships and contributors

Field Value
Author Coutinho, Eduardo, 0000-0001-5234-1497
Author Schuller, Björn
Publishing

Description: Attributes about the publishing venue (e.g. journal) and deposit location (e.g. repository)

Field Value
Collected From Europe PubMed Central; PubMed Central; OPUS Augsburg; ORCID; UnpayWall; Datacite; DOAJ-Articles; Crossref; Microsoft Academic Graph; CORE (RIOXX-UK Aggregator)
Hosted By Europe PubMed Central; OPUS Augsburg; PLoS ONE; University of Liverpool Repository
Publication Date 2017-06-28
Publisher Public Library of Science (PLoS)
Additional Info
Field Value
Country United Kingdom; Germany
Format application/pdf
Language English
Resource Type Article; UNKNOWN
keyword ddc.ddc:004
keyword Q
keyword R
keyword keywords.General Biochemistry, Genetics and Molecular Biology
system:type publication
Management Info
Field Value
Source https://science-innovation-policy.openaire.eu/search/publication?articleId=dedup_wf_001::98f0a4ee087e2bc5457178a66f99cc4d
Author jsonws_user
Last Updated 26 December 2020, 14:39 (CET)
Created 26 December 2020, 14:39 (CET)