Towards Robust Deep Neural Networks for Affect and Depression Recognition from Speech

Othmani, Alice; Kadoch, Daoud; Bentounes, Kamil; Rejaibi, Emna; Alfred, Romain; Hadid, Abdenour

arXiv:1911.00310 (cs)

[Submitted on 1 Nov 2019 (v1), last revised 18 Nov 2020 (this version, v4)]

Abstract: Intelligent monitoring systems and affective computing applications have emerged in recent years to enhance healthcare. Examples of these applications include the assessment of affective states such as Major Depressive Disorder (MDD). MDD describes the constant expression of certain emotions: negative emotions (low valence) and lack of interest (low arousal). High-performing intelligent systems would enhance MDD diagnosis in its early stages. In this paper, we present a new deep neural network architecture, called EmoAudioNet, for emotion and depression recognition from speech. Deep EmoAudioNet learns from the time-frequency representation of the audio signal and the visual representation of its spectrum of frequencies. Our model shows very promising results in predicting affect and depression. It performs comparably to or better than state-of-the-art methods according to several evaluation metrics on the RECOLA and DAIC-WOZ datasets in predicting arousal, valence, and depression. The code of EmoAudioNet is publicly available on GitHub: this https URL

Comments:	16 pages, 2 figures, 1 algorithm and 6 tables
Subjects:	Human-Computer Interaction (cs.HC); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:1911.00310 [cs.HC]
	(or arXiv:1911.00310v4 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.1911.00310
Journal reference:	ICPR CAIHA 2020 workshop

Submission history

From: Emna Rejaibi
[v1] Fri, 1 Nov 2019 11:38:58 UTC (4,866 KB)
[v2] Fri, 27 Mar 2020 16:23:15 UTC (503 KB)
[v3] Fri, 17 Apr 2020 08:49:25 UTC (475 KB)
[v4] Wed, 18 Nov 2020 16:17:03 UTC (426 KB)
