A network of deep neural networks for distant speech recognition

Ravanelli, Mirco; Brakel, Philemon; Omologo, Maurizio; Bengio, Yoshua

Computer Science > Computation and Language

arXiv:1703.08002 (cs)

[Submitted on 23 Mar 2017]

Title:A network of deep neural networks for distant speech recognition

Authors:Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

View PDF

Abstract:Despite the remarkable progress recently made in distant speech recognition, state-of-the-art technology still suffers from a lack of robustness, especially when adverse acoustic conditions characterized by non-stationary noises and reverberation are met. A prominent limitation of current systems lies in the lack of matching and communication between the various technologies involved in the distant speech recognition process. The speech enhancement and speech recognition modules are, for instance, often trained independently. Moreover, the speech enhancement normally helps the speech recognizer, but the output of the latter is not commonly used, in turn, to improve the speech enhancement. To address both concerns, we propose a novel architecture based on a network of deep neural networks, where all the components are jointly trained and better cooperate with each other thanks to a full communication scheme between them. Experiments, conducted using different datasets, tasks and acoustic conditions, revealed that the proposed framework can overtake other competitive solutions, including recent joint training approaches.

Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1703.08002 [cs.CL]
	(or arXiv:1703.08002v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1703.08002

Submission history

From: Mirco Ravanelli [view email]
[v1] Thu, 23 Mar 2017 11:02:47 UTC (168 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2017-03

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Mirco Ravanelli
Philemon Brakel
Maurizio Omologo
Yoshua Bengio

export BibTeX citation

Computer Science > Computation and Language

Title:A network of deep neural networks for distant speech recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A network of deep neural networks for distant speech recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators