Active Learning for Speech Recognition: the Power of Gradients

Huang, Jiaji; Child, Rewon; Rao, Vinay; Liu, Hairong; Satheesh, Sanjeev; Coates, Adam

Computer Science > Computation and Language

arXiv:1612.03226 (cs)

[Submitted on 10 Dec 2016]

Title:Active Learning for Speech Recognition: the Power of Gradients

Authors:Jiaji Huang, Rewon Child, Vinay Rao, Hairong Liu, Sanjeev Satheesh, Adam Coates

View PDF

Abstract:In training speech recognition systems, labeling audio clips can be expensive, and not all data is equally valuable. Active learning aims to label only the most informative samples to reduce cost. For speech recognition, confidence scores and other likelihood-based active learning methods have been shown to be effective. Gradient-based active learning methods, however, are still not well-understood. This work investigates the Expected Gradient Length (EGL) approach in active learning for end-to-end speech recognition. We justify EGL from a variance reduction perspective, and observe that EGL's measure of informativeness picks novel samples uncorrelated with confidence scores. Experimentally, we show that EGL can reduce word errors by 11\%, or alternatively, reduce the number of samples to label by 50\%, when compared to random sampling.

Comments:	published as a workshop paper at NIPS 2016
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1612.03226 [cs.CL]
	(or arXiv:1612.03226v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1612.03226

Submission history

From: Jiaji Huang Dr. [view email]
[v1] Sat, 10 Dec 2016 00:09:45 UTC (363 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2016-12

Change to browse by:

cs
cs.LG
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jiaji Huang
Rewon Child
Vinay Rao
Hairong Liu
Sanjeev Satheesh

…

export BibTeX citation

Computer Science > Computation and Language

Title:Active Learning for Speech Recognition: the Power of Gradients

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Active Learning for Speech Recognition: the Power of Gradients

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators