Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques

Wu, Jyun-Yi; Yu, Cheng; Fu, Szu-Wei; Liu, Chih-Ting; Chien, Shao-Yi; Tsao, Yu

doi:10.1109/LSP.2019.2951950

arXiv:1906.01078 (eess)

[Submitted on 31 May 2019 (v1), last revised 31 Jul 2019 (this version, v2)]

Abstract: Most recent studies on deep learning based speech enhancement (SE) have focused on improving denoising performance. However, successful SE applications require a good balance between denoising performance and computational cost in real scenarios. In this study, we propose a novel parameter pruning (PP) technique that removes redundant channels in a neural network. In addition, a parameter quantization (PQ) technique is applied to reduce the size of a neural network by representing weights with fewer cluster centroids. Because the two techniques are derived from different concepts, PP and PQ can be integrated to provide even more compact SE models. The experimental results show that combining PP and PQ produces a compact SE model that is only 10.03% of the size of the original model, with minor performance losses of 1.43% (from 0.70 to 0.69) for STOI and 3.24% (from 1.85 to 1.79) for PESQ. These promising results suggest that the PP and PQ techniques can be used in an SE system on devices with limited storage and computation resources.
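The two ideas named in the abstract, removing channels and sharing weights through cluster centroids, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The abstract does not state how redundant channels are identified, so the L1-norm ranking in `prune_channels` is an assumption chosen only for illustration, and `quantize_weights` uses plain 1-D k-means as one common way to obtain centroids. All function names and parameters here are hypothetical. The helper `relative_loss` reproduces the percentage arithmetic quoted for STOI and PESQ.

```python
def prune_channels(weights, keep_ratio):
    """Keep the channels with the largest L1 norm.

    weights: list of channels, each a flat list of floats.
    ASSUMPTION: the L1-norm criterion is illustrative only; the
    abstract does not say how redundancy is measured.
    Returns (pruned_weights, kept_indices).
    """
    n_keep = max(1, round(len(weights) * keep_ratio))
    ranked = sorted(
        range(len(weights)),
        key=lambda i: sum(abs(w) for w in weights[i]),
        reverse=True,
    )
    kept = sorted(ranked[:n_keep])  # preserve the original channel order
    return [weights[i] for i in kept], kept


def _nearest(value, centroids):
    """Index of the centroid closest to value (lowest index on ties)."""
    return min(range(len(centroids)), key=lambda k: abs(value - centroids[k]))


def quantize_weights(values, n_clusters, n_iters=20):
    """1-D k-means weight sharing (an assumed clustering method).

    Each weight is then represented by the index of its centroid, so
    only the centroid table and the indices need to be stored.
    Returns (centroids, assignments).
    """
    lo, hi = min(values), max(values)
    if n_clusters == 1:
        centroids = [(lo + hi) / 2.0]
    else:
        # Initialise centroids evenly across the weight range.
        centroids = [lo + (hi - lo) * k / (n_clusters - 1)
                     for k in range(n_clusters)]
    for _ in range(n_iters):
        assign = [_nearest(v, centroids) for v in values]
        for k in range(n_clusters):
            members = [v for v, a in zip(values, assign) if a == k]
            if members:  # leave an empty cluster's centroid unchanged
                centroids[k] = sum(members) / len(members)
    # Recompute assignments against the final centroids.
    assign = [_nearest(v, centroids) for v in values]
    return centroids, assign


def relative_loss(before, after):
    """Relative drop in a score, in percent."""
    return (before - after) / before * 100.0
```

With these definitions, `relative_loss(0.70, 0.69)` and `relative_loss(1.85, 1.79)` round to the 1.43% and 3.24% figures reported for STOI and PESQ. Because pruning shrinks the number of stored weights and quantization shrinks the storage per weight, the two steps can be applied one after the other, which matches the abstract's point that they rest on different concepts.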

Comments:	4 pages, 6 figures
Subjects:	Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
Cite as:	arXiv:1906.01078 [eess.AS]
	(or arXiv:1906.01078v2 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.1906.01078
Related DOI:	https://doi.org/10.1109/LSP.2019.2951950

Submission history

From: Jyun-Yi Wu
[v1] Fri, 31 May 2019 04:07:20 UTC (799 KB)
[v2] Wed, 31 Jul 2019 18:22:51 UTC (915 KB)
