AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference

Tambe, Thierry; Yang, En-Yu; Wan, Zishen; Deng, Yuntian; Reddi, Vijay Janapa; Rush, Alexander; Brooks, David; Wei, Gu-Yeon

Computer Science > Machine Learning

arXiv:1909.13271 (cs)

[Submitted on 29 Sep 2019 (v1), last revised 11 Feb 2020 (this version, v3)]

Title:AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference

Authors:Thierry Tambe, En-Yu Yang, Zishen Wan, Yuntian Deng, Vijay Janapa Reddi, Alexander Rush, David Brooks, Gu-Yeon Wei

View PDF

Abstract:Conventional hardware-friendly quantization methods, such as fixed-point or integer, tend to perform poorly at very low word sizes as their shrinking dynamic ranges cannot adequately capture the wide data distributions commonly seen in sequence transduction models. We present AdaptivFloat, a floating-point inspired number representation format for deep learning that dynamically maximizes and optimally clips its available dynamic range, at a layer granularity, in order to create faithful encoding of neural network parameters. AdaptivFloat consistently produces higher inference accuracies compared to block floating-point, uniform, IEEE-like float or posit encodings at very low precision ($\leq$ 8-bit) across a diverse set of state-of-the-art neural network topologies. And notably, AdaptivFloat is seen surpassing baseline FP32 performance by up to +0.3 in BLEU score and -0.75 in word error rate at weight bit widths that are $\leq$ 8-bit. Experimental results on a deep neural network (DNN) hardware accelerator, exploiting AdaptivFloat logic in its computational datapath, demonstrate per-operation energy and area that is 0.9$\times$ and 1.14$\times$, respectively, that of equivalent bit width integer-based accelerator variants.

Comments:	10 pages
Subjects:	Machine Learning (cs.LG); Hardware Architecture (cs.AR); Machine Learning (stat.ML)
Cite as:	arXiv:1909.13271 [cs.LG]
	(or arXiv:1909.13271v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1909.13271

Submission history

From: Thierry Tambe [view email]
[v1] Sun, 29 Sep 2019 12:41:46 UTC (9,003 KB)
[v2] Tue, 15 Oct 2019 16:00:21 UTC (9,003 KB)
[v3] Tue, 11 Feb 2020 09:30:21 UTC (9,003 KB)

Computer Science > Machine Learning

Title:AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators