Profiling based Out-of-core Hybrid Method for Large Neural Networks

Ito, Yuki; Imai, Haruki; Duc, Tung Le; Negishi, Yasushi; Kawachiya, Kiyokuni; Matsumiya, Ryo; Endo, Toshio

Computer Science > Machine Learning

arXiv:1907.05013 (cs)

[Submitted on 11 Jul 2019]

Title:Profiling based Out-of-core Hybrid Method for Large Neural Networks

Authors:Yuki Ito, Haruki Imai, Tung Le Duc, Yasushi Negishi, Kiyokuni Kawachiya, Ryo Matsumiya, Toshio Endo

View PDF

Abstract:GPUs are widely used to accelerate deep learning with NNs (NNs). On the other hand, since GPU memory capacity is limited, it is difficult to implement efficient programs that compute large NNs on GPU. To compute NNs exceeding GPU memory capacity, data-swapping method and recomputing method have been proposed in existing work. However, in these methods, performance overhead occurs due to data movement or increase of computation. In order to reduce the overhead, it is important to consider characteristics of each layer such as sizes and cost for recomputation. Based on this direction, we proposed Profiling based out-of-core Hybrid method (PoocH). PoocH determines target layers of swapping or recomputing based on runtime profiling. We implemented PoocH by extending a deep learning framework, Chainer, and we evaluated its performance. With PoocH, we successfully computed an NN requiring 50 GB memory on a single GPU with 16 GB memory. Compared with in-core cases, performance degradation was 38 \% on x86 machine and 28 \% on POWER9 machine.

Comments:	15 pages
Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Performance (cs.PF)
Cite as:	arXiv:1907.05013 [cs.LG]
	(or arXiv:1907.05013v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1907.05013

Submission history

From: Toshio Endo [view email]
[v1] Thu, 11 Jul 2019 06:31:38 UTC (748 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-07

Change to browse by:

cs
cs.DC
cs.PF

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yuki Ito
Haruki Imai
Tung Le Duc
Yasushi Negishi
Kiyokuni Kawachiya

…

export BibTeX citation

Computer Science > Machine Learning

Title:Profiling based Out-of-core Hybrid Method for Large Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Profiling based Out-of-core Hybrid Method for Large Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators