Design and Implementation of Deep Neural Network for Edge Computing

Junyang ZHANG; Yang GUO; Xiao HU; Rongzhen LI

doi:10.1587/transinf.2018EDP7044

IEICE TRANSACTIONS on Information

Design and Implementation of Deep Neural Network for Edge Computing

Junyang ZHANG, Yang GUO, Xiao HU, Rongzhen LI

Full Text Views

0

Cite this

Summary :

In recent years, deep learning based image recognition, speech recognition, text translation and other related applications have brought great convenience to people's lives. With the advent of the era of internet of everything, how to run a computationally intensive deep learning algorithm on a limited resources edge device is a major challenge. For an edge oriented computing vector processor, combined with a specific neural network model, a new data layout method for putting the input feature maps in DDR, rearrangement of the convolutional kernel parameters in the nuclear memory bank is proposed. Aiming at the difficulty of parallelism of two-dimensional matrix convolution, a method of parallelizing the matrix convolution calculation in the third dimension is proposed, by setting the vector register with zero as the initial value of the max pooling to fuse the rectified linear unit (ReLU) activation function and pooling operations to reduce the repeated access to intermediate data. On the basis of single core implementation, a multi-core implementation scheme of Inception structure is proposed. Finally, based on the proposed vectorization method, we realize five kinds of neural network models, namely, AlexNet, VGG16, VGG19, GoogLeNet, ResNet18, and performance statistics and analysis based on CPU, gtx1080TI and FT2000 are presented. Experimental results show that the vector processor has better computing advantages than CPU and GPU, and can calculate large-scale neural network model in real time.

Publication: IEICE TRANSACTIONS on Information Vol.E101-D No.8 pp.1982-1996

Publication Date: 2018/08/01

Publicized: 2018/05/02

Online ISSN: 1745-1361

DOI: 10.1587/transinf.2018EDP7044

Type of Manuscript: PAPER

Category: Fundamentals of Information Systems

Authors

Junyang ZHANG
  National University of Defense Technology
Yang GUO
  National University of Defense Technology
Xiao HU
  National University of Defense Technology
Rongzhen LI
  National University of Defense Technology

Keyword

edge computing, vector processor, convolutional neural network, multi-core optimization

Cite this

Copy

Junyang ZHANG, Yang GUO, Xiao HU, Rongzhen LI, "Design and Implementation of Deep Neural Network for Edge Computing" in IEICE TRANSACTIONS on Information, vol. E101-D, no. 8, pp. 1982-1996, August 2018, doi: 10.1587/transinf.2018EDP7044.
Abstract: In recent years, deep learning based image recognition, speech recognition, text translation and other related applications have brought great convenience to people's lives. With the advent of the era of internet of everything, how to run a computationally intensive deep learning algorithm on a limited resources edge device is a major challenge. For an edge oriented computing vector processor, combined with a specific neural network model, a new data layout method for putting the input feature maps in DDR, rearrangement of the convolutional kernel parameters in the nuclear memory bank is proposed. Aiming at the difficulty of parallelism of two-dimensional matrix convolution, a method of parallelizing the matrix convolution calculation in the third dimension is proposed, by setting the vector register with zero as the initial value of the max pooling to fuse the rectified linear unit (ReLU) activation function and pooling operations to reduce the repeated access to intermediate data. On the basis of single core implementation, a multi-core implementation scheme of Inception structure is proposed. Finally, based on the proposed vectorization method, we realize five kinds of neural network models, namely, AlexNet, VGG16, VGG19, GoogLeNet, ResNet18, and performance statistics and analysis based on CPU, gtx1080TI and FT2000 are presented. Experimental results show that the vector processor has better computing advantages than CPU and GPU, and can calculate large-scale neural network model in real time.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.2018EDP7044/_p

Copy

@ARTICLE{e101-d_8_1982,
author={Junyang ZHANG, Yang GUO, Xiao HU, Rongzhen LI, },
journal={IEICE TRANSACTIONS on Information},
title={Design and Implementation of Deep Neural Network for Edge Computing},
year={2018},
volume={E101-D},
number={8},
pages={1982-1996},
abstract={In recent years, deep learning based image recognition, speech recognition, text translation and other related applications have brought great convenience to people's lives. With the advent of the era of internet of everything, how to run a computationally intensive deep learning algorithm on a limited resources edge device is a major challenge. For an edge oriented computing vector processor, combined with a specific neural network model, a new data layout method for putting the input feature maps in DDR, rearrangement of the convolutional kernel parameters in the nuclear memory bank is proposed. Aiming at the difficulty of parallelism of two-dimensional matrix convolution, a method of parallelizing the matrix convolution calculation in the third dimension is proposed, by setting the vector register with zero as the initial value of the max pooling to fuse the rectified linear unit (ReLU) activation function and pooling operations to reduce the repeated access to intermediate data. On the basis of single core implementation, a multi-core implementation scheme of Inception structure is proposed. Finally, based on the proposed vectorization method, we realize five kinds of neural network models, namely, AlexNet, VGG16, VGG19, GoogLeNet, ResNet18, and performance statistics and analysis based on CPU, gtx1080TI and FT2000 are presented. Experimental results show that the vector processor has better computing advantages than CPU and GPU, and can calculate large-scale neural network model in real time.},
keywords={},
doi={10.1587/transinf.2018EDP7044},
ISSN={1745-1361},
month={August},}

Copy

TY - JOUR
TI - Design and Implementation of Deep Neural Network for Edge Computing
T2 - IEICE TRANSACTIONS on Information
SP - 1982
EP - 1996
AU - Junyang ZHANG
AU - Yang GUO
AU - Xiao HU
AU - Rongzhen LI
PY - 2018
DO - 10.1587/transinf.2018EDP7044
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E101-D
IS - 8
JA - IEICE TRANSACTIONS on Information
Y1 - August 2018
AB - In recent years, deep learning based image recognition, speech recognition, text translation and other related applications have brought great convenience to people's lives. With the advent of the era of internet of everything, how to run a computationally intensive deep learning algorithm on a limited resources edge device is a major challenge. For an edge oriented computing vector processor, combined with a specific neural network model, a new data layout method for putting the input feature maps in DDR, rearrangement of the convolutional kernel parameters in the nuclear memory bank is proposed. Aiming at the difficulty of parallelism of two-dimensional matrix convolution, a method of parallelizing the matrix convolution calculation in the third dimension is proposed, by setting the vector register with zero as the initial value of the max pooling to fuse the rectified linear unit (ReLU) activation function and pooling operations to reduce the repeated access to intermediate data. On the basis of single core implementation, a multi-core implementation scheme of Inception structure is proposed. Finally, based on the proposed vectorization method, we realize five kinds of neural network models, namely, AlexNet, VGG16, VGG19, GoogLeNet, ResNet18, and performance statistics and analysis based on CPU, gtx1080TI and FT2000 are presented. Experimental results show that the vector processor has better computing advantages than CPU and GPU, and can calculate large-scale neural network model in real time.
ER -

IEICE TRANSACTIONS on Information