The search functionality is under construction.

IEICE TRANSACTIONS on Electronics

Open Access
In Search of the Performance- and Energy-Efficient CNN Accelerators

Stanislav SEDUKHIN, Yoichi TOMIOKA, Kohei YAMAMOTO

  • Full Text Views

    60

  • Cite this
  • Free PDF (1.2MB)

Summary :

In this paper, starting from the algorithm, a performance- and energy-efficient 3D structure or shape of the Tensor Processing Engine (TPE) for CNN acceleration is systematically searched and evaluated. An optimal accelerator's shape maximizes the number of concurrent MAC operations per clock cycle while minimizes the number of redundant operations. The proposed 3D vector-parallel TPE architecture with an optimal shape can be very efficiently used for considerable CNN acceleration. Due to implemented support of inter-block image data independency, it is possible to use multiple of such TPEs for the additional CNN acceleration. Moreover, it is shown that the proposed TPE can also be uniformly used for acceleration of the different CNN models such as VGG, ResNet, YOLO, and SSD. We also demonstrate that our theoretical efficiency analysis is matched with the result of a real implementation for an SSD model to which a state-of-the-art channel pruning technique is applied.

Publication
IEICE TRANSACTIONS on Electronics Vol.E105-C No.6 pp.209-221
Publication Date
2022/06/01
Publicized
2021/12/03
Online ISSN
1745-1353
DOI
10.1587/transele.2021LHP0003
Type of Manuscript
Special Section PAPER (Special Section on Low-Power and High-Speed Chips)
Category

Authors

Stanislav SEDUKHIN
  University of Aizu
Yoichi TOMIOKA
  University of Aizu
Kohei YAMAMOTO
  Oki Electric Industry Co., Ltd.

Keyword