The search functionality is under construction.
The search functionality is under construction.

Real-Time and Energy-Efficient Face Detection on CPU-GPU Heterogeneous Embedded Platforms

Chanyoung OH, Saehanseul YI, Youngmin YI

  • Full Text Views

    0

  • Cite this

Summary :

As energy efficiency has become a major design constraint or objective, heterogeneous manycore architectures have emerged as mainstream target platforms not only in server systems but also in embedded systems. Manycore accelerators such as GPUs are getting also popular in embedded domains, as well as the heterogeneous CPU cores. However, as the number of cores in an embedded GPU is far less than that of a server GPU, it is important to utilize both heterogeneous multi-core CPUs and GPUs to achieve the desired throughput with the minimal energy consumption. In this paper, we present a case study of mapping LBP-based face detection onto a recent CPU-GPU heterogeneous embedded platform, which exploits both task parallelism and data parallelism to achieve maximal energy efficiency with a real-time constraint. We first present the parallelization technique of each task for the GPU execution, then we propose performance and energy models for both task-parallel and data-parallel executions on heterogeneous processors, which are used in design space exploration for the optimal mapping. The design space is huge since not only processor heterogeneity such as CPU-GPU and big.LITTLE, but also various data partitioning ratios for the data-parallel execution on these heterogeneous processors are considered. In our case study of LBP face detection on Exynos 5422, the estimation error of the proposed performance and energy models were on average -2.19% and -3.67% respectively. By systematically finding the optimal mappings with the proposed models, we could achieve 28.6% less energy consumption compared to the manual mapping, while still meeting the real-time constraint.

Publication
IEICE TRANSACTIONS on Information Vol.E101-D No.12 pp.2878-2888
Publication Date
2018/12/01
Publicized
2018/09/18
Online ISSN
1745-1361
DOI
10.1587/transinf.2018PAP0004
Type of Manuscript
Special Section PAPER (Special Section on Parallel and Distributed Computing and Networking)
Category
Real-time Systems

Authors

Chanyoung OH
  University of Seoul
Saehanseul YI
  University of Seoul
Youngmin YI
  University of Seoul

Keyword