GAN-based Image Translation Model with Self-Attention for Nighttime Dashcam Data Augmentation

Rebeka SULTANA; Gosuke OHASHI

doi:10.1587/transfun.2022IMP0004

IEICE TRANSACTIONS on Fundamentals

GAN-based Image Translation Model with Self-Attention for Nighttime Dashcam Data Augmentation

Rebeka SULTANA, Gosuke OHASHI

Full Text Views

0

Cite this

Summary :

High-performance deep learning-based object detection models can reduce traffic accidents using dashcam images during nighttime driving. Deep learning requires a large-scale dataset to obtain a high-performance model. However, existing object detection datasets are mostly daytime scenes and a few nighttime scenes. Increasing the nighttime dataset is laborious and time-consuming. In such a case, it is possible to convert daytime images to nighttime images by image-to-image translation model to augment the nighttime dataset with less effort so that the translated dataset can utilize the annotations of the daytime dataset. Therefore, in this study, a GAN-based image-to-image translation model is proposed by incorporating self-attention with cycle consistency and content/style separation for nighttime data augmentation that shows high fidelity to annotations of the daytime dataset. Experimental results highlight the effectiveness of the proposed model compared with other models in terms of translated images and FID scores. Moreover, the high fidelity of translated images to the annotations is verified by a small object detection model according to detection results and mAP. Ablation studies confirm the effectiveness of self-attention in the proposed model. As a contribution to GAN-based data augmentation, the source code of the proposed image translation model is publicly available at https://github.com/subecky/Image-Translation-With-Self-Attention

Publication: IEICE TRANSACTIONS on Fundamentals Vol.E106-A No.9 pp.1202-1210

Publication Date: 2023/09/01

Publicized: 2023/06/27

Online ISSN: 1745-1337

DOI: 10.1587/transfun.2022IMP0004

Type of Manuscript: Special Section PAPER (Special Section on Image Media Quality)

Category: Intelligent Transport System

Authors

Rebeka SULTANA
Shizuoka University
Gosuke OHASHI
Shizuoka University

Keyword

GAN, image-to-image translation, self-attention, data augmentation, nighttime dashcam image, object detection, ADAS

Cite this

Copy

Rebeka SULTANA, Gosuke OHASHI, "GAN-based Image Translation Model with Self-Attention for Nighttime Dashcam Data Augmentation" in IEICE TRANSACTIONS on Fundamentals, vol. E106-A, no. 9, pp. 1202-1210, September 2023, doi: 10.1587/transfun.2022IMP0004.
Abstract: High-performance deep learning-based object detection models can reduce traffic accidents using dashcam images during nighttime driving. Deep learning requires a large-scale dataset to obtain a high-performance model. However, existing object detection datasets are mostly daytime scenes and a few nighttime scenes. Increasing the nighttime dataset is laborious and time-consuming. In such a case, it is possible to convert daytime images to nighttime images by image-to-image translation model to augment the nighttime dataset with less effort so that the translated dataset can utilize the annotations of the daytime dataset. Therefore, in this study, a GAN-based image-to-image translation model is proposed by incorporating self-attention with cycle consistency and content/style separation for nighttime data augmentation that shows high fidelity to annotations of the daytime dataset. Experimental results highlight the effectiveness of the proposed model compared with other models in terms of translated images and FID scores. Moreover, the high fidelity of translated images to the annotations is verified by a small object detection model according to detection results and mAP. Ablation studies confirm the effectiveness of self-attention in the proposed model. As a contribution to GAN-based data augmentation, the source code of the proposed image translation model is publicly available at https://github.com/subecky/Image-Translation-With-Self-Attention
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/transfun.2022IMP0004/_p

Copy

@ARTICLE{e106-a_9_1202,
author={Rebeka SULTANA, Gosuke OHASHI, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={GAN-based Image Translation Model with Self-Attention for Nighttime Dashcam Data Augmentation},
year={2023},
volume={E106-A},
number={9},
pages={1202-1210},
abstract={High-performance deep learning-based object detection models can reduce traffic accidents using dashcam images during nighttime driving. Deep learning requires a large-scale dataset to obtain a high-performance model. However, existing object detection datasets are mostly daytime scenes and a few nighttime scenes. Increasing the nighttime dataset is laborious and time-consuming. In such a case, it is possible to convert daytime images to nighttime images by image-to-image translation model to augment the nighttime dataset with less effort so that the translated dataset can utilize the annotations of the daytime dataset. Therefore, in this study, a GAN-based image-to-image translation model is proposed by incorporating self-attention with cycle consistency and content/style separation for nighttime data augmentation that shows high fidelity to annotations of the daytime dataset. Experimental results highlight the effectiveness of the proposed model compared with other models in terms of translated images and FID scores. Moreover, the high fidelity of translated images to the annotations is verified by a small object detection model according to detection results and mAP. Ablation studies confirm the effectiveness of self-attention in the proposed model. As a contribution to GAN-based data augmentation, the source code of the proposed image translation model is publicly available at https://github.com/subecky/Image-Translation-With-Self-Attention},
keywords={},
doi={10.1587/transfun.2022IMP0004},
ISSN={1745-1337},
month={September},}

Copy

TY - JOUR
TI - GAN-based Image Translation Model with Self-Attention for Nighttime Dashcam Data Augmentation
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 1202
EP - 1210
AU - Rebeka SULTANA
AU - Gosuke OHASHI
PY - 2023
DO - 10.1587/transfun.2022IMP0004
JO - IEICE TRANSACTIONS on Fundamentals
SN - 1745-1337
VL - E106-A
IS - 9
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - September 2023
AB - High-performance deep learning-based object detection models can reduce traffic accidents using dashcam images during nighttime driving. Deep learning requires a large-scale dataset to obtain a high-performance model. However, existing object detection datasets are mostly daytime scenes and a few nighttime scenes. Increasing the nighttime dataset is laborious and time-consuming. In such a case, it is possible to convert daytime images to nighttime images by image-to-image translation model to augment the nighttime dataset with less effort so that the translated dataset can utilize the annotations of the daytime dataset. Therefore, in this study, a GAN-based image-to-image translation model is proposed by incorporating self-attention with cycle consistency and content/style separation for nighttime data augmentation that shows high fidelity to annotations of the daytime dataset. Experimental results highlight the effectiveness of the proposed model compared with other models in terms of translated images and FID scores. Moreover, the high fidelity of translated images to the annotations is verified by a small object detection model according to detection results and mAP. Ablation studies confirm the effectiveness of self-attention in the proposed model. As a contribution to GAN-based data augmentation, the source code of the proposed image translation model is publicly available at https://github.com/subecky/Image-Translation-With-Self-Attention
ER -

IEICE TRANSACTIONS on Fundamentals