Gender Recognition Using a Gaze-Guided Self-Attention Mechanism Robust Against Background Bias in Training Samples

Masashi NISHIYAMA; Michiko INOUE; Yoshio IWAI

doi:10.1587/transinf.2021EDP7117

IEICE TRANSACTIONS on Information

Gender Recognition Using a Gaze-Guided Self-Attention Mechanism Robust Against Background Bias in Training Samples

Masashi NISHIYAMA, Michiko INOUE, Yoshio IWAI

Full Text Views

0

Cite this

Summary :

We propose an attention mechanism in deep learning networks for gender recognition using the gaze distribution of human observers when they judge the gender of people in pedestrian images. Prevalent attention mechanisms spatially compute the correlation among values of all cells in an input feature map to calculate attention weights. If a large bias in the background of pedestrian images (e.g., test samples and training samples containing different backgrounds) is present, the attention weights learned using the prevalent attention mechanisms are affected by the bias, which in turn reduces the accuracy of gender recognition. To avoid this problem, we incorporate an attention mechanism called gaze-guided self-attention (GSA) that is inspired by human visual attention. Our method assigns spatially suitable attention weights to each input feature map using the gaze distribution of human observers. In particular, GSA yields promising results even when using training samples with the background bias. The results of experiments on publicly available datasets confirm that our GSA, using the gaze distribution, is more accurate in gender recognition than currently available attention-based methods in the case of background bias between training and test samples.

Publication: IEICE TRANSACTIONS on Information Vol.E105-D No.2 pp.415-426

Publication Date: 2022/02/01

Publicized: 2021/11/18

Online ISSN: 1745-1361

DOI: 10.1587/transinf.2021EDP7117

Type of Manuscript: PAPER

Category: Image Recognition, Computer Vision

Authors

Masashi NISHIYAMA
  Tottori University
Michiko INOUE
  Tottori University
Yoshio IWAI
  Tottori University

Keyword

gaze distribution, attention mechanism, convolutional neural network, gender recognition, self-attention

Cite this

Copy

Masashi NISHIYAMA, Michiko INOUE, Yoshio IWAI, "Gender Recognition Using a Gaze-Guided Self-Attention Mechanism Robust Against Background Bias in Training Samples" in IEICE TRANSACTIONS on Information, vol. E105-D, no. 2, pp. 415-426, February 2022, doi: 10.1587/transinf.2021EDP7117.
Abstract: We propose an attention mechanism in deep learning networks for gender recognition using the gaze distribution of human observers when they judge the gender of people in pedestrian images. Prevalent attention mechanisms spatially compute the correlation among values of all cells in an input feature map to calculate attention weights. If a large bias in the background of pedestrian images (e.g., test samples and training samples containing different backgrounds) is present, the attention weights learned using the prevalent attention mechanisms are affected by the bias, which in turn reduces the accuracy of gender recognition. To avoid this problem, we incorporate an attention mechanism called gaze-guided self-attention (GSA) that is inspired by human visual attention. Our method assigns spatially suitable attention weights to each input feature map using the gaze distribution of human observers. In particular, GSA yields promising results even when using training samples with the background bias. The results of experiments on publicly available datasets confirm that our GSA, using the gaze distribution, is more accurate in gender recognition than currently available attention-based methods in the case of background bias between training and test samples.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.2021EDP7117/_p

Copy

@ARTICLE{e105-d_2_415,
author={Masashi NISHIYAMA, Michiko INOUE, Yoshio IWAI, },
journal={IEICE TRANSACTIONS on Information},
title={Gender Recognition Using a Gaze-Guided Self-Attention Mechanism Robust Against Background Bias in Training Samples},
year={2022},
volume={E105-D},
number={2},
pages={415-426},
abstract={We propose an attention mechanism in deep learning networks for gender recognition using the gaze distribution of human observers when they judge the gender of people in pedestrian images. Prevalent attention mechanisms spatially compute the correlation among values of all cells in an input feature map to calculate attention weights. If a large bias in the background of pedestrian images (e.g., test samples and training samples containing different backgrounds) is present, the attention weights learned using the prevalent attention mechanisms are affected by the bias, which in turn reduces the accuracy of gender recognition. To avoid this problem, we incorporate an attention mechanism called gaze-guided self-attention (GSA) that is inspired by human visual attention. Our method assigns spatially suitable attention weights to each input feature map using the gaze distribution of human observers. In particular, GSA yields promising results even when using training samples with the background bias. The results of experiments on publicly available datasets confirm that our GSA, using the gaze distribution, is more accurate in gender recognition than currently available attention-based methods in the case of background bias between training and test samples.},
keywords={},
doi={10.1587/transinf.2021EDP7117},
ISSN={1745-1361},
month={February},}

Copy

TY - JOUR
TI - Gender Recognition Using a Gaze-Guided Self-Attention Mechanism Robust Against Background Bias in Training Samples
T2 - IEICE TRANSACTIONS on Information
SP - 415
EP - 426
AU - Masashi NISHIYAMA
AU - Michiko INOUE
AU - Yoshio IWAI
PY - 2022
DO - 10.1587/transinf.2021EDP7117
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E105-D
IS - 2
JA - IEICE TRANSACTIONS on Information
Y1 - February 2022
AB - We propose an attention mechanism in deep learning networks for gender recognition using the gaze distribution of human observers when they judge the gender of people in pedestrian images. Prevalent attention mechanisms spatially compute the correlation among values of all cells in an input feature map to calculate attention weights. If a large bias in the background of pedestrian images (e.g., test samples and training samples containing different backgrounds) is present, the attention weights learned using the prevalent attention mechanisms are affected by the bias, which in turn reduces the accuracy of gender recognition. To avoid this problem, we incorporate an attention mechanism called gaze-guided self-attention (GSA) that is inspired by human visual attention. Our method assigns spatially suitable attention weights to each input feature map using the gaze distribution of human observers. In particular, GSA yields promising results even when using training samples with the background bias. The results of experiments on publicly available datasets confirm that our GSA, using the gaze distribution, is more accurate in gender recognition than currently available attention-based methods in the case of background bias between training and test samples.
ER -

IEICE TRANSACTIONS on Information