The search functionality is under construction.

IEICE TRANSACTIONS on Information

Siamese Transformer for Saliency Prediction Based on Multi-Prior Enhancement and Cross-Modal Attention Collaboration

Fazhan YANG, Xingge GUO, Song LIANG, Peipei ZHAO, Shanhua LI

  • Full Text Views

    0

  • Cite this

Summary :

Visual saliency prediction has improved dramatically since the advent of convolutional neural networks (CNN). Although CNN achieves excellent performance, it still cannot learn global and long-range contextual information well and lacks interpretability due to the locality of convolution operations. We proposed a saliency prediction model based on multi-prior enhancement and cross-modal attention collaboration (ME-CAS). Concretely, we designed a transformer-based Siamese network architecture as the backbone for feature extraction. One of the transformer branches captures the context information of the image under the self-attention mechanism to obtain a global saliency map. At the same time, we build a prior learning module to learn the human visual center bias prior, contrast prior, and frequency prior. The multi-prior input to another Siamese branch to learn the detailed features of the underlying visual features and obtain the saliency map of local information. Finally, we use an attention calibration module to guide the cross-modal collaborative learning of global and local information and generate the final saliency map. Extensive experimental results demonstrate that our proposed ME-CAS achieves superior results on public benchmarks and competitors of saliency prediction models. Moreover, the multi-prior learning modules enhance images express salient details, and model interpretability.

Publication
IEICE TRANSACTIONS on Information Vol.E106-D No.9 pp.1572-1583
Publication Date
2023/09/01
Publicized
2023/06/20
Online ISSN
1745-1361
DOI
10.1587/transinf.2022EDP7220
Type of Manuscript
PAPER
Category
Image Recognition, Computer Vision

Authors

Fazhan YANG
  China University of Mining and Technology
Xingge GUO
  China University of Mining and Technology
Song LIANG
  China University of Mining and Technology
Peipei ZHAO
  China University of Mining and Technology
Shanhua LI
  China University of Mining and Technology

Keyword