The search functionality is under construction.

IEICE TRANSACTIONS on Information

Learning Local Similarity with Spatial Interrelations on Content-Based Image Retrieval

Longjiao ZHAO, Yu WANG, Jien KATO, Yoshiharu ISHIKAWA

  • Full Text Views

    3

  • Cite this

Summary :

Convolutional Neural Networks (CNNs) have recently demonstrated outstanding performance in image retrieval tasks. Local convolutional features extracted by CNNs, in particular, show exceptional capability in discrimination. Recent research in this field has concentrated on pooling methods that incorporate local features into global features and assess the global similarity of two images. However, the pooling methods sacrifice the image's local region information and spatial relationships, which are precisely known as the keys to the robustness against occlusion and viewpoint changes. In this paper, instead of pooling methods, we propose an alternative method based on local similarity, determined by directly using local convolutional features. Specifically, we first define three forms of local similarity tensors (LSTs), which take into account information about local regions as well as spatial relationships between them. We then construct a similarity CNN model (SCNN) based on LSTs to assess the similarity between the query and gallery images. The ideal configuration of our method is sought through thorough experiments from three perspectives: local region size, local region content, and spatial relationships between local regions. The experimental results on a modified open dataset (where query images are limited to occluded ones) confirm that the proposed method outperforms the pooling methods because of robustness enhancement. Furthermore, testing on three public retrieval datasets shows that combining LSTs with conventional pooling methods achieves the best results.

Publication
IEICE TRANSACTIONS on Information Vol.E106-D No.5 pp.1069-1080
Publication Date
2023/05/01
Publicized
2023/02/14
Online ISSN
1745-1361
DOI
10.1587/transinf.2022EDP7163
Type of Manuscript
PAPER
Category
Image Processing and Video Processing

Authors

Longjiao ZHAO
  Nagoya University
Yu WANG
  Hitotsubashi University
Jien KATO
  Ritsumeikan University
Yoshiharu ISHIKAWA
  Nagoya University

Keyword