The search functionality is under construction.

IEICE TRANSACTIONS on Fundamentals

CCTSS: The Combination of CNN and Transformer with Shared Sublayer for Detection and Classification

Aorui GOU, Jingjing LIU, Xiaoxiang CHEN, Xiaoyang ZENG, Yibo FAN

  • Full Text Views

    0

  • Cite this

Summary :

Convolutional Neural Networks (CNNs) and Transformers have achieved remarkable performance in detection and classification tasks. Nevertheless, their feature extraction cannot consider both local and global information, so the detection and classification performance can be further improved. In addition, more and more deep learning networks are designed as more and more complex, and the amount of computation and storage space required is also significantly increased. This paper proposes a combination of CNN and transformer, and designs a local feature enhancement module and global context modeling module to enhance the cascade network. While the local feature enhancement module increases the range of feature extraction, the global context modeling is used to capture the feature maps' global information. To decrease the model complexity, a shared sublayer is designed to realize the sharing of weight parameters between the adjacent convolutional layers or cross convolutional layers, thereby reducing the number of convolutional weight parameters. Moreover, to effectively improve the detection performance of neural networks without increasing network parameters, the optimal transport assignment approach is proposed to resolve the problem of label assignment. The classification loss and regression loss are the summations of the cost between the demander and supplier. The experiment results demonstrate that the proposed Combination of CNN and Transformer with Shared Sublayer (CCTSS) performs better than the state-of-the-art methods in various datasets and applications.

Publication
IEICE TRANSACTIONS on Fundamentals Vol.E107-A No.1 pp.141-156
Publication Date
2024/01/01
Publicized
2023/07/06
Online ISSN
1745-1337
DOI
10.1587/transfun.2022EAP1157
Type of Manuscript
PAPER
Category
Image

Authors

Aorui GOU
  Fudan University
Jingjing LIU
  Fudan University,Shanghai University
Xiaoxiang CHEN
  Fudan University
Xiaoyang ZENG
  Fudan University
Yibo FAN
  Fudan University

Keyword