The search functionality is under construction.
The search functionality is under construction.

Keyword Search Result

[Keyword] Si(16348hit)

21-40hit(16348hit)

  • Throughput Maximization-Based AP Clustering Methods in Downlink Cell-Free MIMO Under Partial CSI Condition Open Access

    Daisuke ISHII  Takanori HARA  Kenichi HIGUCHI  

     
    PAPER-Wireless Communication Technologies

      Vol:
    E107-B No:10
      Page(s):
    653-660

    In this paper, we investigate a method for clustering user equipment (UE)-specific transmission access points (APs) in downlink cell-free multiple-input multiple-output (MIMO) assuming that the APs distributed over the system coverage know only part of the instantaneous channel state information (CSI). As a beamforming (BF) method based on partial CSI, we use a layered partially non-orthogonal zero-forcing (ZF) method based on channel matrix muting, which is applicable to the case where different transmitting AP groups are selected for each UE under partial CSI conditions. We propose two AP clustering methods. Both proposed methods first tentatively determine the transmitting APs independently for each UE and then iteratively update the transmitting APs for each UE based on the estimated throughput considering the interference among the UEs. One of the two proposed methods introduces a UE cluster for each UE into the iterative updates of the transmitting APs to balance throughput performance and scalability. Computer simulations show that the proposed methods achieve higher geometric-mean and worst user throughput than those for the conventional methods.

  • Global Navigation Satellite System Signal Phase Combining and Performance of Distributed Antenna Arrays Open Access

    Wenfei GUO  Jun ZHANG  Chi GUO  Weijun FENG  

     
    PAPER-Navigation, Guidance and Control Systems

      Vol:
    E107-B No:10
      Page(s):
    670-680

    Low signal power and susceptibility to interference cause difficulties for traditional global navigation satellite system (GNSS) receivers in tracking weak signals. Extending coherent integration time is a common approach for enhancing signal gain. However, coherent integration time cannot be indefinitely increased owing to navigation bit sign transition, receiver dynamics, and clock noises. This study proposes a cross-correlation phase combining (CPC) algorithm suitable for distributed multi-antenna receivers to improve carrier-tracking performance in weak GNSS signal conditions. This algorithm cross-correlates each antenna’s intermediate frequency (IF) signal and local carrier to detect the phase differences. Subsequently, the IF signals are weighted to achieve phase alignment and coherently combined. The carrier-to-noise ratio (CNR) and carrier phase equation of the combined signal were derived for the CPC algorithm. Global positioning system (GPS) signals received by distributed antenna array with six elements were used to validate the performance of the algorithm. The results demonstrated that the CPC algorithm could effectively achieve signal phase alignment at 32 dB-Hz, resulting in a combined-signal CNR enhancement of 6 dB. The phase-tracking error variance was reduced by 72% at 30 dB-Hz compared with that of a single-antenna signal. The algorithm exhibited low phased array calibration requirements, overcoming the limitations associated with coherent integration time and effectively enhancing tracking performance in weak-signal environments.

  • Review: Noncontact Sensing of Animals Using Radar Open Access

    Takuya SAKAMOTO  Itsuki IWATA  Toshiki MINAMI  Takuya MATSUMOTO  

     
    INVITED PAPER

      Pubricized:
    2024/04/09
      Vol:
    E107-C No:10
      Page(s):
    279-287

    There has been a growing interest in the application of radar technology to the monitoring of humans and animals and their positions, motions, activities, and vital signs. Radar can be used, for example, to remotely measure vital signs such as respiration and heartbeat without contact. Radar-based human sensing is expected to be adopted in a variety of fields, such as medicine, healthcare, and entertainment, but what can be realized by radar-based animal sensing? This paper reviews the latest research trends in the noncontact sensing of animals using radar systems. We also present examples of our past radar experiments for the respiratory measurement of monkeys and the heartbeat measurement of chimpanzees. The trends in this field are reviewed in terms of the target animal species, type of vital sign, and radar type and selection of frequencies.

  • Japanese Institutionalization and Global Standardization of Wireless Power Transmission, and Recently R&D Trend in Japan Open Access

    Takuya FUJIMOTO  

     
    INVITED PAPER

      Pubricized:
    2024/04/23
      Vol:
    E107-C No:10
      Page(s):
    299-306

    In Japan, research on spatial transmission Wireless Power Transfer/Transmission (WPT) for long-distance power transmission has been conducted ahead of the rest of the world; however, until 2022, there has been no category under the Radio Law, and it has been treated as an experimental station. The authors are working on Japanese institutionalization (revision of ministerial ordinances) and global standardization of this spatial transmission WPT for social implementation. This paper describes the Japanese and international institutionalization and standardization trends. In addition, as the latest trend in R&D trends, as the next step of institutionalization, the author introduces two national projects that are being worked on by industry, academia, and government for Step 2, which can be used for a wider range of applications by relaxing the scope of use and restrictions from Step 1, which has various restrictions. The first is about the Cross-ministerial Strategic Innovation Promotion Program (SIP) Phase 2. In SIP Phase 2, we conducted R&D on “WPT system for sensor networks and mobile devices”. This R&D is research on detecting and avoiding people so that radio exposure does not exceed protection guidelines and detecting incumbent radios and avoiding harmful interference so that more power can be transmitted under coexistence conditions. The other is “Research and Development for Expansion of Radio Resources” to be conducted by the Ministry of Internal Affairs and Communications (MIC), which is scheduled for four years from FY2022. This is also a more concrete research and development project for Step 2 institutionalization, along with the results of the SIP mentioned above.

  • Comprehensive Design Approach to Switch-Mode Resonant Power Amplifiers Exploiting Geodesic-to-Geodesic Impedance Conversion Open Access

    Minoru MIZUTANI  Takashi OHIRA  

     
    PAPER

      Pubricized:
    2024/04/09
      Vol:
    E107-C No:10
      Page(s):
    307-314

    This paper presents a comprehensive design approach to load-independent radio frequency (RF) power amplifiers. We project the zero-voltage-switching (ZVS) and zero-voltage-derivative-switching (ZVDS) load impedances onto a Smith chart, and find that their loci exhibit geodesic arcs. We exploit a two-port reactive network to convert the geodesic locus into another geodesic. This is named geodesic-to-geodesic (G2G) impedance conversion, and the power amplifier that employs G2G conversion is called class-G2G amplifier. We comprehensively explore the possible circuit topologies, and find that there are twenty G2G networks to create class-G2G amplifiers. We also find out that the class-G2G amplifier behaves like a transformer or a gyrator converting from dc to RF. The G2G design theory is verified via a circuit simulation. We also verified the theory through an experiment employing a prototype 100 W amplifier at 6.78 MHz. We conclude that the presented design approach is quite comprehensive and useful for the future development of high-efficiency RF power amplifiers.

  • Load-Independent Class-E Design with Load Adjustment Circuit Inverter Considering External Quality Factor Open Access

    Akihiko ISHIWATA  Yasumasa NAKA  Masaya TAMURA  

     
    PAPER

      Pubricized:
    2024/04/09
      Vol:
    E107-C No:10
      Page(s):
    315-322

    The load-independent zero-voltage switching class-E inverter has garnered considerable interest as an essential component in wireless power transfer systems. This inverter achieves high efficiency across a broad spectrum of load conditions by incorporating a load adjustment circuit (LAC) subsequent to the resonant filter. Nevertheless, the presence of the LAC influences the output impedance of the inverter, thereby inducing a divergence between the targeted and observed output power, even in ideal lossless simulations. Consequently, iterative adjustments to component values are required via an LC element implementation. We introduce a novel design methodology that incorporates an external quality factor on the side of the resonant filter, inclusive of the LAC. Thus, the optimized circuit achieves the intended output power without necessitating alterations in component values.

  • Efficiency Enhancement of a Single-Diode Rectenna Using Harmonic Control of the Antenna Impedance Open Access

    Katsumi KAWAI  Naoki SHINOHARA  Tomohiko MITANI  

     
    PAPER

      Pubricized:
    2024/04/09
      Vol:
    E107-C No:10
      Page(s):
    323-331

    This study introduces a novel single-diode rectenna, enhancing the rf-dc conversion efficiency using harmonic control of the antenna impedance. We employ source-pull simulations encompassing the fundamental frequency and the harmonics to achieve a highly efficient rectenna. The results of the source-pull simulations delineate the source-impedance ranges required for enhanced efficiency at each harmonic. Based on the source-pull simulation results, we designed two inverted-F antenna with input impedances within and without these identified source impedance ranges. Experimental results show that the proposed rectenna has a maximum rf-dc conversion efficiency of 75.9% at the fundamental frequency of 920 MHz, an input power of 10.8 dBm, and a load resistance of 1 kΩ, which is higher than that of the comparative rectenna without harmonic control of the antenna impedance. This study demonstrates that the proposed rectenna achieves high efficiency through the direct connection of the antenna and the single diode, along with harmonic control of the antenna impedance.

  • Experimental Study on Sub-Terahertz Wideband Single-Carrier Transmitter with Pre-Equalizing Frequency Response Open Access

    Atsushi FUKUDA  Hiroto YAMAMOTO  Junya MATSUDAIRA  Sumire AOKI  Yasunori SUZUKI  

     
    PAPER

      Pubricized:
    2024/04/09
      Vol:
    E107-C No:10
      Page(s):
    332-339

    This paper proposes a novel configuration for a wideband single-carrier transmitter using a sub-terahertz frequency. For wideband single-carrier transmission over a bandwidth of several gigahertz, the frequency response non-flatness derived from transmitter components in an operating band seriously deteriorates the transmission quality due to inter-symbol interference. A promising approach to address this problem is equalizing the frequency response non-flatness at the transmitter. The proposed novel configuration has a feedback route for calculating the inverse frequency response and multiplying it with a transmission signal spectrum in the frequency domain. Moreover, we verify that employing the proposed transmitter configuration simplifies the receiver configuration by lowering the calculation complexity to minimize the inter-symbol interference to meet the signal-to-interference-and-noise ratio requirements. To confirm the feasibility of the proposed configuration, the transmission quality obtained using the proposed configuration is measured and evaluated. Experimental results confirm that the proposed configuration improves the error vector magnitude value to over 5 dB for a 10 Gbaud transmission and the transmission data rate of 25 Gbps.

  • Hybrid Precoding for mmWave Massive Beamspace MIMO System with Limited Resolution Overlapped Phase Shifters Network Open Access

    Ting DING  Jiandong ZHU  Jing YANG  Xingmeng JIANG  Chengcheng LIU  

     
    PAPER

      Pubricized:
    2024/03/25
      Vol:
    E107-C No:10
      Page(s):
    355-363

    Considering the non-convexity of hybrid precoding and the hardware constraints of practical systems, a hybrid precoding architecture, which combines limited-resolution overlapped phase shifter networks with lens array, is investigated. The analogy part is a beam selection network composed of overlapped low-resolution phase shifter networks. In particular, in the proposed hybrid precoding algorithm, the analog precoding improves array gain by utilizing the quantization beam alignment method, whereas the digital precoding schemes multiplexing gain by adopting a Wiener Filter precoding scheme with a minimum mean square error criterion. Finally, in the sparse scattering millimeter-wave channel for the uniform linear array, the proposed method is compared with the existing scheme by computer simulation by using the ideal channel state information and the non-ideal channel state information. It is concluded that the proposed scheme performs better in low signal-to-noise regions and can achieve a good compromise between system performance and hardware complexity.

  • Chaos and Synchronization - Potential Ingredients of Innovation in Analog Circuit Design? Open Access

    Ludovico MINATI  

     
    INVITED PAPER

      Pubricized:
    2024/03/11
      Vol:
    E107-C No:10
      Page(s):
    376-391

    Recent years have seen a general resurgence of interest in analog signal processing and computing architectures. In addition, extensive theoretical and experimental literature on chaos and analog chaotic oscillators exists. One peculiarity of these circuits is the ability to generate, despite their structural simplicity, complex spatiotemporal patterns when several of them are brought towards synchronization via coupling mechanisms. While by no means a systematic survey, this paper provides a personal perspective on this area. After briefly covering design aspects and the synchronization phenomena that can arise, a selection of results exemplifying potential applications is presented, including in robot control, distributed sensing, reservoir computing, and data augmentation. Despite their interesting properties, the industrial applications of these circuits remain largely to be realized, seemingly due to a variety of technical and organizational factors including a paucity of design and optimization techniques. Some reflections are given regarding this situation, the potential relevance to discontinuous innovation in analog circuit design of chaotic oscillators taken both individually and as synchronized networks, and the factors holding back the transition to higher levels of technology readiness.

  • Millimeter-Wave Transceiver Utilizing On-Chip Butler Matrix for Simultaneous 5G Relay Communication and Wireless Power Transfer Open Access

    Keito YUASA  Michihiro IDE  Sena KATO  Kenichi OKADA  Atsushi SHIRANE  

     
    PAPER

      Pubricized:
    2024/04/15
      Vol:
    E107-C No:10
      Page(s):
    408-415

    This paper introduces a wireless-powered relay transceiver designed to extend 5G millimeter-wave coverage. It employs an on-chip butler matrix, enabling beam control-free operation. The prototype includes PCB array antennas and on-chip butler matrix and rectifiers manufactured using a Si CMOS 65 nm process. The relay transceiver performs effectively in beam angles from -45° to 45°. In the 24 GHz wireless power transmission (WPT) mode, it generates 0.12 mW with 0 dBm total input power, boasting an RF-DC conversion efficiency of 12.2%. It also demonstrates communication performance at 28 GHz in both RX and TX modes with a 100 MHz bandwidth and 64QAM modulation.

  • REM-CiM: Attentional RGB-Event Fusion Multi-Modal Analog CiM for Area/Energy-Efficient Edge Object Detection during Both Day and Night Open Access

    Yuya ICHIKAWA  Ayumu YAMADA  Naoko MISAWA  Chihiro MATSUI  Ken TAKEUCHI  

     
    PAPER

      Pubricized:
    2024/04/09
      Vol:
    E107-C No:10
      Page(s):
    426-435

    Integrating RGB and event sensors improves object detection accuracy, especially during the night, due to the high-dynamic range of event camera. However, introducing an event sensor leads to an increase in computational resources, which makes the implementation of RGB-event fusion multi-modal AI to CiM difficult. To tackle this issue, this paper proposes RGB-Event fusion Multi-modal analog Computation-in-Memory (CiM), called REM-CiM, for multi-modal edge object detection AI. In REM-CiM, two proposals about multi-modal AI algorithms and circuit implementation are co-designed. First, Memory capacity-Efficient Attentional Feature Pyramid Network (MEA-FPN), the model architecture for RGB-event fusion analog CiM, is proposed for parameter-efficient RGB-event fusion. Convolution-less bi-directional calibration (C-BDC) in MEA-FPN extracts important features of each modality with attention modules, while reducing the number of weight parameters by removing large convolutional operations from conventional BDC. Proposed MEA-FPN w/ C-BDC achieves a 76% reduction of parameters while maintaining mean Average Precision (mAP) degradation to < 2.3% during both day and night, compared with Attentional FPN fusion (A-FPN), a conventional BDC-adopted FPN fusion. Second, the low-bit quantization with clipping (LQC) is proposed to reduce area/energy. Proposed REM-CiM with MEA-FPN and LQC achieves almost the same memory cells, 21% less ADC area, 24% less ADC energy and 0.17% higher mAP than conventional FPN fusion CiM without LQC.

  • 3D Parallel ReRAM Computation-in-Memory for Hyperdimensional Computing Open Access

    Fuyuki KIHARA  Chihiro MATSUI  Ken TAKEUCHI  

     
    BRIEF PAPER

      Pubricized:
    2024/04/16
      Vol:
    E107-C No:10
      Page(s):
    436-439

    In this work, we propose a 1T1R ReRAM CiM architecture for Hyperdimensional Computing (HDC). The number of Source Lines and Bit Lines is reduced by introducing memory cells that are connected in series, which is especially advantageous when using a 3D implementation. The results of CiM operations contain errors, but HDC is robust against them, so that even if the XNOR operation has an error of 25%, the inference accuracy remains above 90%.

  • MISpeller: Multimodal Information Enhancement for Chinese Spelling Correction Open Access

    Jiakai LI  Jianyong DUAN  Hao WANG  Li HE  Qing ZHANG  

     
    PAPER-Natural Language Processing

      Pubricized:
    2024/06/07
      Vol:
    E107-D No:10
      Page(s):
    1342-1352

    Chinese spelling correction is a foundational task in natural language processing that aims to detect and correct spelling errors in text. Most spelling corrections in Chinese used multimodal information to model the relationship between incorrect and correct characters. However, feature information mismatch occured during fusion result from the different sources of features, causing the importance relationships between different modalities to be ignored, which in turn restricted the model from learning in an efficient manner. To this end, this paper proposes a multimodal language model-based Chinese spelling corrector, named as MISpeller. The method, based on ChineseBERT as the basic model, allows the comprehensive capture and fusion of character semantic information, phonetic information and graphic information in a single model without the need to construct additional neural networks, and realises the phenomenon of unequal fusion of multi-feature information. In addition, in order to solve the overcorrection issues, the replication mechanism is further introduced, and the replication factor is used as the dynamic weight to efficiently fuse the multimodal information. The model is able to control the proportion of original characters and predicted characters according to different input texts, and it can learn more specifically where errors occur. Experiments conducted on the SIGHAN benchmark show that the proposed model achieves the state-of-the-art performance of the F1 score at the correction level by an average of 4.36%, which validates the effectiveness of the model.

  • Dispersion in a Polygon Open Access

    Tetsuya ARAKI  Shin-ichi NAKANO  

     
    PAPER-Algorithms and Data Structures

      Pubricized:
    2024/03/11
      Vol:
    E107-A No:9
      Page(s):
    1458-1464

    The dispersion problem is a variant of facility location problems, that has been extensively studied. Given a polygon with n edges on a plane we want to find k points in the polygon so that the minimum pairwise Euclidean distance of the k points is maximized. We call the problem the k-dispersion problem in a polygon. Intuitively, for an island, we want to locate k drone bases far away from each other in flying distance to avoid congestion in the sky. In this paper, we give a polynomial-time approximation scheme (PTAS) for this problem when k is a constant and ε < 1 (where ε is a positive real number). Our proposed algorithm runs in O(((1/ε)2 + n/ε)k) time with 1/(1 + ε) approximation, the first PTAS developed for this problem. Additionally, we consider three variations of the dispersion problem and design a PTAS for each of them.

  • Outsider-Anonymous Broadcast Encryption with Keyword Search: Generic Construction, CCA Security, and with Sublinear Ciphertexts Open Access

    Keita EMURA  Kaisei KAJITA  Go OHTAKE  

     
    PAPER-Cryptography and Information Security

      Pubricized:
    2024/02/26
      Vol:
    E107-A No:9
      Page(s):
    1465-1477

    As a multi-receiver variant of public key encryption with keyword search (PEKS), broadcast encryption with keyword search (BEKS) has been proposed (Attrapadung et al. at ASIACRYPT 2006/Chatterjee-Mukherjee at INDOCRYPT 2018). Unlike broadcast encryption, no receiver anonymity is considered because the test algorithm takes a set of receivers as input and thus a set of receivers needs to be contained in a ciphertext. In this paper, we propose a generic construction of BEKS from anonymous and weakly robust 3-level hierarchical identity-based encryption (HIBE). The proposed generic construction provides outsider anonymity, where an adversary is allowed to obtain secret keys of outsiders who do not belong to the challenge sets, and provides sublinear-size ciphertext in terms of the number of receivers. Moreover, the proposed construction considers security against chosen-ciphertext attack (CCA) where an adversary is allowed to access a test oracle in the searchable encryption context. The proposed generic construction can be seen as an extension to the Fazio-Perera generic construction of anonymous broadcast encryption (PKC 2012) from anonymous and weakly robust identity-based encryption (IBE) and the Boneh et al. generic construction of PEKS (EUROCRYPT 2004) from anonymous IBE. We run the Fazio-Perera construction employs on the first-level identity and run the Boneh et al. generic construction on the second-level identity, i.e., a keyword is regarded as a second-level identity. The third-level identity is used for providing CCA security by employing one-time signatures. We also introduce weak robustness in the HIBE setting, and demonstrate that the Abdalla et al. generic transformation (TCC 2010/JoC 2018) for providing weak robustness to IBE works for HIBE with an appropriate parameter setting. We also explicitly introduce attractive concrete instantiations of the proposed generic construction from pairings and lattices, respectively.

  • Quantum Collision Resistance of Double-Block-Length Hashing Open Access

    Shoichi HIROSE  Hidenori KUWAKADO  

     
    PAPER-Cryptography and Information Security

      Pubricized:
    2024/03/04
      Vol:
    E107-A No:9
      Page(s):
    1478-1487

    In 2005, Nandi introduced a class of double-block-length compression functions hπ(x) := (h(x), h(π(x))), where h is a random oracle with an n-bit output and π is a non-cryptographic public permutation. Nandi demonstrated that the collision resistance of hπ is optimal if π has no fixed point in the classical setting. Our study explores the collision resistance of hπ and the Merkle-Damgård hash function using hπ in the quantum random oracle model. Firstly, we reveal that the quantum collision resistance of hπ may not be optimal even if π has no fixed point. If π is an involution, then a colliding pair of inputs can be found for hπ with only O(2n/2) queries by the Grover search. Secondly, we present a sufficient condition on π for the optimal quantum collision resistance of hπ. This condition states that any collision attack needs Ω(22n/3) queries to find a colliding pair of inputs. The proof uses the recent technique of Zhandry’s compressed oracle. Thirdly, we show that the quantum collision resistance of the Merkle-Damgård hash function using hπ can be optimal even if π is an involution. Finally, we discuss the quantum collision resistance of double-block-length compression functions using a block cipher.

  • Color Correction Method Considering Hue Information for Dichromats Open Access

    Shi BAO  Xiaoyan SONG  Xufei ZHUANG  Min LU  Gao LE  

     
    PAPER-Image

      Pubricized:
    2024/04/22
      Vol:
    E107-A No:9
      Page(s):
    1496-1508

    Images with rich color information are an important source of information that people obtain from the objective world. Occasionally, it is difficult for people with red-green color vision deficiencies to obtain color information from color images. We propose a method of color correction for dichromats based on the physiological characteristics of dichromats, considering hue information. First, the hue loss of color pairs under normal color vision was defined, an objective function was constructed on its basis, and the resultant image was obtained by minimizing it. Finally, the effectiveness of the proposed method is verified through comparison tests. Red-green color vision deficient people fail to distinguish between partial red and green colors. When the red and green connecting lines are parallel to the a* axis of CIE L*a*b*, red and green perception defectives cannot distinguish the color pair, but can distinguish the color pair parallel to the b* axis. Therefore, when two colors are parallel to the a* axis, their color correction yields good results. When color correction is performed on a color, the hue loss between the two colors under normal color vision is supplemented with b* so that red-green color vision-deficient individuals can distinguish the color difference between the color pairs. The magnitude of the correction is greatest when the connecting lines of the color pairs are parallel to the a* axis, and no color correction is applied when the connecting lines are parallel to the b* axis. The objective evaluation results show that the method achieves a higher score, indicating that the proposed method can maintain the naturalness of the image while reducing confusing colors.

  • Deep Learning-Inspired Automatic Minutiae Extraction from Semi-Automated Annotations Open Access

    Hongtian ZHAO  Hua YANG  Shibao ZHENG  

     
    PAPER-Vision

      Pubricized:
    2024/04/05
      Vol:
    E107-A No:9
      Page(s):
    1509-1521

    Minutiae pattern extraction plays a crucial role in fingerprint registration and identification for electronic applications. However, the extraction accuracy is seriously compromised by the presence of contaminated ridge lines and complex background scenarios. General image processing-based methods, which rely on many prior hypotheses, fail to effectively handle minutiae extraction in complex scenarios. Previous works have shown that CNN-based methods can perform well in object detection tasks. However, the deep neural networks (DNNs)-based methods are restricted by the limitation of public labeled datasets due to legitimate privacy concerns. To address these challenges comprehensively, this paper presents a fully automated minutiae extraction method leveraging DNNs. Firstly, we create a fingerprint minutiae dataset using a semi-automated minutiae annotation algorithm. Subsequently, we propose a minutiae extraction model based on Residual Networks (Resnet) that enables end-to-end prediction of minutiae. Moreover, we introduce a novel non-maximal suppression (NMS) procedure, guided by the Generalized Intersection over Union (GIoU) metric, during the inference phase to effectively handle outliers. Experimental evaluations conducted on the NIST SD4 and FVC 2004 databases demonstrate the superiority of the proposed method over existing state-of-the-art minutiae extraction approaches.

  • DETrack: Multi-Object Tracking Algorithm Based on Feature Decomposition and Feature Enhancement Open Access

    Feng WEN  Haixin HUANG  Xiangyang YIN  Junguang MA  Xiaojie HU  

     
    PAPER-Neural Networks and Bioengineering

      Pubricized:
    2024/04/22
      Vol:
    E107-A No:9
      Page(s):
    1522-1533

    Multi-object tracking (MOT) algorithms are typically classified as one-shot or two-step algorithms. The one-shot MOT algorithm is widely studied and applied due to its fast inference speed. However, one-shot algorithms include two sub-tasks of detection and re-ID, which have conflicting directions for model optimization, thus limiting tracking performance. Additionally, MOT algorithms often suffer from serious ID switching issues, which can negatively affect the tracking effect. To address these challenges, this study proposes the DETrack algorithm, which consists of feature decomposition and feature enhancement modules. The feature decomposition module can effectively exploit the differences and correlations of different tasks to solve the conflict problem. Moreover, it can effectively mitigate the competition between the detection and re-ID tasks, while simultaneously enhancing their cooperation. The feature enhancement module can improve feature quality and alleviate the problem of target ID switching. Experimental results demonstrate that DETrack has achieved improvements in multi-object tracking performance, while reducing the number of ID switching. The designed method of feature decomposition and feature enhancement can significantly enhance target tracking effectiveness.

21-40hit(16348hit)