IEICE TRANSACTIONS on Information

Impact Factor: 0.59
Eigenfactor: 0.002
Article Influence: 0.1
CiteScore: 1.4

Advance publication (published online immediately after acceptance)

Lightweight Neural Data Sequence Modeling by Scale Causal Blocks
Hiroaki AKUTSU Ko ARAI

Publicized:
2024/11/08
PAPER
- Summary
- Free PDF (1.6MB)
Recaptured Image Detection Based on Multi-Scale Residual Features of Discriminative Regions
Lanxi LIU Pengpeng YANG Suwen DU Sani M. ABDULLAHI

Publicized:
2024/11/08
PAPER
- Summary
- Free PDF (5.9MB)
Learn Discriminative Features for Small Object Detection through Multi-scale Image Degradation with Contrastive Learning
Xiaoguang TU Zhi HE Gui FU Jianhua LIU Mian ZHONG Chao ZHOU Xia LEI Juhang YIN Yi HUANG Yu WANG

Publicized:
2024/11/05
PAPER
- Summary
- Free PDF (1.4MB)
Joint Distribution-Aligned Dual-Sparse Linear Regression for Cross-Stimulus Speech-Based Depression Detection
Yingying LU Cheng LU Yuan ZONG Feng ZHOU Chuangao TANG

Publicized:
2024/11/01
LETTER
- Summary
- Free PDF (229.4KB)
Multi-grained Guaranteeable Requirement Analysis for Iterative Adaptation
Jialong LI Takuto YAMAUCHI Takanori HIRANO Jinyu CAI Kenji TEI

Publicized:
2024/10/31
PAPER
- Summary
- Free PDF (567.6KB)
A fully digital transmitting-receiving platform for MIMO radar waveform diversity experiment
Wei LEI Yue ZHANG Hanfeng XIE Zebin CHEN Zengping CHEN Weixing LI

Publicized:
2024/10/30
PAPER
- Summary
- Free PDF (7MB)
Leveraging Different Boolean Function Decompositions to Reduce T-Count in LUT-based Quantum Circuit Synthesis
David CLARINO Naoya ASADA Atsushi MATSUO Shigeru YAMASHITA

Publicized:
2024/10/30
PAPER
- Summary
- Free PDF (1.3MB)
Criticality and Tolerance in Injection Timing in Cup-Stacking Method for Collective Communication
Takashi YOKOTA Kanemitsu OOTSU

Publicized:
2024/10/28
PAPER
- Summary
- Free PDF (1.5MB)
An anchor-free Siamese tracker with multi-attention and corner detection mechanism
Xiaokang Jin Benben Huang Hao Sheng Yao Wu

Publicized:
2024/10/28
PAPER
- Summary
- Free PDF (3MB)
Effect of Politeness on Trust in Re-enter Requests to User by Smart Speaker -Pilot Study-
Tomoki MIYAMOTO

Publicized:
2024/10/23
LETTER
- Summary
- Free PDF (2.2MB)
Fine-tuning Models for Final Disagreement Anticipation in Negotiation Mid-Dialogues
Ken WATANABE Katsuhide FUJITA

Publicized:
2024/10/10
PAPER
- Summary
- Free PDF (3.8MB)
Deepfake speech detection: approaches from acoustic features related to auditory perception to deep neural networks
Masashi UNOKI Kai LI Anuwat CHAIWONGYEN Quoc-Huy NGUYEN Khalid ZAMAN

Publicized:
2024/10/07
INVITED PAPER
- Summary
- Free PDF (965KB)
Video Watermarking Method Based on 3D U-Net Robust Against Re-shooting
Takaharu TSUBOYAMA Ryota TAKAHASHI Motoi IWATA Koichi KISE

Publicized:
2024/10/07
PAPER
- Summary
- Free PDF (4MB)
UTStyleCap4K: Generating Image Captions with Sentimental Styles
Chi ZHANG Li TAO Toshihiko YAMASAKI

Publicized:
2024/10/02
PAPER
- Summary
- Free PDF (2.5MB)
FP-GNN: A Graph Neural Network for Hardware Trojan Detection in Gate-Level Netlist
Ann Jelyn TIEMPO Yong-Jin JEONG

Publicized:
2024/10/01
LETTER
- Summary
- Free PDF (532.3KB)
Adaptive Merge Candidate Selection based on Geometric Partitioning Mode beyond Versatile Video Coding
Haruhisa KATO Yoshitaka KIDANI Kei KAWAMURA

Publicized:
2024/09/24
PAPER
- Summary
- Free PDF (4MB)
A Multi-Agent Deep Reinforcement Learning Algorithm for Task offloading in future 6G V2X Network
Jiakun LI Jiajian LI Yanjun SHI Hui LIAN Haifan WU

Publicized:
2024/09/24
PAPER
- Summary
- Free PDF (1.2MB)
Dalio: In-Kernel Centralized Replication for Key-Value Stores
Gyuyeong KIM

Publicized:
2024/09/20
LETTER
- Summary
- Free PDF (138.7KB)
Detecting Textual Backdoor Attacks via Class Difference for Text Classification System
Hyun KWON Jun LEE

Publicized:
2024/09/19
PAPER
- Summary
- Free PDF (680.5KB)
D2PT: Density to Point Transformer with Knowledge Distillation for Crowd Counting and Localization
Fan LI Enze YANG Chao LI Shuoyan LIU Haodong WANG

Publicized:
2024/09/17
LETTER
- Summary
- Free PDF (1.8MB)
Incremental learning for network traffic classification using generative adversarial networks
Guangjin Ouyang Yong Guo Yu Lu Fang He

Publicized:
2024/09/13
PAPER
- Summary
- Free PDF (1.3MB)
Multi-Scale Rail Surface Anomaly Detection Based on Weighted Multivariate Gaussian Distribution
Yuyao LIU Qingyong LI Shi BAO Wen WANG

Publicized:
2024/09/12
PAPER
- Summary
- Free PDF (7.8MB)
BP-CRN: A Lightweight Two-Stage Convolutional Recurrent Network For Multi-channel Speech Enhancement
Cong PANG Ye NI Jia Ming CHENG Lin ZHOU Li ZHAO

Publicized:
2024/09/10
LETTER
- Summary
- Free PDF (2.6MB)
Building Defect Prediction Models by Online Learning Considering Defect Overlooking
Nikolay FEDOROV Yuta YAMASAKI Masateru TSUNODA Akito MONDEN Amjed TAHIR Kwabena Ebo BENNIN Koji TODA Keitaro NAKASAI

Publicized:
2024/09/09
LETTER
- Summary
- Free PDF (96.5KB)
The Impact of Defect (Re) Prediction on Software Testing
Yukasa MURAKAMI Yuta YAMASAKI Masateru TSUNODA Akito MONDEN Amjed TAHIR Kwabena Ebo BENNIN Koji TODA Keitaro NAKASAI

Publicized:
2024/09/09
LETTER
- Summary
- Free PDF (148.4KB)
Deterministic and Probabilistic Certified Defenses for Content-Based Image Retrieval
Kazuya KAKIZAKI Kazuto FUKUCHI Jun SAKUMA

Publicized:
2024/09/05
PAPER
- Summary
- Free PDF (3.4MB)
Fault-tolerant Routing in Bicubes
Yitong WANG Htoo Htoo Sandi KYAW Kunihiro FUJIYOSHI Keiichi KANEKO

Publicized:
2024/09/05
PAPER
- Summary
- Free PDF (759.2KB)
Integrating Cyber-Physical Modeling for Pandemic Surveillance: A Graph-Based Approach for Disease Hotspot Prediction and Public Awareness
Waqas NAWAZ Muhammad UZAIR Kifayat ULLAH KHAN Iram FATIMA

Publicized:
2024/08/29
PAPER
- Summary
- Free PDF (2.2MB)
Real-time Interactions with Photos and Texts in Large Classrooms
Haeyoung Lee

Publicized:
2024/08/28
LETTER
- Summary
- Free PDF (372.3KB)
CNN-based feature integration network for speech enhancement in microphone arrays
Ji XI Pengxu JIANG Yue XIE Wei JIANG Hao DING

Publicized:
2024/08/26
LETTER
- Summary
- Free PDF (2MB)
Partial Enhancement and Channel Aggregation for Visible-Infrared Person Re-Identification
Weiwei JING Zhonghua LI

Publicized:
2024/08/26
PAPER
- Summary
- Free PDF (1.4MB)
Practical APT Group Hash Unit Profiling Framework Using TTPs
Sena LEE Chaeyoung KIM Hoorin PARK

Publicized:
2024/08/20
LETTER
- Summary
- Free PDF (716.5KB)
Bilaterally Colored Finite Automata and Bilaterally Colored Regular Expressions
Akira ITO Yoshiaki TAKAHASHI

Publicized:
2024/08/20
PAPER
- Summary
- Free PDF (652.7KB)
Strategies and Equilibria on Indistinguishability of Winning Objectives and Related Decision Problems
Rindo NAKANISHI Yoshiaki TAKATA Hiroyuki SEKI

Publicized:
2024/08/20
PAPER
- Summary
- Free PDF (1.5MB)
Computational Complexity of Yajisan-Kazusan and Stained Glass
Chuzo IWAMOTO Ryo TAKAISHI

Publicized:
2024/08/16
PAPER
- Summary
- Free PDF (682.2KB)
A clustering-based deep learning method for water level prediction
Chih-Ping Wang Duen-Ren Liu

Publicized:
2024/08/14
LETTER
- Summary
- Free PDF (778.9KB)
Stochastic Dual Coordinate Ascent for Learning Sign Constrained Linear Predictors
Yuya TAKADA Rikuto MOCHIDA Miya NAKAJIMA Syun-suke KADOYA Daisuke SANO Tsuyoshi KATO

Publicized:
2024/08/08
PAPER
- Summary
- Free PDF (483.5KB)
Multi-dimensional and Multi-task Facial Expression Recognition for Academic Outcomes Prediction
Yi Huo Yun Ge

Publicized:
2024/08/08
LETTER
- Summary
- Free PDF (458.2KB)
Mixup SVM Learning for Compound Toxicity Prediction Using Human Pluripotent Stem Cells
Rikuto MOCHIDA Miya NAKAJIMA Haruki ONO Takahiro ANDO Tsuyoshi KATO

Publicized:
2024/08/08
LETTER
- Summary
- Free PDF (170.9KB)
A Bigram Based ILP Formulation for Break Minimization in Sports Scheduling Problems
Koichi FUJII Tomomi MATSUI

Publicized:
2024/08/08
PAPER
- Summary
- Free PDF (1MB)
Dendritic Learning-based Feature Fusion for Deep Networks
Yaotong SONG Zhipeng LIU Zhiming ZHANG Jun TANG Zhenyu LEI Shangce GAO

Publicized:
2024/08/07
LETTER
- Summary
- Free PDF (289.3KB)
Applying Run-Length Compression to the Configuration Data of SLM Fine-Grained Reconfigurable Logic
Souhei TAKAGI Takuya KOJIMA Hideharu AMANO Morihiro KUGA Masahiro IIDA

Publicized:
2024/08/07
PAPER
- Summary
- Free PDF (2.6MB)
Imperceptible Trojan Attacks to the Graph-based Big Data Processing in Smart Society
Jun ZHOU Masaaki KONDO

Publicized:
2024/08/07
PAPER
- Summary
- Free PDF (825.5KB)
Feasibility Study of Applying Spatial Crowd Smoothing Without Economic Incentives on Ticket Reservation System that Applies Nudges
Tetsuya MANABE Wataru UNUMA

Publicized:
2024/08/05
PAPER
- Summary
- Free PDF (1.3MB)
(15/14)n Flips are (almost) Sufficient to Sort Heydari and Sudborough's Pancake Stack
Kazuyuki AMANO

Publicized:
2024/08/05
LETTER
- Summary
- Free PDF (288.7KB)
Overlapping of Lattice Unfolding for Cuboids
Takumi SHIOTA Tonan KAMATA Ryuhei UEHARA

Publicized:
2024/08/05
PAPER
- Summary
- Free PDF (722.9KB)
An FPT Algorithm for the Exact Matching Problem and NP-hardness of Related Problems
Hitoshi MURAKAMI Yutaro YAMAGUCHI

Publicized:
2024/08/01
PAPER
- Summary
- Free PDF (704.5KB)
Recognition of Vibration Dampers Based on Deep Learning Method in UAV Images
Jingjing Liu Chuanyang Liu Yiquan Wu Zuo Sun

Publicized:
2024/07/30
PAPER
- Summary
- Free PDF (3.2MB)
Temporal correlation-based end-to-end rate control in DCVC
Zhenglong YANG Weihao DENG Guozhong WANG Tao FAN Yixi LUO

Publicized:
2024/07/29
LETTER
- Summary
- Free PDF (574.4KB)
A Subclass of Mu-Calculus with the Freeze Quantifier Equivalent to Büchi Register Automata
Yoshiaki TAKATA Akira ONISHI Ryoma SENDA Hiroyuki SEKI

Publicized:
2024/07/26
LETTER
- Summary
- Free PDF (137.6KB)
Degraded image classification using knowledge distillation and robust data augmentations
Dinesh DAULTANI Masayuki TANAKA Masatoshi OKUTOMI Kazuki ENDO

Publicized:
2024/07/26
PAPER
- Summary
- Free PDF (4.4MB)
Escape from the Room
Kento KIMURA Tomohiro HARAMIISHI Kazuyuki AMANO Shin-ichi NAKANO

Publicized:
2024/07/11
PAPER
- Summary
- Free PDF (1.1MB)
Online combinatorial linear optimization via a Frank-Wolfe-based metarounding algorithm
Ryotaro MITSUBOSHI Kohei HATANO Eiji TAKIMOTO

Publicized:
2024/07/11
PAPER
- Summary
- Free PDF (1.1MB)
A Flip-count-based Dynamic Temperature Control Method for Constrained Combinatorial Optimization by Parallel Annealing Algorithms
Genta INOUE Daiki OKONOGI Satoru JIMBO Thiem Van CHU Masato MOTOMURA Kazushi KAWAMURA

Publicized:
2024/07/11
PAPER
- Summary
- Free PDF (1.7MB)
Performance evaluation of CAIN model frame interpolation using training data limited by fixed camera scene detection
Hikaru USAMI Yusuke KAMEDA

Publicized:
2024/07/11
LETTER
- Summary
- Free PDF (708.9KB)
Towards Superior Pruning Performance in Federated Learning with Discriminative Data
Yinan YANG

Publicized:
2024/06/27
- Summary
- Free PDF (7.9MB)
Design and implementation of opto-electrical hybrid floating-point multipliers
Takumi INABA Takatsugu ONO Koji INOUE Satoshi KAWAKAMI

Publicized:
2024/06/26
- Summary
- Free PDF (2.5MB)
HDR-VDA: A Full Stage Data Augmentation Method for HDR Video Reconstruction
Fengshan ZHAO Qin LIU Takeshi IKENAGA

Publicized:
2024/06/17
- Summary
- Free PDF (1.2MB)
Space-efficient FPT Algorithms for Degeneracy
Naohito MATSUMOTO Kazuhiro KURITA Masashi KIYOMI

Publicized:
2024/05/31
- Summary
- Free PDF (101.3KB)
The Least Core of Routing Game Without Triangle Inequality
Tomohiro KOBAYASHI Tomomi MATSUI

Publicized:
2024/05/30
- Summary
- Free PDF (232.9KB)
Enumerating floorplans with Aligned Columns
Shin-ichi NAKANO

Publicized:
2024/05/30
- Summary
- Free PDF (365.4KB)
An IP Core Protection Scheme Based on Hybrid Lightweight Encryption for Neuromorphic Computing System
Ming PAN

The article processing charge of this paper has not been paid.

Publicized:
2022/09/14
- Summary

Whole issue (99.9MB)

Volume E107-D No.7 (Publication Date:2024/07/01)

Regular Section

A VVC Dependent Quantization Optimization Based on the Parallel Viterbi Algorithm and Its FPGA Implementation Open Access
Qinghua SHENG Yu CHENG Xiaofang HUANG Changcai LAI Xiaofeng HUANG Haibin YIN

PAPER-Computer System

Publicized:
2024/03/04
Page(s):
797-806
Dependent Quantization (DQ) is a new quantization tool introduced in the Versatile Video Coding (VVC) standard. While it provides better rate-distortion calculation accuracy, it also increases the computational complexity and hardware cost compared to the widely used scalar quantization. To address this issue, this paper proposes a parallel-dependent quantization hardware architecture implemented in Verilog HDL. The architecture preprocesses the coefficients with a scalar quantizer and a high-frequency filter, and then further segments and processes the coefficients in parallel using the Viterbi algorithm. Additionally, the weight bit width of the rate-distortion calculation is reduced to decrease the quantization cycle and computational complexity. The final quantization of the TU is then determined by sequentially scanning and comparing the rate-distortion costs. Experimental results show that the proposed algorithm reduces the quantization cycle by an average of 56.96% compared to VVC’s reference platform VTM, with a Bjøntegaard delta bit rate (BDBR) loss of 1.03% and 1.05% under the Low-delay P and Random Access configurations, respectively. Verification on the AMD FPGA development platform demonstrates that the hardware implementation meets the quantization requirements for 1080P@60Hz video hardware encoding.
Understanding Characteristics of Phishing Reports from Experts and Non-Experts on Twitter Open Access
Hiroki NAKANO Daiki CHIBA Takashi KOIDE Naoki FUKUSHI Takeshi YAGI Takeo HARIU Katsunari YOSHIOKA Tsutomu MATSUMOTO

PAPER-Information Network

Publicized:
2024/03/01
Page(s):
807-824
The increase in phishing attacks through email and short message service (SMS) has shown no signs of deceleration. The first thing we need to do to combat the ever-increasing number of phishing attacks is to collect and characterize more phishing cases that reach end users. Without understanding these characteristics, anti-phishing countermeasures cannot evolve. In this study, we propose an approach using Twitter as a new observation point to immediately collect and characterize phishing cases via e-mail and SMS that evade countermeasures and reach users. Specifically, we propose CrowdCanary, a system capable of structurally and accurately extracting phishing information (e.g., URLs and domains) from tweets about phishing by users who have actually discovered or encountered it. In our three months of live operation, CrowdCanary identified 35,432 phishing URLs out of 38,935 phishing reports. We confirmed that 31,960 (90.2%) of these phishing URLs were later detected by the anti-virus engine, demonstrating that CrowdCanary is superior to existing systems in both accuracy and volume of threat extraction. We also analyzed users who shared phishing threats by utilizing the extracted phishing URLs and categorized them into two distinct groups - namely, experts and non-experts. As a result, we found that CrowdCanary could collect information that is specifically included in non-expert reports, such as information shared only by the company brand name in the tweet, information about phishing attacks that we find only in the image of the tweet, and information about the landing page before the redirect. Furthermore, we conducted a detailed analysis of the collected information on phishing sites and discovered that certain biases exist in the domain names and hosting servers of phishing sites, revealing new characteristics useful for unknown phishing site detection.
Research on the Switch Migration Strategy Based on Global Optimization Open Access
Xiao’an BAO Shifan ZHOU Biao WU Xiaomei TU Yuting JIN Qingqi ZHANG Na ZHANG

PAPER-Information Network

Publicized:
2024/03/25
Page(s):
825-834
With the popularization of software defined networks, switch migration as an important network management strategy has attracted increasing attention. Most existing switch migration strategies only consider local conditions and simple load thresholds, without fully considering the overall optimization and dynamics of the network. Therefore, this article proposes a switch migration algorithm based on global optimization. This algorithm adds a load prediction module to the migration model, determines the migration controller, and uses an improved whale optimization algorithm to determine the target controller and its surrounding controller set. Based on the load status of the controller and the traffic priority of the switch to be migrated, the optimal migration switch set is determined. The experimental results show that compared to existing schemes, the algorithm proposed in this paper improves the average flow processing efficiency by 15% to 40%, reduces switch migration times, and enhances the security of the controller.
VH-YOLOv5s: Detecting the Skin Color of Plectropomus leopardus in Aquaculture Using Mobile Phones Open Access
Beibei LI Xun RAN Yiran LIU Wensheng LI Qingling DUAN

PAPER-Artificial Intelligence, Data Mining

Publicized:
2024/03/04
Page(s):
835-844
Fish skin color detection plays a critical role in aquaculture. However, challenges arise from image color cast and the limited dataset, impacting the accuracy of the skin color detection process. To address these issues, we proposed a novel fish skin color detection method, termed VH-YOLOv5s. Specifically, we constructed a dataset for fish skin color detection to tackle the limitation posed by the scarcity of available datasets. Additionally, we proposed a Variance Gray World Algorithm (VGWA) to correct the image color cast. Moreover, the designed Hybrid Spatial Pyramid Pooling (HSPP) module effectively performs multi-scale feature fusion, thereby enhancing the feature representation capability. Extensive experiments have demonstrated that VH-YOLOv5s achieves excellent detection results on the Plectropomus leopardus skin color dataset, with a precision of 91.7%, recall of 90.1%, mAP@0.5 of 95.2%, and mAP@0.5:0.95 of 57.5%. When compared to other models such as Centernet, AutoAssign, and YOLOX-s, VH-YOLOv5s exhibits superior detection performance, surpassing them by 2.5%, 1.8%, and 1.7%, respectively. Furthermore, our model can be deployed directly on mobile phones, making it highly suitable for practical applications.
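The abstract above does not specify the Variance Gray World Algorithm (VGWA). As background only, the classic Gray World correction that its name suggests it extends can be sketched as follows. The function name `gray_world` and the flat list-of-pixels representation are assumptions made for illustration; this is not the authors' implementation.

```python
def gray_world(pixels):
    """Classic Gray World white balance (illustrative, not the paper's VGWA).

    pixels: list of (r, g, b) tuples with values in 0..255.
    Returns a new list with each channel scaled so that all three
    channel means move toward the same gray level.
    """
    n = len(pixels)
    if n == 0:
        return []
    # Mean intensity of each color channel.
    means = [sum(p[c] for p in pixels) / n for c in range(3)]
    # Target gray level: average of the three channel means.
    gray = sum(means) / 3.0
    # Per-channel gain; leave an all-zero channel unchanged.
    gains = [gray / m if m > 0 else 1.0 for m in means]
    return [
        tuple(min(255, max(0, round(p[c] * gains[c]))) for c in range(3))
        for p in pixels
    ]


if __name__ == "__main__":
    # A small reddish sample: the red channel mean is twice the others.
    print(gray_world([(200, 100, 100), (100, 50, 50)]))
```

Gray World assumes the average scene color is neutral, which is why a color cast shows up as unequal channel means that per-channel gains can remove.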
Power Peak Load Forecasting Based on Deep Time Series Analysis Method Open Access
Ying-Chang HUNG Duen-Ren LIU

PAPER-Artificial Intelligence, Data Mining

Publicized:
2024/03/21
Page(s):
845-856
The prediction of peak power load is a critical factor directly impacting the stability of power supply, characterized significantly by its time series nature and intricate ties to the seasonal patterns in electricity usage. Despite its crucial importance, the current landscape of power peak load forecasting remains a multifaceted challenge in the field. This study aims to contribute to this domain by proposing a method that leverages a combination of three primary models - the GRU model, self-attention mechanism, and Transformer mechanism - to forecast peak power load. To contextualize this research within the ongoing discourse, it’s essential to consider the evolving methodologies and advancements in power peak load forecasting. By delving into additional references addressing the complexities and current state of the power peak load forecasting problem, this study aims to build upon the existing knowledge base and offer insights into contemporary challenges and strategies adopted within the field. Data preprocessing in this study involves comprehensive cleaning, standardization, and the design of relevant functions to ensure robustness in the predictive modeling process. Additionally, recognizing the necessity to capture temporal changes effectively, this research incorporates features such as “Weekly Moving Average” and “Monthly Moving Average” into the dataset. To evaluate the proposed methodologies comprehensively, this study conducts comparative analyses with established models such as LSTM, Self-attention network, Transformer, ARIMA, and SVR. The outcomes reveal that the models proposed in this study exhibit superior predictive performance compared to these established models, showcasing their effectiveness in accurately forecasting electricity consumption. The significance of this research lies in two primary contributions. 
Firstly, it introduces an innovative prediction method combining the GRU model, self-attention mechanism, and Transformer mechanism, aligning with the contemporary evolution of predictive modeling techniques in the field. Secondly, it introduces and emphasizes the utility of “Weekly Moving Average” and “Monthly Moving Average” methodologies, crucial in effectively capturing and interpreting seasonal variations within the dataset. By incorporating these features, this study enhances the model’s ability to account for seasonal influencing factors, thereby significantly improving the accuracy of peak power load forecasting. This contribution aligns with the ongoing efforts to refine forecasting methodologies and addresses the pertinent challenges within power peak load forecasting.
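The "Weekly Moving Average" and "Monthly Moving Average" features mentioned in the abstract above could be computed roughly as follows. This is a minimal sketch that assumes one load value per day and windows of 7 and 30 days; the abstract states neither, and the function and key names are hypothetical.

```python
def trailing_mean(values, window):
    """Mean of each value and up to window-1 values before it."""
    out = []
    running = 0.0
    for i, v in enumerate(values):
        running += v
        if i >= window:
            # Drop the value that just left the window.
            running -= values[i - window]
        out.append(running / min(i + 1, window))
    return out


def add_moving_average_features(daily_peak_load):
    """Attach assumed 7-day and 30-day trailing means to each daily value."""
    weekly = trailing_mean(daily_peak_load, 7)    # assumed weekly window
    monthly = trailing_mean(daily_peak_load, 30)  # assumed monthly window
    return [
        {"load": x, "weekly_ma": w, "monthly_ma": m}
        for x, w, m in zip(daily_peak_load, weekly, monthly)
    ]


if __name__ == "__main__":
    for row in add_moving_average_features([100.0, 120.0, 110.0, 130.0])[:4]:
        print(row)
```

When such features feed a forecaster, they would normally be shifted by one step so that the average for a given day uses only earlier days and does not leak the target value.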
Conflict Management Method Based on a New Belief Divergence in Evidence Theory Open Access
Zhu YIN Xiaojian MA Hang WANG

PAPER-Office Information Systems, e-Business Modeling

Publicized:
2024/03/01
Page(s):
857-868
Highly conflicting evidence, which may lead to counter-intuitive results, is one of the challenges for information fusion in Dempster-Shafer evidence theory. To deal with this issue, evidence conflict is investigated based on belief divergence, which measures the discrepancy between pieces of evidence. In this paper, the pignistic probability transform belief χ² divergence, named the BBχ² divergence, is proposed. By introducing the pignistic probability transform, the proposed BBχ² divergence can accurately quantify the difference between pieces of evidence while taking multi-element sets into account. Compared with several existing belief divergences, the novel divergence is more precise. Based on this divergence, a new multi-source information fusion method is devised. The proposed method considers both credibility weights and information volume weights to determine the overall weight of each piece of evidence. Finally, the proposed method is applied to target recognition and fault diagnosis, where comparative analysis indicates that it achieves the highest accuracy in managing evidence conflict.
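The pignistic probability transform named in the abstract above is a standard construction in evidence theory: each focal set shares its mass equally among its elements. The sketch below illustrates only that transform. The BBχ² divergence itself is not defined in the abstract and is not reproduced here, and the function name and dict-of-frozensets representation are assumptions for illustration.

```python
def pignistic(bpa):
    """Pignistic probability transform of a basic probability assignment.

    bpa: dict mapping frozenset of hypotheses -> mass (masses sum to 1).
    Returns a dict mapping each single hypothesis -> pignistic probability.
    """
    # Mass on the empty set is normalized away.
    norm = 1.0 - bpa.get(frozenset(), 0.0)
    if norm <= 0.0:
        raise ValueError("all mass is assigned to the empty set")
    betp = {}
    for focal, mass in bpa.items():
        if not focal:
            continue
        # Split the mass of a multi-element set evenly over its members.
        share = mass / (len(focal) * norm)
        for hypothesis in focal:
            betp[hypothesis] = betp.get(hypothesis, 0.0) + share
    return betp


if __name__ == "__main__":
    m = {
        frozenset({"A"}): 0.5,
        frozenset({"A", "B"}): 0.3,
        frozenset({"A", "B", "C"}): 0.2,
    }
    print(pignistic(m))
```

Because the result is an ordinary probability distribution over single hypotheses, two bodies of evidence can then be compared with a probability divergence even when their focal sets contain several elements.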
2D Human Skeleton Action Recognition Based on Depth Estimation Open Access
Lei WANG Shanmin YANG Jianwei ZHANG Song GU

PAPER-Image Recognition, Computer Vision

Publicized:
2024/02/27
Page(s):
869-877
Human action recognition (HAR) exhibits limited accuracy in video surveillance because monocular cameras capture only 2D information. To address this problem, a depth estimation-based human skeleton action recognition method (SARDE) is proposed in this study, with the aim of transforming 2D human action data into 3D format to uncover hidden action clues in the 2D data. SARDE comprises two tasks, i.e., human skeleton action recognition and monocular depth estimation. The two tasks are integrated in a multi-task manner and trained end to end, sharing parameters to exploit the correlation between action recognition and depth estimation and to learn depth features effectively for human action recognition. In this study, graph-structured networks with inception blocks and skip connections are investigated for depth estimation. The experimental results verify the effectiveness and superiority of the proposed method in skeleton action recognition, showing that it reaches state-of-the-art performance on the datasets.
Research on Mask-Wearing Detection Algorithm Based on Improved YOLOv7-Tiny Open Access
Min GAO Gaohua CHEN Jiaxin GU Chunmei ZHANG

PAPER-Image Recognition, Computer Vision

Publicized:
2024/03/19
Page(s):
878-889
Wearing a mask correctly is an effective way to prevent respiratory infectious diseases. However, in some complex settings, the accuracy of mask-wearing detection still needs to be improved. This research improves the mask-wearing detection technique based on YOLOv7-Tiny. Distribution Shifting Convolutions (DSConv) are used instead of the 3×3 convolutions in the original model to simplify computation and increase detection precision. To decrease the loss of coordinate regression and enhance the detection performance, we adopt the loss function Intersection over Union with Minimum Points Distance (MPDIoU) instead of Complete Intersection over Union (CIoU) in the original model. The GSConv and VoVGSCSP modules are introduced into the model, realizing the model’s mobility. A P6 detection layer is designed to increase detection precision for tiny targets in challenging environments and to decrease missed and false positive detection rates. The robustness of the model is further increased by creating and labeling a mask-wearing dataset covering multiple environments, with Mixup and Mosaic used for data augmentation. The efficiency of the model is validated using comparison and ablation experiments on the mask dataset. The results demonstrate that, compared to YOLOv7-Tiny, the enhanced detection algorithm improves precision by 5.4%, recall by 1.8%, mAP@.5 by 3%, and mAP@.5:.95 by 1.7%, while the FLOPs decrease by 8.5G. The improved detection algorithm therefore detects mask wearing more accurately and closer to real time.
Improving Sliced Wasserstein Distance with Geometric Median for Knowledge Distillation Open Access
Hongyun LU Mengmeng ZHANG Hongyuan JING Zhi LIU

LETTER-Fundamentals of Information Systems

Publicized:
2024/03/08
Page(s):
890-893
Currently, the most advanced knowledge distillation models use a metric learning approach based on probability distributions. However, the correlation between supervised probability distributions is typically geometric and implicit, causing inefficiency and an inability to capture structural feature representations among different tasks. To overcome this problem, we propose a knowledge distillation loss using the robust sliced Wasserstein distance with geometric median (GMSW) to estimate the differences between the teacher and student representations. Due to the intuitive geometric properties of GMSW, the student model can effectively learn to align its hidden states with those of the teacher model, thereby establishing a robust correlation among implicit features. In experiments, our method outperforms state-of-the-art models in both high-resource and low-resource settings.
Channel Pruning via Improved Grey Wolf Optimizer Pruner Open Access
Xueying WANG Yuan HUANG Xin LONG Ziji MA

LETTER-Fundamentals of Information Systems

Publicized:
2024/03/07
Page(s):
894-897
In recent years, the increasing complexity of deep network structures has hindered their application on small, resource-constrained hardware, so deep network models urgently need to be compressed and accelerated. Channel pruning is an effective method to compress deep neural networks. However, most existing channel pruning methods are prone to falling into local optima. In this paper, we propose a channel pruning method based on an Improved Grey Wolf Optimizer Pruner, called IGWO-Pruner, to prune redundant channels of convolutional neural networks. It identifies the pruning ratio of each layer using the Improved Grey Wolf algorithm and then fine-tunes the pruned network model. In the experiments, we evaluate the proposed method on the CIFAR datasets and ILSVRC-2012 with several classical networks, including VGGNet, GoogLeNet, and ResNet-18/34/56/152. The experimental results demonstrate that the proposed method can prune a large number of redundant channels and parameters with little performance loss.
Comparative Performance Analysis of I/O Interfaces on Different NVMe SSDs in a High CPU Contention Scenario Open Access
SeulA LEE Jiwoong PARK

LETTER-Software System

Publicized:
2024/03/18
Page(s):
898-900
This paper analyzes performance differences between interrupt-based and polling-based asynchronous I/O interfaces in high CPU contention scenarios. It examines how the choice of I/O interface can differ depending on the performance of NVMe SSDs, particularly when using PCIe 3.0 and PCIe 4.0-based SSDs.
Real-Time Safety Driving Advisory System Utilizing a Vision-Based Driving Monitoring Sensor Open Access
Masahiro TADA Masayuki NISHIDA

LETTER-Human-computer Interaction

Publicized:
2024/03/15
Page(s):
901-907
In this study, we use a vision-based driving monitoring sensor to track drivers’ visual scanning behavior, a key factor for preventing traffic accidents. Our system evaluates the driver’s behavior by referencing the safety knowledge of professional driving instructors, and provides real-time voice-guided safety advice to encourage safer driving. Our system’s evaluation of safe driving behaviors matched the instructor’s evaluation with an accuracy of over 80%.
Amodal Instance Segmentation of Thin Objects with Large Overlaps by Seed-to-Mask Extending Open Access
Ryohei KANKE Masanobu TAKAHASHI

LETTER-Image Recognition, Computer Vision

Publicized:
2024/02/29
Page(s):
908-911
Amodal Instance Segmentation (AIS) aims to segment the regions of both visible and invisible parts of overlapping objects. The mainstream Mask R-CNN-based methods are unsuitable for thin objects with large overlaps because they rely on object proposal features with bounding boxes, for three reasons. First, capturing the entire shapes of overlapping thin objects is difficult. Second, the bounding boxes of close objects are almost identical. Third, a bounding box contains many objects in most cases. In this paper, we propose a box-free AIS method, Seed-to-Mask, for thin objects with large overlaps. The method specifies a target object using a seed and iteratively extends the segmented region. We have achieved better performance in experiments on artificial data consisting only of thin objects.