IEICE global.ieice.org Site

Keyword Search Result

[Keyword] EE(4053hit)

121-140hit(4053hit)

Lookahead Search-Based Low-Complexity Multi-Type Tree Pruning Method for Versatile Video Coding (VVC) Intra Coding
Qi TENG Guowei TENG Xiang LI Ran MA Ping AN Zhenglong YANG

PAPER-Coding Theory

Pubricized:
2022/08/24
Vol:
E106-A No:3
Page(s):
606-615
The latest versatile video coding (VVC) introduces some novel techniques such as quadtree with nested multi-type tree (QTMT), multiple transform selection (MTS) and multiple reference line (MRL). These tools improve compression efficiency compared with the previous standard H.265/HEVC, but they suffer from very high computational complexity. One of the most time-consuming parts of VVC intra coding is the coding tree unit (CTU) structure decision. In this paper, we propose a low-complexity multi-type tree (MT) pruning method for VVC intra coding. This method consists of lookahead search and MT pruning. The lookahead search process is performed to derive the approximate rate-distortion (RD) cost of each MT node at depth 2 or 3. Subsequently, the improbable MT nodes are pruned by different strategies under different cost errors. These strategies are designed according to the priority of the node. Experimental results show that the overall proposed algorithm can achieve 47.15% time saving with only 0.93% Bjøntegaard delta bit rate (BDBR) increase over natural scene sequences, and 45.39% time saving with 1.55% BDBR increase over screen content sequences, compared with the VVC reference software VTM 10.0. Such results demonstrate that our method achieves a good trade-off between computational complexity and compression quality compared to recent methods.
On the Degrees of Freedom of a Propagation-Delay Based Multicast X Channel with Two Transmitters and Arbitrary Receivers
Conggai LI Qian GAN Feng LIU Yanli XU

PAPER-Wireless Communication Technologies

Pubricized:
2022/08/23
Vol:
E106-B No:3
Page(s):
267-274
Compared with the unicast scenario, X channels with multicast messaging can support richer transmission scenarios. The transmission efficiency of the wireless multicast X channel is an important and open problem. This article studies the degrees of freedom of a propagation-delay based multicast X channel with two transmitters and arbitrary receivers, where each transmitter sends K different messages and each receiver desires K - 1 of them from each transmitter. The cyclic polynomial approach is adopted for modeling and analysis. The DoF upper bound is analyzed and shown to be unreachable. Then a suboptimal scheme with one extra time-slot cycle is proposed, which uses the cyclic interference alignment method and achieves a DoF of K - 1. Finally, the feasibility conditions in the Euclidean space are derived and the potential applications are demonstrated for underwater acoustic and terrestrial radio communications.
A Resource-Efficient Green Paradigm For Crowdsensing Based Spectrum Detection In Internet of Things Networks
Xiaohui LI Qi ZHU Wenchao XIA Yunpei CHEN

PAPER-Terrestrial Wireless Communication/Broadcasting Technologies

Pubricized:
2022/09/12
Vol:
E106-B No:3
Page(s):
275-286
Crowdsensing-based spectrum detection (CSD) is promising to enable full-coverage radio resource availability for the increasingly connected machines in the Internet of Things (IoT) networks. The current CSD scheme consumes a lot of energy and network resources for local sensing, processing, and distributed data reporting for each crowdsensing device. Furthermore, when the amount of reported data is large, the data fusion implemented at the requestor can easily cause high latency. For improving efficiencies in both energy and network resources, this paper proposes a green CSD (GCSD) paradigm. The ambient backscatter (AmB) is used to enable a battery-free mode of operation in which the received spectrum data is reported directly through backscattering without local processing. The energy for backscattering can be provided by ambient radio frequency (RF) sources. Then, relying on air computation (AirComp), the data fusion can be implemented during the backscattering process and over the air by utilizing the summation property of wireless channel. This paper illustrates the model and the implementation process of the GCSD paradigm. Closed-form expressions of detection metrics are derived for the proposed GCSD. Simulation results verify the correctness of the theoretical derivation and demonstrate the green properties of the GCSD paradigm.
Pumping Lemmas for Languages Expressed by Computational Models with Registers
Rindo NAKANISHI Yoshiaki TAKATA Hiroyuki SEKI

PAPER

Pubricized:
2022/10/14
Vol:
E106-D No:3
Page(s):
284-293
Register automaton (RA), register context-free grammar (RCFG) and register tree automaton (RTA) are computational models with registers which deal with data values. This paper shows pumping lemmas for the classes of languages expressed by RA, RCFG and RTA. Among them, the first lemma was already proved in terms of nominal automata, which is an abstraction of RA. We define RTA in a deterministic and bottom-up manner. For these languages, the notion of ‘pumped word’ must be relaxed in such a way that a pumped subword is not always the same as the original subword, but is any word equivalent to the original subword in terms of data type defined in this paper. By using the lemmas, we give examples of languages that do not belong to the above-mentioned classes of languages.
A Subclass of Mu-Calculus with the Freeze Quantifier Equivalent to Register Automata
Yoshiaki TAKATA Akira ONISHI Ryoma SENDA Hiroyuki SEKI

PAPER

Pubricized:
2022/10/25
Vol:
E106-D No:3
Page(s):
294-302
Register automaton (RA) is an extension of finite automaton by adding registers storing data values. RA has good properties such as the decidability of the membership and emptiness problems. Linear temporal logic with the freeze quantifier (LTL↓) proposed by Demri and Lazić is a counterpart of RA. However, the expressive power of LTL↓ is too high to be applied to automatic verification. In this paper, we propose a subclass of modal µ-calculus with the freeze quantifier, which has the same expressive power as RA. Since a conjunction ψ1 ∧ ψ2 in a general LTL↓ formula cannot be simulated by RA, the proposed subclass prohibits at least one of ψ1 and ψ2 from containing the freeze quantifier or a temporal operator other than X (next). Since the obtained subclass of LTL↓ does not have the ability to represent a cycle in RA, we adopt µ-calculus over the subclass of LTL↓, which allows recursive definition of temporal formulas. We provide equivalent translations from the proposed subclass of µ-calculus to RA and vice versa and prove their correctness.
Weighted Multiple Context-Free Grammars
Yusuke INOUE Kenji HASHIMOTO Hiroyuki SEKI

PAPER

Pubricized:
2022/10/14
Vol:
E106-D No:3
Page(s):
309-318
Multiple context-free grammar (MCFG) is an extension of context-free grammar (CFG), which generates tuples of words. The expressive power of MCFG is between CFG and context-sensitive grammar while MCFG inherits good properties of CFG. In this paper, we introduce weighted multiple context-free grammar (WMCFG) as a quantitative extension of MCFG. Then we investigate properties of WMCFG such as polynomial-time computability of basic problems, its closure property and expressive power.
Ordinal Regression Based on the Distributional Distance for Tabular Data
Yoshiyuki TAJIMA Tomoki HAMAGAMI

PAPER-Artificial Intelligence, Data Mining

Pubricized:
2022/12/16
Vol:
E106-D No:3
Page(s):
357-364
Ordinal regression is used to classify instances by considering ordinal relation between labels. Existing methods tend to decrease the accuracy when they adhere to the preservation of the ordinal relation. Therefore, we propose a distributional knowledge-based network (DK-net) that considers ordinal relation while maintaining high accuracy. DK-net focuses on image datasets. However, in industrial applications, one can find not only image data but also tabular data. In this study, we propose DK-neural oblivious decision ensemble (NODE), an improved version of DK-net for tabular data. DK-NODE uses NODE for feature extraction. In addition, we propose a method for adjusting the parameter that controls the degree of compliance with the ordinal relation. We experimented with three datasets: WineQuality, Abalone, and Eucalyptus dataset. The experiments showed that the proposed method achieved high accuracy and small MAE on three datasets. Notably, the proposed method had the smallest average MAE on all datasets.
Learning Multi-Level Features for Improved 3D Reconstruction
Fairuz SAFWAN MAHAD Masakazu IWAMURA Koichi KISE

PAPER-Image Recognition, Computer Vision

Pubricized:
2022/12/08
Vol:
E106-D No:3
Page(s):
381-390
3D reconstruction methods using neural networks are popular and have been studied extensively. However, the resulting models typically lack detail, reducing the quality of the 3D reconstruction. This is because the network is not designed to capture the fine details of the object. Therefore, in this paper, we propose two networks designed to capture both the coarse and fine details of the object to improve the reconstruction of the detailed parts of the object. To accomplish this, we design two networks. The first network uses a multi-scale architecture with skip connections to associate and merge features from other levels. For the second network, we design a multi-branch deep generative network that separately learns the local features, generic features, and the intermediate features through three different tailored components. In both network architectures, the principle entails allowing the network to learn features at different levels that can reconstruct the fine parts and the overall shape of the reconstructed 3D model. We show that both of our methods outperformed state-of-the-art approaches.
Functional Connectivity and Small-World Networks in Prion Disease
Chisho TAKEOKA Toshimasa YAMAZAKI Yoshiyuki KUROIWA Kimihiro FUJINO Toshiaki HIRAI Hidehiro MIZUSAWA

LETTER-Biological Engineering

Pubricized:
2022/11/28
Vol:
E106-D No:3
Page(s):
427-430
We characterized prion disease by comparing brain functional connectivity network (BFCN), which were constructed by 16-ch scalp-recorded electroencephalograms (EEGs). The connectivity between each pair of nodes (electrodes) were computed by synchronization likelihood (SL). The BFCN was applied to graph theory to discriminate prion disease patients from healthy elderlies and dementia groups.
Adversarial Reinforcement Learning-Based Coordinated Robust Spatial Reuse in Broadcast-Overlaid WLANs
Yuto KIHIRA Yusuke KODA Koji YAMAMOTO Takayuki NISHIO

PAPER-Terrestrial Wireless Communication/Broadcasting Technologies

Pubricized:
2022/08/02
Vol:
E106-B No:2
Page(s):
203-212
Broadcast services for wireless local area networks (WLANs) are being standardized in the IEEE 802.11 task group bc. Envisaging the upcoming coexistence of broadcast access points (APs) with densely-deployed legacy APs, this paper addresses a learning-based spatial reuse with only partial receiver-awareness. This partial awareness means that the broadcast APs can leverage few acknowledgment frames (ACKs) from recipient stations (STAs). This is in view of the specific concerns of broadcast communications. In broadcast communications for a very large number of STAs, ACK implosions occur unless some STAs are stopped from responding with ACKs. Given this, the main contribution of this paper is to demonstrate the feasibility to improve the robustness of learning-based spatial reuse to hidden interferers only with the partial receiver-awareness while discarding any re-training of broadcast APs. The core idea is to leverage robust adversarial reinforcement learning (RARL), where before a hidden interferer is installed, a broadcast AP learns a rate adaptation policy in a competition with a proxy interferer that provides jamming signals intelligently. Therein, the recipient STAs experience interference and the partial STAs provide a feedback overestimating the effect of interference, allowing the broadcast AP to select a data rate to avoid frame losses in a broad range of recipient STAs. Simulations demonstrate the suppression of the throughput degradation under a sudden installation of a hidden interferer, indicating the feasibility of acquiring robustness to the hidden interferer.
Machine Learning in 6G Wireless Communications Open Access
Tomoaki OHTSUKI

INVITED PAPER

Pubricized:
2022/08/10
Vol:
E106-B No:2
Page(s):
75-83
Mobile communication systems are not only the core of the Information and Communication Technology (ICT) infrastructure but also that of our social infrastructure. The 5th generation mobile communication system (5G) has already started and is in use. 5G is expected for various use cases in industry and society. Thus, many companies and research institutes are now trying to improve the performance of 5G, that is, 5G Enhancement and the next generation of mobile communication systems (Beyond 5G (6G)). 6G is expected to meet various highly demanding requirements even compared with 5G, such as extremely high data rate, extremely large coverage, extremely low latency, extremely low energy, extremely high reliability, extreme massive connectivity, and so on. Artificial intelligence (AI) and machine learning (ML), AI/ML, will have more important roles than ever in 6G wireless communications with the above extreme high requirements for a diversity of applications, including new combinations of the requirements for new use cases. We can say that AI/ML will be essential for 6G wireless communications. This paper introduces some ML techniques and applications in 6G wireless communications, mainly focusing on the physical layer.
Toward Selective Adversarial Attack for Gait Recognition Systems Based on Deep Neural Network
Hyun KWON

LETTER-Information Network

Pubricized:
2022/11/07
Vol:
E106-D No:2
Page(s):
262-266
Deep neural networks (DNNs) perform well for image recognition, speech recognition, and pattern analysis. However, such neural networks are vulnerable to adversarial examples. An adversarial example is a data sample created by adding a small amount of noise to an original sample in such a way that it is difficult for humans to identify but that will cause the sample to be misclassified by a target model. In a military environment, adversarial examples that are correctly classified by a friendly model while deceiving an enemy model may be useful. In this paper, we propose a method for generating a selective adversarial example that is correctly classified by a friendly gait recognition system and misclassified by an enemy gait recognition system. The proposed scheme generates the selective adversarial example by combining the loss for correct classification by the friendly gait recognition system with the loss for misclassification by the enemy gait recognition system. In our experiments, we used the CASIA Gait Database as the dataset and TensorFlow as the machine learning library. The results show that the proposed method can generate selective adversarial examples that have a 98.5% attack success rate against an enemy gait recognition system and are classified with 87.3% accuracy by a friendly gait recognition system.
Broadcast with Tree Selection from Multiple Spanning Trees on an Overlay Network Open Access
Takeshi KANEKO Kazuyuki SHUDO

PAPER-Network

Pubricized:
2022/08/16
Vol:
E106-B No:2
Page(s):
145-155
On an overlay network where a number of nodes work autonomously in a decentralized way, the efficiency of broadcasts has a significant impact on the performance of distributed systems built on the network. While a broadcast method using a spanning tree produces a small number of messages, the routing path lengths are prone to be relatively large. Moreover, when multiple nodes can be source nodes, inefficient broadcasts often occur because the efficient tree topology differs for each node. To address this problem, we propose a novel protocol in which a source node selects an efficient tree from multiple spanning trees when broadcasting. Our method shortens routing paths while maintaining a small number of messages. We examined path lengths and the number of messages for broadcasts on various topologies. As a result, especially for a random graph, our proposed method shortened path lengths by approximately 28% compared with a method using a spanning tree, with almost the same number of messages.
Commit-Based Class-Level Defect Prediction for Python Projects
Khine Yin MON Masanari KONDO Eunjong CHOI Osamu MIZUNO

PAPER

Pubricized:
2022/11/14
Vol:
E106-D No:2
Page(s):
157-165
Defect prediction approaches have been greatly contributing to software quality assurance activities such as code review or unit testing. Just-in-time defect prediction approaches are developed to predict whether a commit is a defect-inducing commit or not. Prior research has shown that commit-level prediction is not enough in terms of effort, and a defective commit may contain both defective and non-defective files. As the defect prediction community is promoting fine-grained granularity prediction approaches, we propose our novel class-level prediction, which is finer-grained than the file-level prediction, based on the files of the commits in this research. We designed our model for Python projects and tested it with ten open-source Python projects. We performed our experiment with two settings: setting with product metrics only and setting with product metrics plus commit information. Our investigation was conducted with three different classifiers and two validation strategies. We found that our model developed by random forest classifier performs the best, and commit information contributes significantly to the product metrics in 10-fold cross-validation. We also created a commit-based file-level prediction for the Python files which do not have the classes. The file-level model also showed a similar condition as the class-level model. However, the results showed a massive deviation in time-series validation for both levels and the challenge of predicting Python classes and files in a realistic scenario.
Comparative Evaluation of Diverse Features in Fluency Evaluation of Spontaneous Speech
Huaijin DENG Takehito UTSURO Akio KOBAYASHI Hiromitsu NISHIZAKI

PAPER-Speech and Hearing

Pubricized:
2022/10/25
Vol:
E106-D No:1
Page(s):
36-45
There have been lots of previous studies on fluency evaluation of spontaneous speech. However, most of them focus on lexical cues, and little emphasis is placed on how diverse acoustic features and deep end-to-end models contribute to improving the performance. In this paper, we describe multi-layer neural network to investigate not only lexical features extracted from transcription, but also consider utterance-level acoustic features from audio data. We also conduct the experiments to investigate the performance of end-to-end approaches with mel-spectrogram in this task. As the speech fluency evaluation task, we evaluate our proposed method in two binary classification tasks of fluent speech detection and disfluent speech detection. Speech data of around 10 seconds duration each with the annotation of the three classes of “fluent,” “neutral,” and “disfluent” is used for evaluation. According to the two way splits of those three classes, the task of fluent speech detection is defined as binary classification of fluent vs. neutral and disfluent, while that of disfluent speech detection is defined as binary classification of fluent and neutral vs. disfluent. We then conduct experiments with the purpose of comparative evaluation of multi-layer neural network with diverse features as well as end-to-end models. For the fluent speech detection, in the comparison of utterance-level disfluency-based, prosodic, and acoustic features with multi-layer neural network, disfluency-based and prosodic features only are better. More specifically, the performance improved a lot when removing all of the acoustic features from the full set of features, while the performance is damaged a lot if fillers related features are removed. Overall, however, the end-to-end Transformer+VGGNet model with mel-spectrogram achieves the best results. For the disfluent speech detection, the multi-layer neural network using disfluency-based, prosodic, and acoustic features without fillers achieves the best results. The end-to-end Transformer+VGGNet architecture also obtains high scores, whereas it is exceeded by the best results with the multi-layer neural network with significant difference. Thus, unlike in the fluent speech detection, disfluency-based and prosodic features other than fillers are still necessary in the disfluent speech detection.
A Non-Intrusive Speech Quality Evaluation Method Based on the Audiogram and Weighted Frequency Information for Hearing Aid
Ruxue GUO Pengxu JIANG Ruiyu LIANG Yue XIE Cairong ZOU

LETTER-Speech and Hearing

Pubricized:
2022/07/25
Vol:
E106-A No:1
Page(s):
64-68
For a long time, the compensation effect of hearing aid is mainly evaluated subjectively, and there are fewer studies of objective evaluation. Furthermore, a pure speech signal is generally required as a reference in the existing objective evaluation methods, which restricts the practicality in a real-world environment. Therefore, this paper presents a non-intrusive speech quality evaluation method for hearing aid, which combines the audiogram and weighted frequency information. The proposed model mainly includes an audiogram information extraction network, a frequency information extraction network, and a quality score mapping network. The audiogram is the input of the audiogram information extraction network, which helps the system capture the information related to hearing loss. In addition, the low-frequency bands of speech contain loudness information and the medium and high-frequency components contribute to semantic comprehension. The information of two frequency bands is input to the frequency information extraction network to obtain time-frequency information. When obtaining the high-level features of different frequency bands and audiograms, they are fused into two groups of tensors that distinguish the information of different frequency bands and used as the input of the attention layer to calculate the corresponding weight distribution. Finally, a dense layer is employed to predict the score of speech quality. The experimental results show that it is reasonable to combine the audiogram and the weight of the information from two frequency bands, which can effectively realize the evaluation of the speech quality of the hearing aid.
Global Asymptotic Stabilization of Feedforward Systems with an Uncertain Delay in the Input by Event-Triggered Control
Ho-Lim CHOI

LETTER-Systems and Control

Pubricized:
2022/06/28
Vol:
E106-A No:1
Page(s):
69-72
In this letter, we consider a global stabilization problem for a class of feedforward systems by an event-triggered control. This is an extended work of [10] in a way that there are uncertain feedforward nonlinearity and time-varying input delay in the system. First, we show that the considered system is globally asymptotically stabilized by a proposed event-triggered controller with a gain-scaling factor. Then, we also show that the interexecution times can be enlarged by adjusting a gain-scaling factor. A simulation example is given for illustration.
Polar Coding Aided by Adaptive Channel Equalization for Underwater Acoustic Communication
Feng LIU Qianqian WU Conggai LI Fangjiong CHEN Yanli XU

LETTER-Communication Theory and Signals

Pubricized:
2022/07/01
Vol:
E106-A No:1
Page(s):
83-87
To improve the performance of underwater acoustic communications, this letter proposes a polar coding scheme with adaptive channel equalization, which can reduce the amount of feedback information. Furthermore, a hybrid automatic repeat request (HARQ) mechanism is provided to mitigate the impact of estimation errors. Simulation results show that the proposed scheme outperforms the turbo equalization in bit error rate. Computational complexity analysis is also provided for comparison.
Skin Visualization Using Smartphone and Deep Learning in the Beauty Industry
Makoto HASEGAWA Rui MATSUO

PAPER-Biocybernetics, Neurocomputing

Pubricized:
2022/10/12
Vol:
E106-D No:1
Page(s):
68-77
Human skin visualization in the beauty industry with a smart-phone based on deep learning was discussed. Skin was photographed with a medical camera that could simultaneously capture RGB and UV images of the same area. Smartphone RGB images were converted into versions similar to medical RGB and UV images via a deep learning method called cycle-GAN, which was trained with the medical and the smartphone images. After converting the smartphone image into a version similar to a medical RGB image using cycle-GAN, the processed image was also converted into a pseudo-UV image via a deep learning method called U-NET. Hidden age spots were effectively visualized by this image. RGB and UV images similar to medical images can be captured with a smartphone. Provided the neural network on deep learning is trained, a medical camera is not required.
Design of a Dual-Wideband BPF with Parallel-Coupled Stepped Impedance Resonator and Open-Circuited Stubs
Chun-Ping CHEN Zhewang MA Tetsuo ANADA

BRIEF PAPER-Microwaves, Millimeter-Waves

Pubricized:
2022/06/15
Vol:
E105-C No:12
Page(s):
761-766
This brief paper proposes a dual-wideband filter consisting of a parallel-coupled stepped-impedance-resonator (SIR) and open-circuited stubs. Firstly, a notched UWB (ultra-wideband) bandpass filter (BPF) with steep skirt characteristics is theoretically designed. Then a bandstop filter(BSF) is implemented using an SIR and open stubs. By replacing the transmission line part of UWB filter with the BSF, a novel dual-wideband filter (DWBPF) is realized. As a design example, a DWBPF with two passbands, i.e. 3.4-4.8GHz and 7.25-10.25GHz, is designed to validate the design procedure. The designed filter exhibits steep skirt characteristics.

121-140hit(4053hit)

Keyword Search Result

[Keyword] EE(4053hit)

Lookahead Search-Based Low-Complexity Multi-Type Tree Pruning Method for Versatile Video Coding (VVC) Intra Coding

On the Degrees of Freedom of a Propagation-Delay Based Multicast X Channel with Two Transmitters and Arbitrary Receivers

A Resource-Efficient Green Paradigm For Crowdsensing Based Spectrum Detection In Internet of Things Networks

Pumping Lemmas for Languages Expressed by Computational Models with Registers

A Subclass of Mu-Calculus with the Freeze Quantifier Equivalent to Register Automata

Weighted Multiple Context-Free Grammars

Ordinal Regression Based on the Distributional Distance for Tabular Data

Learning Multi-Level Features for Improved 3D Reconstruction

Functional Connectivity and Small-World Networks in Prion Disease

Adversarial Reinforcement Learning-Based Coordinated Robust Spatial Reuse in Broadcast-Overlaid WLANs

Machine Learning in 6G Wireless Communications Open Access

Toward Selective Adversarial Attack for Gait Recognition Systems Based on Deep Neural Network

Broadcast with Tree Selection from Multiple Spanning Trees on an Overlay Network Open Access

Commit-Based Class-Level Defect Prediction for Python Projects

Comparative Evaluation of Diverse Features in Fluency Evaluation of Spontaneous Speech

A Non-Intrusive Speech Quality Evaluation Method Based on the Audiogram and Weighted Frequency Information for Hearing Aid

Global Asymptotic Stabilization of Feedforward Systems with an Uncertain Delay in the Input by Event-Triggered Control

Polar Coding Aided by Adaptive Channel Equalization for Underwater Acoustic Communication

Skin Visualization Using Smartphone and Deep Learning in the Beauty Industry

Design of a Dual-Wideband BPF with Parallel-Coupled Stepped Impedance Resonator and Open-Circuited Stubs

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles