Rupasingha A. H. M. RUPASINGHA Incheon PAIK Banage T. G. S. KUMARA
With the expansion of the Internet, the number of available Web services has increased. Web service clustering to identify functionally similar clusters has become a major approach to the efficient discovery of suitable Web services. In this study, we propose a Web service clustering approach that uses novel ontology learning and a similarity calculation method based on the specificity of an ontology in a domain with respect to information theory. Instead of using traditional methods, we generate the ontology using a novel method that considers the specificity and similarity of terms. The specificity of a term describes the amount of domain-specific information contained in that term. Although general terms contain little domain-specific information, specific terms may contain much more domain-related information. The generated ontology is used in the similarity calculations. New logic-based filters are introduced for the similarity-calculation procedure. If similarity calculations using the specified filters fail, then information-retrieval-based methods are applied to the similarity calculations. Finally, an agglomerative clustering algorithm, based on the calculated similarity values, is used for the clustering. We achieved highly efficient and accurate results with this clustering approach, as measured by improved average precision, recall, F-measure, purity and entropy values. According to the results, specificity of terms plays a major role when classifying domain information. Our novel ontology-based clustering approach outperforms comparable existing approaches that do not consider the specificity of terms.
In this paper we extend hyperparameter-free sparse signal reconstruction approaches to permit the high-resolution time delay estimation of spread spectrum signals and demonstrate their feasibility in terms of both performance and computation complexity by applying them to the ISO/IEC 24730-2.1 real-time locating system (RTLS). Numerical examples show that the sparse asymptotic minimum variance (SAMV) approach outperforms other sparse algorithms and multiple signal classification (MUSIC) regardless of the signal correlation, especially in the case where the incoming signals are closely spaced within a Rayleigh resolution limit. The performance difference among the hyperparameter-free approaches decreases significantly as the signals become more widely separated. SAMV is sometimes strongly influenced by the noise correlation, but the degrading effect of the correlated noise can be mitigated through the noise-whitening process. The computation complexity of SAMV can be feasible for practical system use by setting the power update threshold and the grid size properly, and/or via parallel implementations.
Sangwoo PARK Iickho SONG Seungwon LEE Seokho YOON
We propose a cooperative cognitive radio network (CCRN) with secondary users (SUs) employing two simultaneous transmit and receive (STAR) antennas. In the proposed framework of full-duplex (FD) multiple-input-multiple-output (MIMO) CCRN, the region of achievable rate is expanded via FD communication among SUs enabled by the STAR antennas adopted for the SUs. The link capacity of the proposed framework is analyzed theoretically. It is shown through numerical analysis that the proposed FD MIMO-CCRN framework can provide a considerable performance gain over the conventional frameworks of CCRN and MIMO-CCRN.
Tatsuya NOBUNAGA Toshiaki WATANABE Hiroya TANAKA
Individuals can be identified by features extracted from an electrocardiogram (ECG). However, irregular palpitations due to stress or exercise decrease the identification accuracy due to distortion of the ECG waveforms. In this letter, we propose a human identification scheme based on the frequency spectrums of an ECG, which can successfully extract features and thus identify individuals even while exercising. For the proposed scheme, we demonstrate an accuracy rate of 99.8% in a controlled experiment with exercising subjects. This level of accuracy is achieved by determining the significant features of individuals with a random forest classifier. In addition, the effectiveness of the proposed scheme is verified using a publicly available ECG database. We show that the proposed scheme also achieves a high accuracy with this public database.
Natthawute SAE-LIM Shinpei HAYASHI Motoshi SAEKI
Code smells are indicators of design flaws or problems in the source code. Various tools and techniques have been proposed for detecting code smells. These tools generally detect a large number of code smells, so approaches have also been developed for prioritizing and filtering code smells. However, lack of empirical data detailing how developers filter and prioritize code smells hinders improvements to these approaches. In this study, we investigated ten professional developers to determine the factors they use for filtering and prioritizing code smells in an open source project under the condition that they complete a list of five tasks. In total, we obtained 69 responses for code smell filtration and 50 responses for code smell prioritization from the ten professional developers. We found that Task relevance and Smell severity were most commonly considered during code smell filtration, while Module importance and Task relevance were employed most often for code smell prioritization. These results may facilitate further research into code smell detection, prioritization, and filtration to better focus on the actual needs of developers.
We present a modular way of implementing adaptive decisions in performing scientific simulations. The proposed method employs modern software engineering mechanisms to allow for better software management in scientific computing, where software adaptation has often been implemented manually by the programmer or by using in-house tools, which complicates software management over time. By applying the aspect-oriented programming (AOP) paradigm, we consider software adaptation as a separate concern and, using popular AOP constructs, implement adaptive decision separately from the original code base, thereby improving software management. We demonstrate the effectiveness of our approach with applications to stochastic simulation software.
Hayato SOYA Osamu TAKYU Keiichiro SHIRAI Mai OHTA Takeo FUJII Fumihito SASAMORI Shiro HANDA
A multi-channel cognitive radio is a powerful solution for recovering the exhaustion of frequency spectrum resources. In a cognitive radio, although master and slave terminals (which construct a communication link) have the freedom to access arbitrary channels, access channel mismatch is caused. A rendezvous scheme based on frequency hopping can compensate for this mismatch by exchanging control signals through a selected channel in accordance with a certain rule. However, conventional frequency hopping schemes do not consider an access protocol of both control signals in the rendezvous scheme and the signal caused by channel access from other systems. Further, they do not consider an information sharing method to reach a consensus between the master and slave terminals. This paper proposes a modified rendezvous scheme based on learning-based channel occupancy rate (COR) estimation and describes a specific channel-access rule in the slave terminal. On the basis of this rule, the master estimates a channel selected by the slave by considering the average COR of the other systems. Since the master can narrow down the number of channels, a fast rendezvous scheme with a few control signals is established.
Hiroki IWATA Kenta UMEBAYASHI Janne J. LEHTOMÄKI Shusuke NARIEDA
We introduce a Welch FFT segment size selection method for FFT-based wide band spectrum measurement in the context of smart spectrum access (SSA), in which statistical spectrum usage information of primary users (PUs), such as duty cycle (DC), will be exploited by secondary users (SUs). Energy detectors (EDs) based on Welch FFT can detect the presence of PU signals in a broadband environment efficiently, and DC can be estimated properly if a Welch FFT segment size is set suitably. There is a trade-off between detection performance and frequency resolution in terms of the Welch FFT segment size. The optimum segment size depends on signal-to-noise ratio (SNR) which makes practical and optimum segment size setting difficult. For this issue, we previously proposed a segment size selection method employing a relationship between noise floor (NF) estimation output and the segment size without SNR information. It can achieve accurate spectrum awareness at the expense of relatively high computational complexity since it employs exhaustive search to select a proper segment size. In this paper, we propose a segment size selection method that offers reasonable spectrum awareness performance with low computational complexity since limited search is used. Numerical evaluations show that the proposed method can match the spectrum awareness performance of the conventional method with 70% lower complexity or less.
Mitsunari KANNO Shigeru MIEDA Nobuhide YOKOTA Wataru KOBAYASHI Hiroshi YASAKA
Frequency chirp of a semiconductor laser is controlled by using hybrid modulation, which simultaneously modulates intra-cavity loss and injection current to the laser. The positive adiabatic chirp of injection-current modulation is compensated with the negative adiabatic chirp created by intra-cavity-loss modulation, which enhances the chromatic-dispersion tolerance of the laser. A proof-of-concept transmission experiment confirmed that the hybrid modulation laser has a larger dispersion tolerance than conventional directly modulated lasers due to the negative frequency chirp originating from intra-cavity-loss modulation.
Yu ZHANG Pengyuan ZHANG Qingwei ZHAO
In this letter, we explored the usage of spatio-temporal information in one unified framework to improve the performance of multichannel speech recognition. Generalized cross correlation (GCC) is served as spatial feature compensation, and an attention mechanism across time is embedded within long short-term memory (LSTM) neural networks. Experiments on the AMI meeting corpus show that the proposed method provides a 8.2% relative improvement in word error rate (WER) over the model trained directly on the concatenation of multiple microphone outputs.
Xuefang NIE Yang WANG Liqin DING Jiliang ZHANG
Cellular heterogeneous networks (HetNets) with densely deployed small cells can effectively boost network capacity. The co-channel interference and the prominent energy consumption are two crucial issues in HetNets which need to be addressed. Taking the traffic variations into account, this paper proposes a theoretical framework to analyze spectral efficiency (SE) and energy efficiency (EE) considering jointly further-enhanced inter-cell interference coordination (FeICIC) and spectrum allocation (SA) via a stochastic geometric approach for a two-tier downlink HetNet. SE and EE are respectively derived and validated by Monte Carlo simulations. To create spectrum and energy efficient HetNets that can adapt to traffic demands, a non-convex optimization problem with the power control factor, resource partitioning fraction and number of subchannels for the SE and EE tradeoff is formulated, based on which, an iterative algorithm with low complexity is proposed to achieve the sub-optimal solution. Numerical results confirm the effectiveness of the joint FeICIC and SA scheme in HetNets. Meanwhile, a system design insight on resource allocation for the SE and EE tradeoff is provided.
Ryo MASUMURA Taichi ASAMI Takanobu OBA Hirokazu MASATAKI Sumitaka SAKAUCHI Akinori ITO
This paper proposes a novel domain adaptation method that can utilize out-of-domain text resources and partially domain matched text resources in language modeling. A major problem in domain adaptation is that it is hard to obtain adequate adaptation effects from out-of-domain text resources. To tackle the problem, our idea is to carry out model merger in a latent variable space created from latent words language models (LWLMs). The latent variables in the LWLMs are represented as specific words selected from the observed word space, so LWLMs can share a common latent variable space. It enables us to perform flexible mixture modeling with consideration of the latent variable space. This paper presents two types of mixture modeling, i.e., LWLM mixture models and LWLM cross-mixture models. The LWLM mixture models can perform a latent word space mixture modeling to mitigate domain mismatch problem. Furthermore, in the LWLM cross-mixture models, LMs which individually constructed from partially matched text resources are split into two element models, each of which can be subjected to mixture modeling. For the approaches, this paper also describes methods to optimize mixture weights using a validation data set. Experiments show that the mixture in latent word space can achieve performance improvements for both target domain and out-of-domain compared with that in observed word space.
Eunchul YOON Janghyun KIM Unil YUN
A novel Doppler spread estimation scheme is proposed for an orthogonal frequency division multiplexing (OFDM) system with a Rayleigh fading channel. The proposal develops a composite power spectral density (PSD) function by averaging the multiple PSD functions computed with multiple sets of the channel frequency response (CFR) coefficients. The Doppler spread is estimated by finding the maximum location of the composite PSD quantities larger than a threshold value given by a fixed fraction of the maximum composite PSD quantity. It is shown by simulation that the proposed scheme performs better than three conventional Doppler spread estimation schemes not only in isotropic scattering environments, but also in nonisotropic scattering environments. Moreover, the proposed scheme is shown to perform well in some Rician channel environments if the Rician K-factor is small.
Xianxu HOU Jiasong ZHU Ke SUN Linlin SHEN Guoping QIU
Motivated by the observation that certain convolutional channels of a Convolutional Neural Network (CNN) exhibit object specific responses, we seek to discover and exploit the convolutional channels of a CNN in which neurons are activated by the presence of specific objects in the input image. A method for explicitly fine-tuning a pre-trained CNN to induce object specific channel (OSC) and systematically identifying it for the human faces has been developed. In this paper, we introduce a multi-scale approach to constructing robust face heatmaps based on OSC features for rapidly filtering out non-face regions thus significantly improving search efficiency for face detection. We show that multi-scale OSC can be used to develop simple and compact face detectors in unconstrained settings with state of the art performance.
Soh YOSHIDA Takahiro OGAWA Miki HASEYAMA Mitsuji MUNEYASU
Video reranking is an effective way for improving the retrieval performance of text-based video search engines. This paper proposes a graph-based Web video search reranking method with local and global consistency analysis. Generally, the graph-based reranking approach constructs a graph whose nodes and edges respectively correspond to videos and their pairwise similarities. A lot of reranking methods are built based on a scheme which regularizes the smoothness of pairwise relevance scores between adjacent nodes with regard to a user's query. However, since the overall consistency is measured by aggregating only the local consistency over each pair, errors in score estimation increase when noisy samples are included within query-relevant videos' neighbors. To deal with the noisy samples, the proposed method leverages the global consistency of the graph structure, which is different from the conventional methods. Specifically, in order to detect this consistency, the propose method introduces a spectral clustering algorithm which can detect video groups, in which videos have strong semantic correlation, on the graph. Furthermore, a new regularization term, which smooths ranking scores within the same group, is introduced to the reranking framework. Since the score regularization is performed by both local and global aspects simultaneously, the accurate score estimation becomes feasible. Experimental results obtained by applying the proposed method to a real-world video collection show its effectiveness.
Gibran BENITEZ-GARCIA Tomoaki NAKAMURA Masahide KANEKO
An increasing number of psychological studies have demonstrated that the six basic expressions of emotions are not culturally universal. However, automatic facial expression recognition (FER) systems disregard these findings and assume that facial expressions are universally expressed and recognized across different cultures. Therefore, this paper presents an analysis of Western-Caucasian and East-Asian facial expressions of emotions based on visual representations and cross-cultural FER. The visual analysis builds on the Eigenfaces method, and the cross-cultural FER combines appearance and geometric features by extracting Local Fourier Coefficients (LFC) and Facial Fourier Descriptors (FFD) respectively. Furthermore, two possible solutions for FER under multicultural environments are proposed. These are based on an early race detection, and independent models for culture-specific facial expressions found by the analysis evaluation. HSV color quantization combined with LFC and FFD compose the feature extraction for race detection, whereas culture-independent models of anger, disgust and fear are analyzed for the second solution. All tests were performed using Support Vector Machines (SVM) for classification and evaluated using five standard databases. Experimental results show that both solutions overcome the accuracy of FER systems under multicultural environments. However, the approach which individually considers the culture-specific facial expressions achieved the highest recognition rate.
Mingcong YANG Kai GUO Yongbing ZHANG Yusheng JI
The elastic optical network (EON) is a promising new optical technology that uses spectrum resources much more efficiently than does traditional wavelength division multiplexing (WDM). This paper focuses on the routing, modulation level, spectrum and transceiver allocation (RMSTA) problems of the EON. In contrast to previous works that consider only the routing and spectrum allocation (RSA) or routing, modulation level and spectrum allocation (RMSA) problems, we additionally consider the transceiver allocation problem. Because transceivers can be used to regenerate signals (by connecting two transceivers back-to-back) along a transmission path, different regeneration sites on a transmission path result in different spectrum and transceiver usage. Thus, the RMSTA problem is both more complex and more challenging than are the RSA and RMSA problems. To address this problem, we first propose an integer linear programming (ILP) model whose objective is to optimize the balance between spectrum usage and transceiver usage by tuning a weighting coefficient to minimize the cost of network operations. Then, we propose a novel virtual network-based heuristic algorithm to solve the problem and present the results of experiments on representative network topologies. The results verify that, compared to previous works, the proposed algorithm can significantly reduce both resource consumption and time complexity.
Kentaro TOKORO Shunsuke SAITO Kensaku KANOMATA Masanori MIURA Bashir AHMMAD Shigeru KUBOTA Fumihiko HIROSE
We report room-temperature atomic layer deposition (ALD) of SnO2 using tetramethyltin (TMT) as a precursor and plasma-excited humidified argon as an oxidizing gas and investigate the saturation behaviors of these gases on SnO2-covered Si prisms by IR absorption spectroscopy to determine optimal precursor/oxidizer injection conditions. TMT is demonstrated to adsorb on the SnO2 surface by reacting with surface OH groups, which are regenerated by oxidizing the TMT-saturated surface by plasma-excited humidified argon. We provide a detailed discussion of the growth mechanism. We also report the RT ALD application to the RT TFT fabrication.
Deokgyu YUN Hannah LEE Seung Ho CHOI
This paper proposes a deep learning-based non-intrusive objective speech intelligibility estimation method based on recurrent neural network (RNN) with long short-term memory (LSTM) structure. Conventional non-intrusive estimation methods such as standard P.563 have poor estimation performance and lack of consistency, especially, in various noise and reverberation environments. The proposed method trains the LSTM RNN model parameters by utilizing the STOI that is the standard intrusive intelligibility estimation method with reference speech signal. The input and output of the LSTM RNN are the MFCC vector and the frame-wise STOI value, respectively. Experimental results show that the proposed objective intelligibility estimation method outperforms the conventional standard P.563 in various noisy and reverberant environments.
Shota ISHIMURA Byung-Gon KIM Kazuki TANAKA Shinobu NANBA Kosuke NISHIMURA Hoon KIM Yun C. CHUNG Masatoshi SUZUKI
The intermediate frequency-over-fiber (IFoF) technology has attracted attention as an alternative transmission scheme to the functional split for the next-generation mobile fronthaul links due to its high spectral efficiency and perfect centralized control ability. In this paper, we discuss and clarify network architectures suited for IFoF, based on its advantages over the functional split. One of the major problems for IFoF transmission is dispersion-induced RF power fading, which limits capacity and transmission distance. We introduce our previous work, in which high-capacity and long-distance IFoF transmission was demonstrated by utilizing a parallel intensity/phase modulators (IM/PM) transmitter which can effectively avoid the fading. The IFoF technology with the proposed scheme is well suited for the long-distance mobile fronthaul links for the 5th generation (5G) mobile system and beyond.