The search functionality is under construction.
The search functionality is under construction.

Keyword Search Result

[Keyword] Ada(1871hit)

681-700hit(1871hit)

  • Achievable Rate of Adaptive Wireless Multicast with Antenna Diversity in Nakagami Fading Channels

    Jae Cheol PARK  Jin Soo WANG  Iickho SONG  Yun Hee KIM  

     
    LETTER-Wireless Communication Technologies

      Vol:
    E93-B No:10
      Page(s):
    2826-2829

    We derive the average achievable rate of an adaptive wireless multicast method with antenna diversity in Nakagami fading channels when the rate is selected by the minimum signal-to-noise ratio (SNR) of the multicast group. Based on the limiting distribution of the minimum SNR, we then derive an approximation to the average achievable rate, which provides accurate values easily in a wide range of channel parameters.

  • Adaptive Hot Clutter Mitigation Using Subbanding by Multi-Channel Synthetic Aperture Radar

    Jiantao SUN  Ping ZHANG  

     
    LETTER-Sensing

      Vol:
    E93-B No:10
      Page(s):
    2837-2841

    A hot clutter mitigation algorithm based on Subbanding and Space Fast-time Adaptive Processing (Fast-time STAP) for Multi-channel Synthetic Aperture Radar (MSAR) is analyzed, and is compared with the method based on just fast-time STAP. Simulation results demonstrate that the method based on subbanding and fast-time STAP performs better than the method based on just fast-time STAP in hot clutter mitigation for MSAR.

  • Population Estimation of RFID Tags Using Hadamard Footprints

    Joontae KIM  Seung-Ri JIN  Dong-Jo PARK  

     
    LETTER-Wireless Communication Technologies

      Vol:
    E93-B No:9
      Page(s):
    2473-2476

    A novel method is proposed that can estimate the tag population in Radio Frequency Identification (RFID) systems by using a Hadamard code for the tag response. We formulate the maximum likelihood estimator for the tag population using the number of observed footprints. The lookup table of the estimation algorithm has low complexity. Simulation results show that the proposed estimator performs considerably better than the conventional schemes.

  • A Hybrid Acoustic and Pronunciation Model Adaptation Approach for Non-native Speech Recognition

    Yoo Rhee OH  Hong Kook KIM  

     
    PAPER-Adaptation

      Vol:
    E93-D No:9
      Page(s):
    2379-2387

    In this paper, we propose a hybrid model adaptation approach in which pronunciation and acoustic models are adapted by incorporating the pronunciation and acoustic variabilities of non-native speech in order to improve the performance of non-native automatic speech recognition (ASR). Specifically, the proposed hybrid model adaptation can be performed at either the state-tying or triphone-modeling level, depending at which acoustic model adaptation is performed. In both methods, we first analyze the pronunciation variant rules of non-native speakers and then classify each rule as either a pronunciation variant or an acoustic variant. The state-tying level hybrid method then adapts pronunciation models and acoustic models by accommodating the pronunciation variants in the pronunciation dictionary and by clustering the states of triphone acoustic models using the acoustic variants, respectively. On the other hand, the triphone-modeling level hybrid method initially adapts pronunciation models in the same way as in the state-tying level hybrid method; however, for the acoustic model adaptation, the triphone acoustic models are then re-estimated based on the adapted pronunciation models and the states of the re-estimated triphone acoustic models are clustered using the acoustic variants. From the Korean-spoken English speech recognition experiments, it is shown that ASR systems employing the state-tying and triphone-modeling level adaptation methods can relatively reduce the average word error rates (WERs) by 17.1% and 22.1% for non-native speech, respectively, when compared to a baseline ASR system.

  • Improvements of the One-to-Many Eigenvoice Conversion System

    Yamato OHTANI  Tomoki TODA  Hiroshi SARUWATARI  Kiyohiro SHIKANO  

     
    PAPER-Voice Conversion

      Vol:
    E93-D No:9
      Page(s):
    2491-2499

    We have developed a one-to-many eigenvoice conversion (EVC) system that allows us to convert a single source speaker's voice into an arbitrary target speaker's voice using an eigenvoice Gaussian mixture model (EV-GMM). This system is capable of effectively building a conversion model for an arbitrary target speaker by adapting the EV-GMM using only a small amount of speech data uttered by the target speaker in a text-independent manner. However, the conversion performance is still insufficient for the following reasons: 1) the excitation signal is not precisely modeled; 2) the oversmoothing of the converted spectrum causes muffled sounds in converted speech; and 3) the conversion model is affected by redundant acoustic variations among a lot of pre-stored target speakers used for building the EV-GMM. In order to address these problems, we apply the following promising techniques to one-to-many EVC: 1) mixed excitation; 2) a conversion algorithm considering global variance; and 3) adaptive training of the EV-GMM. The experimental results demonstrate that the conversion performance of one-to-many EVC is significantly improved by integrating all of these techniques into the one-to-many EVC system.

  • Reduced Complexity in Antenna Selection for Polarized MIMO System with SVD for the Practical MIMO Communication Channel Environment

    Maung SANN MAW  Iwao SASASE  

     
    PAPER-Wireless Communication Technologies

      Vol:
    E93-B No:9
      Page(s):
    2389-2399

    In the conventional multi-input multi-output (MIMO) communication systems, most of the antenna selection methods considered are suitable only for spatially separated uni-polarized system under Rayleigh fading channel in non-line of sight (NLOS) condition. There have a few antenna selection schemes for the cross-polarized system in LOS condition and Ricean fading channel, and no antenna selection scheme for the MIMO channel with both LOS and NLOS. In the practical MIMO channel case, influence of LOS and NLOS conditions in the channel can vary from time to time according to the channel parameters and user movement in the system. Based on these influences and channel condition, uni-polarized system may outperform a cross-polarized. Thus, we should consider this kind of practical MIMO channel environment when developing the antenna selection scheme. Moreover, no research work has been done on reducing the complexity of antenna selection for this kind of practical MIMO channel environment. In this paper, reduced complexity in antenna selection is proposed to give the higher throughput in the practical MIMO channel environment. In the proposed scheme, suitable polarized antennas are selected based on the calculation of singular value decomposition (SVD) of channel matrix and then adaptive bit loading is applied. Simulation results show that throughput of the system can be improved under the constraint of target BER and total transmit power of the MIMO system.

  • Adaptive Step-Size Subarray LMS Beamforming

    Ann-Chen CHANG  

     
    LETTER-Antennas and Propagation

      Vol:
    E93-B No:9
      Page(s):
    2448-2450

    The performance of the least-mean-square (LMS) beamformer is heavily dependent on the choice of the step-size, for it governs the convergence rate and steady-state excess mean squared error. To meet the conflicting requirement of low misadjustment, especially for the beamformer being modified in response to the multipath environmental changes, it needs to be controlled in a proper way. In this letter, we present an efficient adaptive step-size subarray LMS to achieve good performance. Simulation results are provided for illustrating the effectiveness of the proposed scheme.

  • Acoustic Model Adaptation for Speech Recognition

    Koichi SHINODA  

     
    INVITED PAPER

      Vol:
    E93-D No:9
      Page(s):
    2348-2362

    Statistical speech recognition using continuous-density hidden Markov models (CDHMMs) has yielded many practical applications. However, in general, mismatches between the training data and input data significantly degrade recognition accuracy. Various acoustic model adaptation techniques using a few input utterances have been employed to overcome this problem. In this article, we survey these adaptation techniques, including maximum a posteriori (MAP) estimation, maximum likelihood linear regression (MLLR), and eigenvoice. We also present a schematic view called the adaptation pyramid to illustrate how these methods relate to each other.

  • MIMO Radar System for Respiratory Monitoring Using Tx and Rx Modulation with M-Sequence Codes

    Takashi MIWA  Shun OGIWARA  Yoshiki YAMAKOSHI  

     
    PAPER-Sensing

      Vol:
    E93-B No:9
      Page(s):
    2416-2423

    The importance of respiratory monitoring systems during sleep have increased due to early diagnosis of sleep apnea syndrome (SAS) in the home. This paper presents a simple respiratory monitoring system suitable for home use having 3D ranging of targets. The range resolution and azimuth resolution are obtained by a stepped frequency transmitting signal and MIMO arrays with preferred pair M-sequence codes doubly modulating in transmission and reception, respectively. Due to the use of these codes, Gold sequence codes corresponding to all the antenna combinations are equivalently modulated in receiver. The signal to interchannel interference ratio of the reconstructed image is evaluated by numerical simulations. The results of experiments on a developed prototype 3D-MIMO radar system show that this system can extract only the motion of respiration of a human subject 2 m apart from a metallic rotatable reflector. Moreover, it is found that this system can successfully measure the respiration information of sleeping human subjects for 96.6 percent of the whole measurement time except for instances of large posture change.

  • Joint Adaptive Modulation and Power Allocation for Multiuser MIMO Systems with Reduced-Feedback

    Youxiang WANG  Yongwan PARK  

     
    LETTER-Communication Theory and Signals

      Vol:
    E93-A No:9
      Page(s):
    1700-1703

    This paper presents an adaptive modulation and power allocation method for uplink multiuser multiple-input multiple-output systems under the assumption that there is perfect channel information at the receiver but not at the transmitter. The receiver jointly optimizes the power level and modulation order for all users under constraints of transmit power and error requirements, and returns these information to each user via a low-rate feedback channel. Power and modulation optimization maximizes the total throughput under a tolerable bit error ratio for each user. The ellipsoid method is used to design efficient algorithms for optimal power and modulation level.

  • Automation Power Energy Management Strategy for Mobile Telecom Industry

    Jong-Ching HWANG  Jung-Chin CHEN  Jeng-Shyang PAN  Yi-Chao HUANG  

     
    PAPER

      Vol:
    E93-B No:9
      Page(s):
    2232-2238

    The aim of this research is to study the power energy cost reduction of the mobile telecom industry through the supervisor control and data acquisition (SCADA) system application during globalization and liberalization competition. Yet this management system can be proposed functions: operating monitors, the analysis on load characteristics and dropping the cost of management.

  • Unsupervised Speaker Adaptation Using Speaker-Class Models for Lecture Speech Recognition

    Tetsuo KOSAKA  Yuui TAKEDA  Takashi ITO  Masaharu KATO  Masaki KOHDA  

     
    PAPER-Adaptation

      Vol:
    E93-D No:9
      Page(s):
    2363-2369

    In this paper, we propose a new speaker-class modeling and its adaptation method for the LVCSR system and evaluate the method on the Corpus of Spontaneous Japanese (CSJ). In this method, closer speakers are selected from training speakers and the acoustic models are trained by using their utterances for each evaluation speaker. One of the major issues of the speaker-class model is determining the selection range of speakers. In order to solve the problem, several models which have a variety of speaker range are prepared for each evaluation speaker in advance, and the most proper model is selected on a likelihood basis in the recognition step. In addition, we improved the recognition performance using unsupervised speaker adaptation with the speaker-class models. In the recognition experiments, a significant improvement could be obtained by using the proposed speaker adaptation based on speaker-class models compared with the conventional adaptation method.

  • Adaptive Arbitration of Fair QoS Based Resource Allocation in Multi-Tier Computing Systems

    Naoki HAYASHI  Toshimitsu USHIO  Takafumi KANAZAWA  

     
    PAPER-Concurrent Systems

      Vol:
    E93-A No:9
      Page(s):
    1678-1683

    This paper proposes an adaptive resource allocation for multi-tier computing systems to guarantee a fair QoS level under resource constraints of tiers. We introduce a multi-tier computing architecture which consists of a group of resource managers and an arbiter. Resource allocation of each client is managed by a dedicated resource manager. Each resource manager updates resources allocated to subtasks of its client by locally exchanging QoS levels with other resource managers. An arbiter compensates the updated resources to avoid overload conditions in tiers. Based on the compensation by the arbiter, the subtasks of each client are executed in corresponding tiers. We derive sufficient conditions for the proposed resource allocation to achieve a fair QoS level avoiding overload conditions in all tiers with some assumptions on a QoS function and a resource consumption function of each client. We conduct a simulation to demonstrate that the proposed resource allocation can adaptively achieve a fair QoS level without causing any overload condition.

  • Commercial Shot Classification Based on Multiple Features Combination

    Nan LIU  Yao ZHAO  Zhenfeng ZHU  Rongrong NI  

     
    LETTER-Image Processing and Video Processing

      Vol:
    E93-D No:9
      Page(s):
    2651-2655

    This paper presents a commercial shot classification scheme combining well-designed visual and textual features to automatically detect TV commercials. To identify the inherent difference between commercials and general programs, a special mid-level textual descriptor is proposed, aiming to capture the spatio-temporal properties of the video texts typical of commercials. In addition, we introduce an ensemble-learning based combination method, named Co-AdaBoost, to interactively exploit the intrinsic relations between the visual and textual features employed.

  • Lightweight Precision-Adaptive Time Synchronization in Wireless Sensor Networks

    Li LI  Yongpan LIU  Huazhong YANG  Hui WANG  

     
    PAPER-Network

      Vol:
    E93-B No:9
      Page(s):
    2299-2308

    Time synchronization is an essential service for wireless sensor networks (WSNs). However, fixed-period time synchronization can not serve multiple users efficiently in terms of energy consumption. This paper proposes a lightweight precision-adaptive protocol for cluster-based multi-user networks. It consists of a basic average time synchronization algorithm and an adaptive control loop. The basic average time synchronization algorithm achieves 1 µs instantaneous synchronization error performance. It also prolongs re-synchronization period by taking the average of two specified nodes' local time to be cluster global time. The adaptive control loop realizes diverse levels of synchronization precision based on the proportional relationship between sync error and re-synchronization period. Experimental results show that the proposed precision-adaptive protocol can respond to the sync error bound change within 2 steps. It is faster than the exponential convergence of the adaptive protocols based on multiplicative iterations.

  • Real-Time Monitoring of Multicast Group Information

    Achmad BASUKI  Achmad Husni THAMRIN  Hitoshi ASAEDA  Jun MURAI  

     
    PAPER-Information Network

      Vol:
    E93-D No:8
      Page(s):
    2213-2222

    This paper presents a method to monitor information of a large-sized multicast group that can follow the group's dynamics in real-time while avoiding feedback implosion by using probabilistic polling. In particular, this paper improves the probabilistic-polling-based approach by deriving a reference mean value as the reference control value for the number of expected feedback from the properties of a binomial estimation model. As a result, our method adaptively changes its estimation parameters depending on the feedback from receivers in order to achieve a fast estimate time with high accuracy, while preventing the possible occurrence of feedback implosion. Our experimental implementation and evaluation on PlanetLab showed that the proposed method effectively controls the number of feedback and accurately estimates the size of a dynamic multicast group.

  • Integrating Overlay Protocols for Providing Autonomic Services in Mobile Ad-Hoc Networks

    Panagiotis GOUVAS  Anastasios ZAFEIROPOULOS  Athanassios LIAKOPOULOS  Gregoris MENTZAS  Nikolas MITROU  

     
    PAPER

      Vol:
    E93-B No:8
      Page(s):
    2022-2034

    Next generation network characteristics increase the complexity in the design and provision of advanced services, making inappropriate the selection of traditional approaches. Future networks are becoming larger in scale, more dynamic and more heterogeneous. In order to cope with these requirements, services are expected to adapt to environmental conditions and require minimum human intervention. In this paper a new model for providing autonomous and decentralized services is proposed, especially focusing on mobile ad hoc networks (MANETs). Using a newly proposed four-layered approach, service development may be realized independently from the underlying physical network. In a reference implementation, it is demonstrated that it is possible to set up an overlay network that hides any network changes from the service layer. Multiple mechanisms have been adapted in order to efficiently -- in terms of message exchanges and convergence time -- operate over an ad hoc environment. Finally, it is demonstrated that a specific service could operate over a dynamic network with multiple failures.

  • An Empirical Study of FTL Performance in Conjunction with File System Pursuing Data Integrity

    In Hwan DOH  Myoung Sub SHIM  Eunsam KIM  Jongmoo CHOI  Donghee LEE  Sam H. NOH  

     
    LETTER-Software System

      Vol:
    E93-D No:8
      Page(s):
    2302-2305

    Due to the detachability of Flash storage, which is a dominant portable storage, data integrity stored in Flash storages becomes an important issue. This study considers the performance of Flash Translation Layer (FTL) schemes embedded in Flash storages in conjunction with file system behavior that pursue high data integrity. To assure extreme data integrity, file systems synchronously write all file data to storage accompanying hot write references. In this study, we concentrate on the effect of hot write references on Flash storage, and we consider the effect of absorbing the hot write references via nonvolatile write cache on the performance of the FTL schemes in Flash storage. In so doing, we quantify the performance of typical FTL schemes for a realistic digital camera workload that contains hot write references through experiments on a real system environment. Results show that for the workload with hot write references FTL performance does not conform with previously reported studies. We also conclude that the impact of the underlying FTL schemes on the performance of Flash storage is dramatically reduced by absorbing the hot write references via nonvolatile write cache.

  • Study of Prominence Detection Based on Various Phone-Specific Features

    Sung Soo KIM  Chang Woo HAN  Nam Soo KIM  

     
    LETTER-Speech and Hearing

      Vol:
    E93-D No:8
      Page(s):
    2327-2330

    In this letter, we present useful features accounting for pronunciation prominence and propose a classification technique for prominence detection. A set of phone-specific features are extracted based on a forced alignment of the test pronunciation provided by a speech recognition system. These features are then applied to the traditional classifiers such as the support vector machine (SVM), artificial neural network (ANN) and adaptive boosting (Adaboost) for detecting the place of prominence.

  • Adaptive Zero-Coefficient Distribution Scan for Inter Block Mode Coding of H.264/AVC

    Jing-Xin WANG  Alvin W.Y. SU  

     
    PAPER-Image Processing and Video Processing

      Vol:
    E93-D No:8
      Page(s):
    2273-2280

    Scanning quantized transform coefficients is an important tool for video coding. For example, the MPEG-4 video coder adopts three different scans to get better coding efficiency. This paper proposes an adaptive zero-coefficient distribution scan in inter block coding. The proposed method attempts to improve H.264/AVC zero coefficient coding by modifying the scan operation. Since the zero-coefficient distribution is changed by the proposed scan method, new VLC tables for syntax elements used in context-adaptive variable length coding (CAVLC) are also provided. The savings in bit-rate range from 2.2% to 5.1% in the high bit-rate cases, depending on different test sequences.

681-700hit(1871hit)