IEICE global.ieice.org Site

Keyword Search Result

[Keyword] ACH(1072hit)

701-720hit(1072hit)

CombNET-III: A Support Vector Machine Based Large Scale Classifier with Probabilistic Framework
Mauricio KUGLER Susumu KUROYANAGI Anto Satriyo NUGROHO Akira IWATA

PAPER-Pattern Recognition

Vol:
E89-D No:9
Page(s):
2533-2541
Several research fields have to deal with very large classification problems, e.g. handwritten character recognition and speech recognition. Many works have proposed methods to address problems with large number of samples, but few works have been done concerning problems with large numbers of classes. CombNET-II was one of the first methods proposed for such a kind of task. It consists of a sequential clustering VQ based gating network (stem network) and several Multilayer Perceptron (MLP) based expert classifiers (branch networks). With the objectives of increasing the classification accuracy and providing a more flexible model, this paper proposes a new model based on the CombNET-II structure, the CombNET-III. The new model, intended for, but not limited to, problems with large number of classes, replaces the branch networks MLP with multiclass Support Vector Machines (SVM). It also introduces a new probabilistic framework that outputs posterior class probabilities, enabling the model to be applied in different scenarios (e.g. together with Hidden Markov Models). These changes permit the use of a larger number of smaller clusters, which reduce the complexity of the final classifiers. Moreover, the use of binary SVM with probabilistic outputs and a probabilistic decoding scheme permit the use of a pairwise output encoding on the branch networks, which reduces the computational complexity of the training stage. The experimental results show that the proposed model outperforms both the previous model CombNET-II and a single multiclass SVM, while presenting considerably smaller complexity than the latter. It is also confirmed that CombNET-III classification accuracy scales better with the increasing number of clusters, in comparison with CombNET-II.
Detection of Overlapping Speech in Meetings Using Support Vector Machines and Support Vector Regression
Kiyoshi YAMAMOTO Futoshi ASANO Takeshi YAMADA Nobuhiko KITAWAKI

PAPER-Engineering Acoustics

Vol:
E89-A No:8
Page(s):
2158-2165
In this paper, a method of detecting overlapping speech segments in meetings is proposed. It is known that the eigenvalue distribution of the spatial correlation matrix calculated from a multiple microphone input reflects information on the number and relative power of sound sources. However, in a reverberant sound field, the feature of the number of sources in the eigenvalue distribution is degraded by the room reverberation. In the Support Vector Machines approach, the eigenvalue distribution is classified into two classes (overlapping speech segments and single speech segments). In the Support Vector Regression approach, the relative power of sound sources is estimated by using the eigenvalue distribution, and overlapping speech segments are detected based on the estimated relative power. The salient feature of this approach is that the sensitivity of detecting overlapping speech segments can be controlled simply by changing the threshold value of the relative power. The proposed method was evaluated using recorded data of an actual meeting.
Novel Downlink Beamforming Method Using Selective STBC with Common Eigenvectors for MIMO-OFDM Systems
Riichi KUDO Yasushi TAKATORI Kentaro NISHIMORI Koichi TSUNEKAWA

PAPER-Wireless Communication Technologies

Vol:
E89-B No:8
Page(s):
2170-2179
To achieve a very high data rate within a limited frequency band in orthogonal frequency division multiplexing (OFDM) systems, multi-input multi-output (MIMO) techniques are very promising. Moreover, if a transmitter has the channel state information (CSI), the achievable spectrum efficiency can be maximized using the eigenbeam-space division multiplexing (E-SDM). However, this scheme demands accurate channel estimation. Therefore, in a closed-loop transmission scheme, an increase in the amount of feedback is absolutely necessary for the E-SDM. This paper describes a downlink beamforming method that significantly reduces the amount of feedback needed by using the common transmission weight vectors in all sub-carriers, compared to the amount required for E-SDM. The proposed method also applies transmission diversity to compensate for the quality. The effectiveness of the proposed method was confirmed using computer simulations in both Ricean and Rayleigh fading environments.
Development of a Rapid Polishing Machine for On-Site Optical Connector Assemble
Shinsuke MATSUI Shigehisa OHKI Shuichi YANAGI Ryo NAGASE Masaru KOBAYASHI

PAPER-Optical Interconnection

Vol:
E89-C No:8
Page(s):
1233-1237
Field assembly of optical connectors is demanded because of the wide use of optical fiber in telecommunications systems. We propose a new assembling techniques that enable us to assemble connectors anywhere quickly and cost effectively. The key points are an adhesive technique and a polishing technique. In this report, we focus mainly on our a new polishing machine, which is suitable for optical connector ends machining on-site. The machine which is small and light weight can finish optical connector ends easily in a short time with enough low cost.
Monolithically Integrated Mach-Zehnder Interferometer All-Optical Switches by Selective Area MOVPE
Xueliang SONG Naoki FUTAKUCHI Daisuke MIYASHITA Foo Cheong YIT Yoshiaki NAKANO

PAPER-Lasers, Quantum Electronics

Vol:
E89-C No:7
Page(s):
1068-1079
We achieved first dynamic all-optical signal processing with a bandgap-engineered MZI SOA all-optical switch. The wide-gap Selective Area Growth (SAG) technique was used to provide multi-bandgap materials with a single step epitaxy. The maximum photoluminescence (PL) peak shift obtained between the active region and the passive region was 192 nm. The static current switching with the fabricated switch indicated a large carrier induced refractive index change; up to 14 π phase shift was obtained with 60 mA injection in the SOA. The carrier recovery time of the SOA for obtaining a phase shift of π was estimated to be 250-300 ps. A clear eye pattern was obtained in 2.5 Gbps all-optical wavelength conversion. This is the first all-optical wavelength conversion demonstration with a bandgap-engineered PIC with either selective area growth or quantum-well intermixing techniques.
RF MEMS--Enabling Technology for Millimeter-Waves
Youngwoo KWON Sanghyo LEE

INVITED PAPER

Vol:
E89-C No:7
Page(s):
898-905
This paper presents high-performance millimeter-wave passive devices using MEMS technology. The purpose of this paper is to show the possibility of MEMS technology as an enabling technology for millimeter-waves. The loss and cost issues, which have been the inherent barrier for commercialization of mm-waves, can be solved by RF MEMS technology. Successful demonstrations of MEMS technology for mm-waves include novel CPW transmission lines, digital impedance tuners, analog tunable band-pass filters, reconfigurable low-pass filters, V-band digital distributed phase shifters and 2-D mechanical beam-steering antennas. All these circuits were implemented for 30-65 GHz frequency range, and show the state-of-the-art performance, which is beyond the limit set by the conventional technology.
Adaptive Morse Code Recognition Using Support Vector Machines for Persons with Physical Disabilities
Cheng-Hong YANG Li-Yeh CHUANG Cheng-Huei YANG Ching-Hsing LUO

PAPER-Digital Signal Processing

Vol:
E89-A No:7
Page(s):
1995-2002
In this paper, Morse code is selected as a communication adaptive device for persons whose hand coordination and dexterity are impaired by such ailments as amyotrophic lateral sclerosis, multiple sclerosis, muscular dystrophy, and other severe handicaps. Morse code is composed of a series of dots, dashes, and space intervals, and each element is transmitted by sending a signal for a defined length of time. A suitable adaptive automatic recognition method is needed for persons with disabilities due to their difficulty in maintaining a stable typing rate. To overcome this problem, the proposed method combines the support vector machines method with a variable degree variable step size LMS algorithm. The method is divided into five stages: tone recognition, space recognition, training process, adaptive processing, and character recognition. Statistical analyses demonstrated that the proposed method elicited a better recognition rate in comparison to alternative methods from the literature.
A Visual Inspection System Based on Trinarized Broad-Edge and Gray-Scale Hybrid Matching
Haruhisa OKUDA Manabu HASHIMOTO Miwako HIROOKA Kazuhiko SUMI

PAPER-Image Inspection

Vol:
E89-D No:7
Page(s):
2068-2075
In the field of industrial manufacturing, visual pattern inspection is an important task to prevent the inclusion of incorrect parts. There have been demands for such methods able to handle factors caused by positional and rotational alignment, and illumination changes. In this paper, we propose a discrimination method called Trinarized broad-edge and Gray-scale Hybrid Matching (TGHM). The method is highly reliable due to gray-scale cross correlation which has a high pattern discrimination efficiency, with high-speed position and rotation alignment using the characteristics of trinarized broad-edge representation which has high data compressibility and illumination-resistant variability. In an example in which the method is applied to mis-collation inspection equipment of a bookbinding machine, it is confirmed that the processing speed is 24,000 sheets/hour, the error detection rate is 100.0%, and the mis-alarm rate is less than 0.002%, and it is verified that the method is practical.
A Method for English-Korean Target Word Selection Using Multiple Knowledge Sources
Ki-Young LEE Sang-Kyu PARK Han-Woo KIM

PAPER

Vol:
E89-A No:6
Page(s):
1622-1629
Target word selection is one of the most important and difficult tasks in English-Korean Machine Translation. It effects on the overall translation accuracy of machine translation systems. In this paper, we present a new approach to Korean target word selection for an English noun with translation ambiguities using multiple knowledge such as verb frame patterns, sense vectors based on collocations, statistical Korean local context information and co-occurring POS information. Verb frame patterns constructed with dictionary and corpus play an important role in resolving the sparseness problem of collocation data. Sense vectors are a set of collocation data when an English word having target selection ambiguities is to be translated to specific Korean target word. Statistical Korean Local Context Information is an N-gram information generated using Korean corpus. The co-occurring POS information is a statistically significant POS clue which appears with ambiguous word. To evaluate our approach, we applied the method to Tellus-EK system, English-Korean automatic translation system currently developed at ETRI [1],[2]. The experiment showed promising results for diverse sentences from web documents.
Multilingual Closed Caption Translation System for Digital Television
Sanghwa YUH Kongjoo LEE Jungyun SEO

PAPER-Service and System

Vol:
E89-D No:6
Page(s):
1885-1892
In this paper, we present a Korean to Chinese/English/Japanese multilingual Machine Translation (MT) system of closed captions for Digital Television (DTV). Preliminary experiments of our closed caption translation with existing base MT systems had shown unsatisfactory result. In order to achieve more accurate translation with the base MT systems, we adopted live resources of multilingual Named Entities and their translingual equivalences from the Web. We also utilize the program information, which the terrestrial broadcasters offer through DTV transport stream, in order to use program specific dictionaries, including the names of characters, locations and organizations. Two more components are adopted for reducing the ambiguities of parsing and word sense disambiguation; sentence simplification for long sentence segmentation and dynamic domain identification for automatic domain dictionary stacking. With these integrated approaches, we could raise the Mean Opinion Score (MOS) of translation accuracy by 0.40 higher than the base MT systems.
A Metric for Example Matching in Example-Based Machine Translation
Dong-Joo KIM Han-Woo KIM

LETTER

Vol:
E89-A No:6
Page(s):
1713-1716
This paper proposes a metric for example matching under the example-based machine translation. Our metric served as similarity measure is employed to retrieve the most similar examples to a given query. Basically it makes use of simple information such as lemma and part-of-speech information of typographically mismatched words. In addition, it uses the contiguity information of matched word units to catch the full context. Finally we show the results for the correctness of the proposed metric.
Performance Evaluation on Worst Change Time of Flush and SCO Dynamic Changes for State Machine WF-Nets
Shingo YAMAGUCHI Katsuaki MIYAUCHI Qi-Wei GE Minoru TANAKA

LETTER

Vol:
E89-A No:6
Page(s):
1701-1704
This paper deals with the performance evaluation of two types of dynamic change, called Flush and SCO (Synthetic Cut-Over), for state machine WF-nets. As an evaluation measure of dynamic change for marked graph WF-nets, change time has been used. We first generalize change time so as to apply it to dynamic change for state machine WF-nets. By using its maximum value, we evaluate the worst-case of dynamic change for state machine WF-nets. We call the maximum value as worst change time. Then under the same assumptions as our previous studies, we give methods of calculating worst change time of Flush and SCO dynamic changes. We also clarify the relation on worst change time between them. Finally we evaluate them by comparing the values of worst change time for an actual example of dynamic change.
3D Inspection on Wafer Solder Bumps Using Binary Grating Projection in Integrated Circuit Manufacturing
Shu YUAN Dongping TIAN Yanxing ZENG

PAPER-Si Devices and Processes

Vol:
E89-C No:5
Page(s):
602-607
For the measurement of the 3D surface of micro-solderballs in IC (Integrated Circuit) manufacturing inspection, a binary grating project lenses of high MTF (Modulation Transfer Function) with tilted project plane is designed in this paper. Using a combination of lenses and a tilted optical layout both on object and image plane, the wave-front aberrations are reduced and the nonlinear image distortion is corrected with nonlinearity compensation, This optical lens allows us to project the structured light pattern to the inspected objects efficiently for clear deformed coded imaging, it could be used to online measure 3D shape of micro-solderballs with high precision and accuracy.
Sublogarithmic Space-Bounded Multi-Inkdot Alternating Turing Machines with Only Existential (Universal) States
Tsunehiro YOSHINAGA Jianliang XU Katsushi INOUE

LETTER

Vol:
E89-A No:5
Page(s):
1417-1420
This paper investigates the accepting powers of two-way alternating Turing machines (2ATM's) with only existential (universal) states which have inkdots and sublogarithmic space. It is shown that for sublogarithmic space-bounded computations, (i) multi-inkdot 2ATM's with only existential states and the ones with only universal states are incomparable, (ii) k-inkdot 2ATM's are better than k-inkdot 2ATM's with only existential (universal) states, k ≥ 0, and (iii) the class of sets accepted by multi-inkdot 2ATM's with only existential (universal) states is not closed under complementation.
Analysis of Automation Surprises in Human-Machine Systems with Time Information
Masakazu ADACHI Toshimitsu USHIO

PAPER

Vol:
E89-A No:4
Page(s):
1027-1034
This paper analyzes automation surprises in human-machine systems with time information. Automation surprises are phenomena such that the underlying machine's behavior diverges from user's intention and may lead to critical situations. Thus, designing human-machine systems without automation surprises is one of fundamental issues to achieve reliable user interaction with the machines. In this paper, we focus on timed human-machine interaction and address their formal aspects. The presented framework is essentially an extension of untimed human-machine interaction and will cover the previously proposed methodologies. We employ timed automata as a model of human-machine systems with time information. Modeling the human-machine systems as timed automata enables one to deal with not only discrete behavior but also time constraints. Then, by introducing the concept of timed simulation of the machine model and the user model, conditions which guarantee the nonexistence of automation surprises are derived. Finally, we construct a composite model in which a machine model and a user model evolve concurrently and show that automation surprises can be detected by solving a reachability problem in the composite model.
An Energy-Efficient Partitioned Instruction Cache Architecture for Embedded Processors
CheolHong KIM SungWoo CHUNG ChuShik JHON

PAPER-Computer Systems

Vol:
E89-D No:4
Page(s):
1450-1458
Energy efficiency of cache memories is crucial in designing embedded processors. Reducing energy consumption in the instruction cache is especially important, since the instruction cache consumes a significant portion of total processor energy. This paper proposes a new instruction cache architecture, named Partitioned Instruction Cache (PI-Cache), for reducing dynamic energy consumption in the instruction cache by partitioning it to smaller (less power-consuming) sub-caches. When the proposed PI-Cache is accessed, only one sub-cache is accessed by utilizing the temporal/spatial locality of applications. In the meantime, other sub-caches are not accessed, leading to dynamic energy reduction. The PI-Cache also reduces dynamic energy consumption by eliminating the energy consumed in tag lookup and comparison. Moreover, the performance gap between the conventional instruction cache and the proposed PI-Cache becomes little when the physical cache access time is considered. We evaluated the energy efficiency by running a cycle accurate simulator, SimpleScalar, with power parameters obtained from CACTI. Simulation results show that the PI-Cache improves the energy-delay product by 20%-54% compared to the conventional direct-mapped instruction cache.
Comparative Study of Speaker Identification Methods: dPLRM, SVM and GMM
Tomoko MATSUI Kunio TANABE

PAPER-Speaker Recognition

Vol:
E89-D No:3
Page(s):
1066-1073
A comparison of performances is made of three text-independent speaker identification methods based on dual Penalized Logistic Regression Machine (dPLRM), Support Vector Machine (SVM) and Gaussian Mixture Model (GMM) with experiments by 10 male speakers. The methods are compared for the speech data which were collected over the period of 13 months in 6 utterance-sessions of which the earlier 3 sessions were for obtaining training data of 12 seconds' utterances. Comparisons are made with the Mel-frequency cepstrum (MFC) data versus the log-power spectrum data and also with training data in a single session versus in plural ones. It is shown that dPLRM with the log-power spectrum data is competitive with SVM and GMM methods with MFC data, when trained for the combined data collected in the earlier three sessions. dPLRM outperforms GMM method especially as the amount of training data becomes smaller. Some of these findings have been already reported in [1]-[3].
Training Augmented Models Using SVMs
Mark J.F. GALES Martin I. LAYTON

INVITED PAPER

Vol:
E89-D No:3
Page(s):
892-899
There has been significant interest in developing new forms of acoustic model, in particular models which allow additional dependencies to be represented than those contained within a standard hidden Markov model (HMM). This paper discusses one such class of models, augmented statistical models. Here, a local exponential approximation is made about some point on a base model. This allows additional dependencies within the data to be modelled than are represented in the base distribution. Augmented models based on Gaussian mixture models (GMMs) and HMMs are briefly described. These augmented models are then related to generative kernels, one approach used for allowing support vector machines (SVMs) to be applied to variable length data. The training of augmented statistical models within an SVM, generative kernel, framework is then discussed. This may be viewed as using maximum margin training to estimate statistical models. Augmented Gaussian mixture models are then evaluated using rescoring on a large vocabulary speech recognition task.
DCLUE: A Distributed Cluster Emulator
Krishna KANT Amit SAHOO Nrupal JANI

PAPER-Parallel/Distributed Programming Models, Paradigms and Tools

Vol:
E89-D No:2
Page(s):
433-440
Given the availability of high-speed Ethernet and HW based protocol offload, clustered systems using a commodity network fabric (e.g., TCP/IP over Ethernet) are expected to become more attractive for a range of e-business and data center applications. In this paper, we describe a comprehensive simulation to study the performance of clustered database systems using such a fabric. The simulation model currently supports both TCP and SCTP as the transport protocol and models an Oracle 9i like clustered DBMS running a TPC-C like workload. The model can be used to study a wide variety of issues regarding the performance of clustered DBMS systems including the impact of enhancements to network layers (transport, IP, MAC), QoS mechanisms or latency improvements, and cluster-wide power control issues.
Proxy-Based Index Caching for Content-Addressable Networks
Shigeaki TAGASHIRA Syuhei SHIRAKAWA Satoshi FUJITA

PAPER-Peer-to-Peer Computing

Vol:
E89-D No:2
Page(s):
555-562
Content-Addressable Network (CAN) provides a mechanism that could retrieve objects in a P2P network by maintaining indices to those objects in a fully decentralized manner. In the CAN system, index caching is a useful technique for reducing the response time of retrieving objects. The key points of effective caching techniques are to improve cache hit ratio by actively sharing caches distributed over the P2P network with every node and to reduce a maintenance and/or routing overhead for locating the cache of a requested index. In this paper, we propose a new caching technique based on the notion of proxy-type caching techniques which have been widely used in WWW systems. It can achieve active cache sharing by incorporating the concept of proxy caching into the index access mechanism and locate a closer proxy cache of a requested index with a little routing overhead. By the result of simulations, we conclude that it can improve the response time of retrieving indices by 30% compared with conventional caching techniques.

701-720hit(1072hit)

Keyword Search Result

[Keyword] ACH(1072hit)

CombNET-III: A Support Vector Machine Based Large Scale Classifier with Probabilistic Framework

Detection of Overlapping Speech in Meetings Using Support Vector Machines and Support Vector Regression

Novel Downlink Beamforming Method Using Selective STBC with Common Eigenvectors for MIMO-OFDM Systems

Development of a Rapid Polishing Machine for On-Site Optical Connector Assemble

Monolithically Integrated Mach-Zehnder Interferometer All-Optical Switches by Selective Area MOVPE

RF MEMS--Enabling Technology for Millimeter-Waves

Adaptive Morse Code Recognition Using Support Vector Machines for Persons with Physical Disabilities

A Visual Inspection System Based on Trinarized Broad-Edge and Gray-Scale Hybrid Matching

A Method for English-Korean Target Word Selection Using Multiple Knowledge Sources

Multilingual Closed Caption Translation System for Digital Television

A Metric for Example Matching in Example-Based Machine Translation

Performance Evaluation on Worst Change Time of Flush and SCO Dynamic Changes for State Machine WF-Nets

3D Inspection on Wafer Solder Bumps Using Binary Grating Projection in Integrated Circuit Manufacturing

Sublogarithmic Space-Bounded Multi-Inkdot Alternating Turing Machines with Only Existential (Universal) States

Analysis of Automation Surprises in Human-Machine Systems with Time Information

An Energy-Efficient Partitioned Instruction Cache Architecture for Embedded Processors

Comparative Study of Speaker Identification Methods: dPLRM, SVM and GMM

Training Augmented Models Using SVMs

DCLUE: A Distributed Cluster Emulator

Proxy-Based Index Caching for Content-Addressable Networks

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles