IEICE global.ieice.org Site

Keyword Search Result

[Keyword] EE(4073hit)

3741-3760hit(4073hit)

Speech Enhancement Using Microphone Array with Multi-Stage Processing
Yuchang CAO Sridha SRIDHARAN Miles MOODY

PAPER-Acoustics

Vol:
E79-A No:3
Page(s):
386-394
A microphone array system with multi-stage processing for speech enhancement is presented in this paper. Two beamformers with uniform directional patterns, one aimed at the target source and the other at the interfering sources, convert the multi-channel inputs into two data sequences. A novel microphone array structure with a small aperture has been designed to obtain the dual beamformers. The outputs of the two beam-formers are then presented to a post-processing stage to further improve the quality and intelligibility of the speech signal. The post-processing stage can be selected from one of three different algorithms that are presented, which are suitable for different acoustic environments. Applications for such a system include hands-free telephony, teleconferencing and also special situations where speech signals must be picked up in an extremely noisy acoustic environment in which the microphones are hidden (e.g. in a forensic covert recording system).
Minimization of Multiple-Valued Logic Expressions with Kleenean Coefficients
Yutaka HATA Takahiro HOZUMI Kazuharu YAMATO

PAPER-Computer Hardware and Design

Vol:
E79-D No:3
Page(s):
189-195
This paper describes Kleenean coefficients that are a subset of Kleenean functions for use in representing multiple-valued logic functions. A conventional multiple-valued sum-of-products expression uses product terms that are the MIN of literals and constants. In this paper, a new sum-of-products expression is allowed to sum product terms that also include variables and complements of variables. Since the conventional sum-of-products expression is complete, so also is the augmented one. A minimization method of the new expression is described besed on the binary Quine-McCluskey algorithm. The result of computer simulation shows that a saving of the number of implicants used in minimal expressions by approximately 9% on the average can be obtained for some random functions. A result for some arithmetic functions shows that the minimal solutions of MOD radix SUM, MAX and MIN functions require much fewer implicants than those of the standard sum-of-products expressions. Thus, this paper clarifies that the new expression has an advantage to reduce the number of implicants in minimal sum-of-products expressions.
Improved CELP-Based Coding in a Noisy Environment Using a Trained Sparse Conjugate Codebook
Akitoshi KATAOKA Sachiko KURIHARA Shinji HAYASHI Takehiro MORIYA

PAPER-Speech Processing and Acoustics

Vol:
E79-D No:2
Page(s):
123-129
A trained sparse conjugate codebook is proposed for improving the speech quality of CELP-based coding in a noisy environment. Although CELP coding provides high quality at a low bit rate in a silent environment (creating clean speech), it cannot provide a satisfactory quality in a noisy environment because the conventional fixed codebook is designed to be suitable for clean speech. The proposed codebook consists of two sub-codebooks; each sub-codebook consists of a random component and a trained component. Each component has excitation vectors consisting of a few pulses. In the random component, pulse position and amplitude are determined randomly. Since the radom component does not depend on the speech characteristics, it handles noise better than the trained one. The trained component maintains high quality for clean speech. Since excitation vector is the sum of the two sub-excitation vectors, this codebook handles various speech conditions by selecting a sub-vector from each component. This codebook also reduces the computational complexity of a fixed codebook search and memory requirements compared with the conventional codebook. Subjective testing (absolute category rating (ACR) and degradation category rating (DCR)) indicated that this codebook improves speech quality compared with the conventional trained codebook for noisy speech. The ACR test showed that the quality of the 8 kbit/s CELP coder with this codebook is equivalent to that of the 32 kbit/s ADPCM for clean speech.
A Current-Mode Bit-Block Circuit Applicable to Low-Voltage, Low-Power Pipeline Video-Speed A/D Converters
Yasuhiro SUGIMOTO Shunsaku TOKITO Hisao KAKITANI Eitaro SETA

PAPER

Vol:
E79-A No:2
Page(s):
199-209
This paper describes a study to determine if a current-mode circuit is useful as an analog circuit technique for realizing submicron mixed analog-and-digital MOS LSIs. To examine this, we designed and circuit simulated a new current-mode ADC bit-block for a 3 V, 10-bit level, 20 MHz ADC with a pipeline architecture and with full current-mode approach. A new precision current-mode sample-and-hold circuit which enables operation of a bit block at a clock speed of 20 MHz was developed. Current mismatches caused by the poor output impedance of a device were also decreased by adopting a cascode configuration throughout the design. Operation with a 3 V power supply and a 20 MHz clock speed in a 3-bit A/D configuration was verified through circuit simulation using standard CMOS 0.6 µm device parameters. Gain error, mismatch of current, and linearity of the bit block with changing threshold voltage of a device were carefully examined. The bit block has a gain error of 0.2% (10-bit level), a linearity error of less than 0.1% (more than 10-bit level), and a current mismatch of DAC current sources in a bit cell of 0.2 to 0.4% (more than 8-bit level) with a 3 V power supply and 20 MHz clock speed. An 8-to 9-bit video-speed pipeline ADC can be realized without calibration. This confirms that the current-mode approach is effective.
Projective Image Representation and Its Application to Image Compression
Kyeong-Hoon JUNG Choong Woong LEE

PAPER-Image Processing,Computer Graphics and Pattern Recognition

Vol:
E79-D No:2
Page(s):
136-142
This paper introduces a new image representation method that is named the projective image representation (PIR). We consider an image as a collage of symmetric segments each of which can be well represented by its projection data of a single orientation. A quadtree-based method is adopted to decompose an image into variable sized segments according to the complexity within it. Also, we deal with the application of the PIR to the image compression and propose an efficient algorithm, the quadtree-structured projection vector quantization (QTPVQ) which combines the PIR with the VQ. As the VQ is carried out on the projection data instead of the pixel intensities of the segment, the QTPVQ successfully overcomes the drawbacks of the conventional VQ algorithms such as the blocking artifact and the difficulty in manipulating the large dimension. Above all, the QTPVQ improves the subjective quality greatly, especially at low bit rate, which makes it applicable to low bit rate image coding.
A Proposal of Five-Degree-of-Freedom 3D Nonverbal Voice Interface
Tatsuhiro YONEKURA Rikako NARISAWA Yoshiki WATANABE

PAPER-Human Communications and Ergonomics

Vol:
E79-A No:2
Page(s):
242-247
This paper proposes a new emphasizing three-dimensional pointing device considering user friendliness and lack of cable clutter. The proposed method utilizes five degrees of freedom via the medium of non-verbal voice of human. That is, the spatial direction of the sound source, the type of the voice phoneme and the tone of the voice phoneme are utilized. The input voice is analyzed regarding the above factors and then taking proper effects as previously defined for human interface. In this paper the estimated spatial direction is used for three-dimensional movement for the virtual object as three degrees of freedom. Both of the type and the tone of the voice phoneme are used for remaining two degrees of freedom. Since vocalization of nonverbal human voice is an everyday task, and the intonation of the voice can be quite easily and intentionally controlled by human vocal ability, the proposed scheme is a new three-dimensional spatial interaction medium. In this sense, this paper realizes a cost-effective and handy nonverbal interface scheme without any artificial wearing materials which might give a physical and psychological fatigue. By using the prototype the authors evaluate the performance of the scheme from both of static and dynamic points of view and show some advantages of look and feel, and then prospect possibilities of the application for the proposed scheme.
Message Transfer Algorithms on the Recursive Diagonal Torus
Yulu YANG Hideharu AMANO

PAPER-Computer Systems

Vol:
E79-D No:2
Page(s):
107-116
Recursive Diagonal Torus (RDT) is a class of interconnection network for massively parallel computers with 216 nodes. In this paper, message transfer algorithms on the RDT are proposed and discussed. First, a simple one-to-one message routing algorithm called the vector routing is introduced and its practical extension called the floating vector routing is proposed. In the floating vector routing both the diameter and average distance are improved compared with the fixed vector routing. Next, broadcasting and hypercube emulation algorithm scheme on the RDT are shown. Finally, deadlock-free message routing algorithms on the RDT are discussed. By a simple modification of the e-cube routing and a small numbers of additional virtual channels, both one-to-one message transfer and broadcast can be achieved without deadlock.
Message Forwarding Delay Analysis for Error Control of Data Transmission on ATM Network
Noriaki KAMIYAMA Miki YAMAMOTO Hiromasa IKEDA

PAPER-Communication Networks and Services

Vol:
E79-B No:2
Page(s):
163-172
The message level performance of error controls in data communication on ATM network is analyzed. Three layers, "a cell"(a unit of transmission), "a block"(a unit of error controls) and "a message"(a unit of transmission of user level) are considered. The error controls treated in this paper are GBN (Go-Back-N) and FEC+GBN. The cell loss process is assumed to be the two state Markov chain considering the cell loss process in ATM networks. Numerical results show that (1) the improvement of the message forwarding delay is saturated in some environments when the interface rate becomes high, (2) FEC is efficient when the burstiness of the cell loss process is small, the message length is large and the interface rate is high.
The Performance Prediction on Sentence Recognition Using a Finite State Word Automaton
Takashi OTSUKI Akinori ITO Shozo MAKINO Teruhiko OHTOMO

PAPER-Speech Processing and Acoustics

Vol:
E79-D No:1
Page(s):
47-53
This paper presents the performance prediction method on sentence recognition system which uses a finite state word automaton. When each word is uttered separately, the relationship between word recognition score and sentence recognition score can be approximated using the number of word sequences at a minimum distance from each sentence in the task. But it is not clear that how we get this number when the finite state word automaton is used as linguistic information. Therefore, we propose the algorithm to calculate this number in polynomial time. Then we carry out the prediction using this method and the simulation to compare with the prediction on the task of Japanese text editor commands. And it is shown that our method approximates the lower limit of sentence recognition score.
Continuous Speech Recognition Using a Combination of Syntactic Constraints and Dependency Relationships
Tsuyoshi MORIMOTO

PAPER-Speech Processing and Acoustics

Vol:
E79-D No:1
Page(s):
54-62
This paper proposes a Japanese continuous speech recognition mechanism in which a full-sentence-level context-free-grammar (CFG) and one kind of semantic constraint called dependency relationships between two bunsetsu (a kind of phrase) in Japanese" are used during speech recognition in an integrated way. Each dependency relationship is a modification relationship between two bunsetsu; these relationships include the case-frame relationship of a noun bunsetsu to a predicate bunsetsu, or adnominal modification relationships such as a noun bunsetsu to a noun bunsetsu. To suppress the processing overhead caused by using relationships of this type during speech recognition, no rigorous semantic analysis is performed. Instead, a simple matching with examples" approach is adopted. An experiment was carried out and results were compared with a case employing only CFG constraints. They show that the speech recognition accuracy is improved and that the overhead is small enough.
A short-Span Optical Feeder for Wireless Personal Communication Systems Using Multimode Fibers
Yasuhiko MATSUNAGA Makoto SHIBUTANI

PAPER-System Applications

Vol:
E79-C No:1
Page(s):
118-123
In this paper, we propose to use graded-index multimode fibers (GI-MMFs) with Fabry-Perot laser diodes (FP-LDs) for short-span and low-cost feeders. The multimode fiber feeders can be applied to wireless personal communication systems where the required feeder length is within several hundred meters, such as distributed antenna networks for microcellular systems or wireless LANs. The use of multimode fibers makes fiber coupling and connection easier, and has the potential to greatly reduce total system cost. Three types of GI-MMFs are considered as transmission media, (1) silica-based glass optical fiber (GI-GOF),(2) silica-core plastic-clad fiber (GI-PCF), and (3) all-plastic optical fiber (GI-POF). It is shown that GI-GOF and GI-PCF are suitable for use as feeders in the microcells of CDMA cellular and wireless LAN systems within 300m in length. GI-POF is estimated to be suitable for use as feeders in wireless LANs within 100m in length. A multimode fiber feeder with FP-LDs and GI-PCF of 300 m is developed in order to demonstrate its applicability to a wireless LAN system operating in the 2.4 GHz ISM band.
Bayesian Performance Estimation Driven by Performance Monitoring and Its Application
Hiroshi SAITO

PAPER-Communication Networks and Services

Vol:
E79-B No:1
Page(s):
1-7
A performance estimation method has been developed that combines conventional performance evaluation with Bayesian regression analysis. The conventional method is used to estimate performance a priori; this a priori estimate is then updated through Bayesian regression analysis using monitored performance. This method compensates for modeling errors in the conventional technique without recreating complex performance models; it does not require additional traffic measurement or system behavior models. Numerical examples and applications of traffic management in ATM PVC networks have demonstrated its effectiveness.
Trends of Fiber-Optic Microcellular Radio Communication Networks
Shozo KOMAKI Eiichi OGAWA

INVITED PAPER-System Applications

Vol:
E79-C No:1
Page(s):
98-104
Exploitation of air interfaces for mobile communications is rapidly increasing because of diversified service demands, technology trends and radio propagation conditions. This paper summarizes the radio and optic interaction devices and systems that can solve the future problems resulting from spreading demands in mobile multimedia communications. The concept of the Virtual Free Space Network (Radio Highway Network) is proposed for universal mobile access networks that can support any mobile service or radio air-interface. As one example of the proposed network, the optical TDMA network for radio is analyzed and results of some theoretical calculations are shown.
Recognition of Machine Printed Arabic Characters and Numerals Based on MCR
AbdelMalek B.C. ZIDOURI Supoj CHINVEERAPHAN Makoto SATO

PAPER

Vol:
E78-D No:12
Page(s):
1649-1655
In this paper we describa a system for Off-line Recognition of Arabic characters and Numerals. This is based on expressing the machine printed Arabic alpha-numerical text in terms of strokes obtained by MCR (Minimum Covering Run) expression. The strokes are rendered meaningful by a labeling process. They are used to detect the baseline and to provide necessary features for recognition. The features selected proved to be effective to the extent that with simple right to left analysis we could achieve interesting results. The recognition is achieved by matching to reference prototypes designed for the 28 Arabic characters and 10 numerals. The recognition rate is 97%.
Efficient Algorithms for Real-Time Octree Motion
Yoshifumi KITAMURA Andrew SMITH Fumio KISHINO

PAPER

Vol:
E78-D No:12
Page(s):
1573-1580
This paper presents efficient algorithms for updating moving octrees with real-time performance. The first algorithm works for octrees undergoing both translation and rotation motion; it works efficiently by compacting source octrees into a smaller set of cubes (not necessarily standard octree cubes) as a precomputation step, and by using a fast, exact cube/cube intersection test between source octree cubas and target octree cubes. A parallel version of the algorithm is also described. Finally, the paper presents an efficient algorithm for the more limited case of octree translation only. Experimental results are given to show the efficiency of the algorithms in comparison to competing algorithms. In addition to being fast, the algorithms presented are also space efficient in that they can produce target octrees in the linear octree representation.
622 Mbps 8 mW CMOS Low-Voltage Interface Circuit
Takashi TOMITA Koichi YOKOMIZO Takao HIRAKOSO Kazukiyo HAGA Kuniharu HIROSE

PAPER

Vol:
E78-C No:12
Page(s):
1726-1732
This paper describes ALINX (Advanced Low-voltage Interface Circuit System), a low-power and high-speed interface circuit of submicron CMOS LSI for digital information and telecommunications systems. Differential and single-ended ALINXs are low-voltage swing I/O interface circuits with less than 1.0 V swing from a 1.2 V supply. Specifically, the differential ALINX features a pair of complementary NMOS push-pull drivers operating from a 1.2 V supply, reducing power consumption compared to conventional high-speed interface circuits operating from a 5 V or 3.3 V supply. The DC power consumption is approximately 11% of ECL. We observed 622 Mbps differential transmission with 8 mW power consumption and single-ended transmission at 311 Mbps with 14 mW with a PN23 pseudo-random pattern. We also describe a noise characteristic and ALINX applications to high-speed data buses and LSI for telecommunications systems. A time/space switch LSI with 0.9 W total power consumption was fabricated by 0.5 µm CMOS process technology. This chip can use a plastic QFP.
A Circuit Library for Low Power and High Speed Digital Signal Processor
Hiroshi TAKAHASHI Shigeshi ABIKO Shintaro MIZUSHIMA Yuni OZAWA

PAPER

Vol:
E78-C No:12
Page(s):
1717-1725
A new high performance digital signal processor (DSP) that lowers power consumption, reduces chip count, and enables system cost savings for wireless communications applications was developed. The new device contains high performance, hard-wired functionality with a specialized instruction set to effectively implement the worldwide digital cellular standard algorithms, including GSM, PDC and NADC, and also features both full rate and future half rate processing by software modules. The device provides a wider operating voltage ranging from 1.5 V to 5.5 V using 5 V process based on the market requirement of 5 V supply voltage, even though a power supply voltage in most applications will be shifted to 3 V. Several circuits was newly developed to achieve low power consumption and high speed operation at both 5 V and 3 V process using the same data base. The device also features over 50 MIPS of processing power with low power consumption and 100 nA stand-by current at either 3 V or 5 V. One remarkable advantage is a flexible CPU core approach for the future spin-off devices with different ROM/RAM configurations and peripheral modules without requiring any CPU design changes. This paper describes the architecture of a lower power and high speed design with effective hardware and software modules implementations.
Three-Level Broad-Edge Template Matching and Its Application to Real-Time Vision System
Kazuhiko SUMI Manabu HASHIMOTO Haruhisa OKUDA Shin'ichi KURODA

PAPER

Vol:
E78-D No:12
Page(s):
1526-1532
This paper presents a new internal image representation, in which the scene is encoded into a three-intensity-level image. This representation is generated by Laplacian-Gaussian filtering followed by dual-thresholding. We refer to this imege as three-level broad-edge representation. It supresses the high frequency noise and shading in the image and encodes the sign of relative intensity of a pixel compared with surrounding region. Image model search based on cross correlation using this representation is as reliable as the one based on gray normalized correlation, while it reduces the computational cost by 50 times. We examined the reliability and realtime performance of this method when it is applied to an industrial object recognition task. Our prototype system achieves 3232 image model search from the 128128 pixel area in 2 milli-seconds with a 9 MHz pixel clock image processor. This speed is fast enough for searching and tracking a single object at video frame rate.
A Low-Power and High-Speed Impulse-Transmission CMOS Interface Circuit
Masafumi NOGAWA Yusuke OHTOMO Masayuki INO

PAPER

Vol:
E78-C No:12
Page(s):
1733-1737
A new low-power and high-speed CMOS interface circuit is proposed in which signals are transmitted by means of impulse voltage. This mode of transmission is called impulse transmission. Although a termination resistor is used for impedance matching, the current through the output transistors and the termination resistor flows only in transient states and no current flows in stable states. The output buffer and the termination resistor dissipate power only in transient states, so their power dissipation is reduced to 30% that of conventional low-voltage-swing CMOS interface circuits at 160 MHz. The circuit was fabricated by 0.5 µm CMOS technology and was evaluated at a supply voltage of 3.3 V. Experimental results confirm low power of 4.8 mW at 160 MHz and high-speed 870 Mb/s error free point-to-point transmission.
Parameter Insensitive Disturbance-Rejection Problem with Incomplete-State Feedback
Naohisa OTSUKA Hiroshi INABA Kazuo TORAICHI

PAPER-Systems and Control

Vol:
E78-A No:11
Page(s):
1589-1594
The disturbance-rejection problem is to find a feedback control law for linear control systems such that the influence of disturbances is completely rejected from the output. In 1970 Wonham and Morse first studied this problem in the framework of the so-called geometric approach. On the other hand, in 1985 Ghosh studied parameter insensitive disturbance-rejection problems with state feedback and with dynamic compensator. In this paper we study the parameter insensitive disturbance-rejection problem with static incomplete-state feedback for linear multivariable systems in the framework of the geometric approach from the mathematical point of view. Necessary conditions and/or sufficient conditions for this problem to be solvable are presented. Finally an illustrative example is presented.

3741-3760hit(4073hit)

Keyword Search Result

[Keyword] EE(4073hit)

Speech Enhancement Using Microphone Array with Multi-Stage Processing

Minimization of Multiple-Valued Logic Expressions with Kleenean Coefficients

Improved CELP-Based Coding in a Noisy Environment Using a Trained Sparse Conjugate Codebook

A Current-Mode Bit-Block Circuit Applicable to Low-Voltage, Low-Power Pipeline Video-Speed A/D Converters

Projective Image Representation and Its Application to Image Compression

A Proposal of Five-Degree-of-Freedom 3D Nonverbal Voice Interface

Message Transfer Algorithms on the Recursive Diagonal Torus

Message Forwarding Delay Analysis for Error Control of Data Transmission on ATM Network

The Performance Prediction on Sentence Recognition Using a Finite State Word Automaton

Continuous Speech Recognition Using a Combination of Syntactic Constraints and Dependency Relationships

A short-Span Optical Feeder for Wireless Personal Communication Systems Using Multimode Fibers

Bayesian Performance Estimation Driven by Performance Monitoring and Its Application

Trends of Fiber-Optic Microcellular Radio Communication Networks

Recognition of Machine Printed Arabic Characters and Numerals Based on MCR

Efficient Algorithms for Real-Time Octree Motion

622 Mbps 8 mW CMOS Low-Voltage Interface Circuit

A Circuit Library for Low Power and High Speed Digital Signal Processor

Three-Level Broad-Edge Template Matching and Its Application to Real-Time Vision System

A Low-Power and High-Speed Impulse-Transmission CMOS Interface Circuit

Parameter Insensitive Disturbance-Rejection Problem with Incomplete-State Feedback

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles