Junpei YAMAUCHI Tetsuya SHIMAMURA
This paper presents an improved spectral subtraction method for speech enhancement. A new noise estimation method is derived in which the noise is assumed to be white. By using the property that a white noise spectrum is flat, high frequency components of a noisy speech spectrum are averaged and the standard deviation of the noise is estimated. This operation is performed in the analysis segment, thus the spectral subtraction method combined with the new noise estimation method does not need non-speech segments and as a result can adapt to non-stationary noise conditions. The effectiveness of the proposed spectral subtraction method is confirmed by experiments.
We have developed and demonstrated a novel technique for electrical inspection and electrical failure analysis, which can detect open, high-resistance, and short circuits without the need for electrical contact with the outside of the LSI chip or the board on which the LSI chip is mounted. The basic idea of the technique is the detection of the magnetic field produced by OBIC (optical beam induced current) or photo current. A DC-SQUID (superconducting quantum interference device) magnetometer is used to detect the magnetic field. This scanning laser-SQUID microscopy ("laser-SQUID" for short) has a spatial resolution of about 1.3 µm. It can be used to distinguish defective chips before bonding pad patterning or after bonding without pin-selection. It can localize any defective site in the chip to within a few square microns.
Roman SOBOLEWSKI Ying XU Xuemei ZHENG Carlo WILLIAMS Jin ZHANG Aleksandr VEREVKIN Galina CHULKOVA Alexander KORNEEV Andrey LIPATOV Oleg OKUNEV Konstantin SMIRNOV Gregory N. GOL'TSMAN
We report our studies on the spectral sensitivity of superconducting NbN thin-film single-photon detectors (SPD's) capable of GHz counting rates of visible and near-infrared photons. In particular, it has been shown that a NbN SPD is sensitive to 1.55-µm wavelength radiation and can be used for quantum communication. Our SPD's exhibit experimentally measured intrinsic quantum efficiencies from 20% at 800 nm up to 1% at 1.55-µm wavelength. The devices demonstrate picosecond response time (<100 ps, limited by our readout system) and negligibly low dark counts. Spectral dependencies of photon counting of continuous-wave, 0.4-µm to 3.5-µm radiation, and 0.63-µm, 1.33-µm, and 1.55-µm laser-pulsed radiations are presented for the single-stripe-type and meander-type devices.
A numerical technique based on Haar wavelets is used for solving transient problems of transmission lines. The approach of our method is to convert the original coupled partial differential equations, the transmission line equations or the telegrapher equations, to a system of ordinary matrix differential equations via Haar wavelets. Then, transient problems of transmission lines can be solved by matrix operations. Numerical examples of homogeneous and dispersive lines, along with both linear and nonlinear loads are verified. In addition, non-sinusoidal signals such as the unit step function and the rectangular pulse for digital applications are included to demonstrate the use of this efficient, easy-to-handle, stable, and versatile method.
In this paper, we describe the recent trend in automatic speech recognition. First, we should point out that the current art of speech recognition by machines is admittedly inferior to the ability of human beings. In particular, we assert that the improvement of acoustic models is necessary. Second, we describe robust feature parameters for noisy environments, which are important in practical usage. Then, we indicate that much training data in the same environment as the recognition stage are useful from the viewpoints of information theory and pattern recognition. Third, we discuss acoustic models and language models which are central issues in speech recognition techniques. Then the principle and limitations of the hidden Markov model (HMM) and recent extended models are discussed. The role of language models is to eliminate improbable candidate words, that is, to reduce the search space. In other words, language models having smaller entropy are preferable. From this standpoint, we survey stochastic language models. Finally, we state some points which deserve attention when constructing speech recognition systems.
Martin STEINBAUER Huseyin OZCELIK Helmut HOFSTETTER Christoph F. MECKLENBRAUKER Ernst BONEK
This contribution discusses which information can be derived from estimated directions of arrival (DOAs) and directions of departure (DODs) from a multiple-input multiple-output (MIMO) radio system, and establishes two new parameters describing the multipath spread at both link ends. We find that the multipath component separation, MCS, combines delay, (double-) angular and Doppler dispersion, as appropriate. MCS provides a system-independent radio characterization of propagation environments and aids in selecting optimum positions for smart-antenna deployment. Evaluation of double-directional measurements (antenna arrays at both link ends) in indoor environments show the usefulness and the limits of the multipath component separation concept.
Akira HIRANO Masaki ASOBE Kenji SATO Yutaka MIYAMOTO Kazushige YONENAGA Hiroshi MIYAZAWA Makoto ABE Hidehiko TAKARA Ippei SHAKE
We achieved a dispersion tolerance of 25-ps/nm at 80-Gbit/s using novel carrier-suppressed return-to-zero (CS-RZ) coding realized by duty ratio and optical multiplexing phase control. We also show that the dispersion tolerance strongly depends on the relative optical phase difference between adjacent time slots, and demonstrate 80-Gbit/s 60-km DSF transmission without dispersion compensation by using a newly-fabricated stable 80-Gbit/s OTDM transmitter.
Kazunori MUKASA Takeshi YAGI Kunio KOKURA
A novel optical transmission line consisted of fibers characterized by positive and negative medial dispersion of NZ-DSF and SMF was designed and fabricated. Both P-MDF and N-MDF have achieved the medial dispersion and low non-linearity simultaneously. Total characteristics were confirmed to be suitable for the future high-bit-rate transmission.
Kenji SATO Shoichiro KUWAHARA Yutaka MIYAMOTO Koichi MURATA Hiroshi MIYAZAWA
Phase-inversion between neighboring pulses appearing in carrier-suppressed return-to-zero pulses is effective in reducing the signal distortion due to chromatic dispersion and nonlinear effects. A generation method of the anti-phase pulses at 40 GHz is demonstrated by using semiconductor mode-locked lasers integrated with chirped gratings. Operation principle and pulse characteristics are described. Suppression of pulse distortion due to fiber dispersion is confirmed for generated anti-phase pulses. Repeaterless 150-km dispersion-shifted-fiber L-band transmission at 42.7 Gbit/s is demonstrated by using the pulse source.
Hitoshi MURAI Hiromi T. YAMADA Kozo FUJII
The initial phase alternation of RZ pulses having duty cycle beyond 50% in dispersion-managed-link is found to help stabilize DM solitons transmissions. The stable soliton propagation of such wide RZ pulses should ease the difficulties designing soliton-based DWDM systems due to less spectral occupancy/channel. For the proof of concept, 40 Gbit/s WDM transmissions are numerically investigated and the initial phase alternation improved the transmission distance by the factor of 2 in the soliton-soliton interaction limited regime. The advantage of this concept has also been verified by conducting 40 Gbit/s single and 8 channels WDM transmission experiments using OTDM techniques with initial phase alternation.
Hiroyuki EHARA Koji YOSHIDA Kazutoshi YASUNAGA Toshiyuki MORII
This paper presents a high quality 4-kbit/s speech coding algorithm based on a CELP algorithm. The coder operates on speech frames of 20 ms. The algorithm has following four main features: multiple sub-codebooks, backward adaptive mode switching, dispersed-pulse structure, and noise post-processing. The multiple sub-codebooks consist of a pulse-codebook and a random-codebook so that they can handle both signals, noise-like (e.g. unvoiced, stationary noise) and pulse-like (e.g. voiced). The backward adaptive mode switching is performed using decoded parameters; therefore, no additional mode bit is transmitted. The random-codebook size is switched with the backward adaptively selected mode. The subjective quality of unvoiced speech or noise-like signal can be improved by this switching operation because the random-codebook size is greatly increased in such signal mode. The dispersed-pulse structure provides better performance of sparse pulse excitation using dispersed pulses instead of simple unit pulses. The noise post-processing employs a stationary background noise generator for producing stationary noise signal. It significantly improves subjective quality of decoded signal under various background noise conditions. Subjective listening tests are conducted in accordance with ACR and DCR tests. The ACR test results indicate that the fundamental performance of the MDP-CELP is equivalent to that of 32-kbit/s adaptive differential pulse code modulation (ADPCM). The DCR test results show that the performance of the MDP-CELP is equivalent to or better than that of 8-kbit/s conjugate-structure algebraic code excited linear prediction (CS-ACELP) under several background noise conditions.
Itsuro MORITA Keiji TANAKA Noboru EDAGAWA Masatoshi SUZUKI
The effectiveness of Aeff enlarged positive dispersion fiber (EE-PDF) and hybrid amplification configuration with erbium-doped fiber amplifier (EDFA) and fiber Raman amplifier for reducing the fiber nonlinearity and improving the transmission performance in long distance 40 Gbit/s-based WDM transmission was investigated. We have confirmed that the use of EE-PDF in modified dispersion map for 40 Gbit/s transmission is quite effective to increase the transmissible distance and have successfully demonstrated 16 40 Gbit/s WDM transmission over 2000 km with proper dispersion management. We have also confirmed that the use of distributed Raman amplification is quite effective to extend the repeater spacing. By adding the optimum Raman amplification, almost the same transmission performance was obtained with a doubled repeater spacing in long distance 40 Gbit/s-based WDM transmission.
This paper proposes constructive timing-violation (CTV) and evaluates its potential. It can be utilized both for increasing clock frequency and for reducing energy consumption. Increasing clock frequency over that determined by the critical paths causes timing violations. On the other hand, while supply voltage reduction can result in substantial power savings, it also causes larger gate delay and thus clock must be slow down in order not to violate timing constraints of critical paths. However, if any tolerant mechanisms are provided for the timing violations, it is not necessary to keep the constraints. Rather, the violations would be constructive for high clock frequency or for energy savings. From these observations, we propose the CTV, which is supported by the tolerant mechanism based on contemporary speculative execution mechanisms. We evaluate the CTV using a cycle-by-cycle simulator and present its considerably promising potential.
Hiroki OOI Tomoo TAKAHARA George ISHIKAWA Shinichi WAKANA Yuichi KAWAHATA Hideki ISONO Nobuaki MITAMURA
We demonstrated variable dispersion compensation using the Virtually Imaged Phased Array (VIPA) for a 40-Gbit/s dense-WDM transmission system. The large tunable range from -800 to +800 ps/nm in the entire C-band wavelength range and the high tunable resolution of 1 ps/nm was achieved by using a 3-dimensional mirror equipped with a stepping motor that we developed. We adopted the dispersion monitor of 40-GHz intensity extracted from the received 40-Gbit/s baseband signals, and verified that this dispersion monitoring method is applicable to nonlinear transmission by detecting the monitor peak. Using the VIPA variable dispersion compensator and the dispersion monitor, we demonstrated 1.28-Tbit/s (40-Gbit/s 32 ch) automatic dispersion compensation. As a result, we confirmed that only two VIPAs and one fixed dispersion compensating fiber (DCF) are needed to make our method applicable to the entire C-band for dense WDM 40-Gbit/s systems having a large transmission range of 80 km.
Feng GAO Huijuan ZHAO Yukari TANIKAWA Yukio YAMADA
Generalized Pulse Spectrum Technique (GPST) is a method to solve the inverse problems of wave-propagation and diffusion-dominated phenomena, and therefore has been popularly applied in image reconstruction of time-resolved diffuse optical tomography. With a standard GPST for simultaneous reconstruction of absorption and scattering coefficients, the products of the gradients of the Green's function and the photon-density flux, based on the photon-diffusion equation, are required to calculate the diffusion-related Jacobian matrix. The adversities are of two-folds: time-consuming and singular in the field near the source. The latter causes a severe insensitivity of the algorithm to the scattering changes deep inside tissue. To cope with the above difficulties, we propose in this paper a modified GPST algorithm that only involves the Green's function and the photon-density flux themselves in the scattering-related matrix. Our simulated and experimental reconstructions show that the modified algorithm can significantly improve the quality of scattering image and accelerate the reconstruction process, without an evident degradation in absorption image.
Fu-Kun CHEN Jar-Ferr YANG Yu-Pin LIN
For multimedia communications, the computational scalability of a multimedia codec is required to match with different working platforms and integrated services of media sources. In this paper, two condensed stochastic codebook search approaches are proposed to progressively reduce the computation required for the algebraic code excited linear predictive (ACELP) and multi-pulse maximum likelihood quantization (MP-MLQ) coders. By reducing the candidates of the codebook before search procedure, the proposed methods can effectively diminish the computation required for the ITU-T G.723.1 dual rate speech coder. Simulation results show that the proposed methods can save over 50 percent for the stochastic codebook search with perceptually intangible degradation in speech quality.
Sang-Wook PARK Seung-Kyun RYU Dae-Hee YOUN
A new objective speech quality measure, Bark Coherence Function is presented. The Coherence Function was used for evaluating the non-linear distortion of low-to-medium rate speech coders. However, it is not well suited for quality estimation in modern speech transmission, especially, CDMA mobile communication system. In the proposed method, Coherence Function is newly defined in psycho-acoustic domain as the cognition module of perceptual speech quality measure and evaluates the perceptual non-linear distortion of mobile system. The experimental results showed that the proposed method has good performance over CDMA PCS and digital cellular system.
Shin-ichi WAKABAYASHI Hitomi MORIYA Asako BABA Yoshinori TAKEUCHI
We have developed optical encoding devices for processing femtosecond pulses. These devices are based on spectral separation devices and light modulators with fiber gratings. Experiments were made to encode a light pulse in the spectral domain. These experiments utilize the characteristics that a femtosecond light pulse has a very broad spectrum. An input femtosecond light pulse is decomposed into a series of wavelength components. Each wavelength component with narrow spectra <1 nm width is successfully extracted into a single mode fiber. Light modulators corresponding to wavelength components are assigned to the 1st bit, the 2nd bit, the 3rd bit,
Ryo HARAGUCHI Naozo SUGIMOTO Shigeru EIHO Yoshio ISHIDA
This paper deals with a method of registration and superimposition of a coronary arterial tree on a myocardial SPECT (Single Photon Emission Computed Tomography) image. We can grasp the myocardial function more easily in connection with the shape of the coronary arterial tree. The superimposed image is easily obtainable through some manual pointing on coronary angiograms (CAG) followed by an automatic matching method: First, a rough shape model of left ventricle is estimated by using SPECT data. This 3-D left ventricular model is projected on a pair of bi-plane CAG images. We can obtain two 2-D coronary images on bull's eye map by scanning the left ventricular surface projected on CAG. By maximizing a matching degree between two 2-D coronary images, registration between CAG and SPECT is performed. Finally the superimposed image is obtained by integrating two 2-D coronary images and bull's eye image of SPECT. We validated our method by numerical experiments with artificial data set and also applied it to two clinical data sets.
Miho TAKAHASHI Tomokazu NAGAO Yoshiharu IMAZEKI Kazuki MATSUZAKI Haruyuki MINAMITANI
This study attempts to demonstrate that activated leukocytes are involved in vascular shut down effect (VSD) in photodynamic therapy (PDT). Hydrogen peroxide (H2O2), a reactive oxygen specie (ROS) that is found in monocytes, was visualized under a confocal laser scanning microscope, and ROS formation was quantified by fluorescence image analysis. The fluorescence intensity was expressed as a gray level graded from 0 to 255. Only the fluorescence derived from monocytes that had ZnCP-III incorporated and were irradiated with an HeNe laser caused increases in the fluorescence distribution over time, while no change of distribution was observed in three other conditions (only Zn CP-III added, only HeNe laser irradiation, or non-treated). The result indicates that the photochemical reaction induced by excitation of a photosensitizer, and ROS was derived from the reaction-stimulated monocytes. The activated monocytes generated ROS themselves and H2O2 was visualized by the DCFH fluorescence method. In conclusion, the result clearly shows that activated monocytes are involved in the VSD effect.