Masahiro SERIZAWA Kazunori OZAWA
This paper proposes a new pitch prediction method for 4 kbps CELP (Code Excited LPC) speech coding with a 20 msec frame, aimed at the future ITU-T 4 kbps speech coding standardization. In conventional CELP speech coding, synthetic speech quality deteriorates rapidly at 4 kbps, especially for female and children's speech with a short pitch period, because the pitch prediction performance is significantly degraded for such speech. The main reason is that when the pitch period is shorter than the subframe length, the adaptive codebook operation usually carries out a simple repetition of the past excitation signal based on the estimated lag rather than true pitch prediction. The proposed method carries out pitch prediction without this approximation by utilizing the current subframe excitation codevector signal when the pitch prediction parameters are determined. To further improve the performance, a split vector synthesis and perceptual spectral weighting method and a low-complexity perceptual harmonic and spectral weighting method have also been developed. An informal listening test shows that the 4 kbps speech coder with a 20 msec frame, utilizing all of the proposed improvements, scores 0.2 MOS higher than the coder without them.
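The conventional approximation criticized in this abstract can be illustrated with a minimal sketch. It shows only the simple repetition of past excitation when the lag is shorter than the subframe, not the proposed method. The function name `adaptive_codevector` and its arguments are hypothetical and chosen for illustration; gain scaling and fractional lags are omitted.

```python
def adaptive_codevector(past_excitation, lag, subframe_len):
    """Build an adaptive-codebook vector by copying excitation from `lag` samples back.

    Illustrative assumption: integer lag, unit gain. When lag < subframe_len,
    the copy runs into samples generated within the current subframe, so the
    last `lag` past samples are simply repeated.
    """
    if lag <= 0 or lag > len(past_excitation):
        raise ValueError("lag must be in 1..len(past_excitation)")
    buf = list(past_excitation)
    out = []
    for _ in range(subframe_len):
        sample = buf[len(buf) - lag]  # value exactly `lag` samples in the past
        out.append(sample)
        buf.append(sample)            # later copies may read this repeated sample
    return out


if __name__ == "__main__":
    # Lag 2 is shorter than the 5-sample subframe: the last two samples repeat.
    print(adaptive_codevector([1, 2, 3, 4, 5], lag=2, subframe_len=5))
```

With a lag at least as long as the subframe, the same routine reduces to a plain copy of past excitation, which is why the degradation is specific to short pitch periods.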
Seiichi NAKAGAWA Li ZHAO Hideyuki SUZUKI
One of the most effective methods in speech recognition is the HMM, which has been used to model speech statistically. The discrete distribution and the continuous distribution HMMs have been widely used in various applications. In recent years, however, HMMs with various output probability functions have been proposed to further improve recognition performance, e.g. the Gaussian mixture continuous and the semi-continuous distributed HMMs. We have also recently proposed the RBF (radial basis function)-based HMM and the VQ-distortion based HMM, which use an RBF function and a VQ-distortion measure at each state instead of the output probability density function used by traditional HMMs. In this paper, we describe the RBF-based HMM and the VQ-distortion based HMM and compare their recognition rates with those of the discrete distributed, the Gaussian mixture distributed, and the semi-continuous distributed HMMs through experiments on speaker-independent spoken digit recognition. Our results confirmed that the RBF-based and VQ-distortion based HMMs are more robust than, and superior to, traditional HMMs.
A method of tone recognition has been developed for dissyllabic speech of Standard Chinese based on discrete hidden Markov modeling. As for the feature parameters for recognition, a combination of macroscopic and microscopic parameters of fundamental frequency contours was shown to give a better result than the isolated use of each parameter. Speaker normalization was realized by introducing an offset to the fundamental frequency. In order to avoid recognition errors due to syllable segmentation, a scheme of concatenated learning was adopted for training the hidden Markov models. Based on observations of the fundamental frequency contours of dissyllables, a scheme was introduced in which a contour is represented by a series of three syllabic tone models: two for the first and second syllables and one for the transition part around the syllabic boundary. Because of the voiceless consonant of the second syllable, the fundamental frequency contour of a dissyllable may include a part without fundamental frequencies. This part was linearly interpolated in the current method. To prove the validity of the proposed method, it was compared with other methods, such as representing all of the dissyllabic contours as the concatenation of two models, assigning a special code to the voiceless part, and so on. Tone sandhi was also taken into account by introducing two additional models for the half-third tone and for the first 4th tone of the combination of two 4th tones. With the proposed method, an average recognition rate of 96% was achieved for 5 male and 5 female speakers.
Chang-Sheng YANG Hideki KASUYA
Three-dimensional vocal tract shapes of a male, a female, and a child subject are measured from magnetic resonance (MR) images during sustained phonation of the Japanese vowels /a, i, u, e, o/. Non-uniform dimensional differences in the vocal tract shapes of the subjects are quantitatively measured. Vocal tract area functions of the female and child subjects are normalized to those of the male on the basis of non-uniform and uniform scalings of the vocal tract length and compared with each other. A comparison is also made between the formant frequencies computed from the area functions normalized by the two different scalings. The comparisons suggest that non-uniformity in the vocal tract dimensions is not essential in the normalization of the five Japanese vowels.
Shizuo MIZUSHINA Hiroyuki OHBA Katsumi ABE Shinya MIZOSHIRI Toshifumi SUGIURA
Microwave radiometry has been investigated for non-invasive measurement of temperature in the human body. Recent trends are to explore the capability of retrieving a temperature profile or map from a set of brightness temperatures measured by a multifrequency radiometer operating in the 1-6 GHz range. The retrieval of temperature from the multifrequency measurement data is formulated as an inverse problem in which the number of independent measurements or data is limited (7) and the data suffer from considerably large random fluctuations. The standard deviation of the data fluctuation is given by the brightness temperature resolution of the instrument (0.04-0.1 K). Solutions are prone to instabilities and large errors unless proper solution methods are used. Solution methods developed during the last few years are reviewed: singular system analysis, bio-heat transfer solution matched with radiometric data, and model-fitting combined with a Monte Carlo technique. Typical results obtained by these methods are presented to indicate a cross-section of the present state of development in the field. This review concludes with discussions on the radiometric weighting function, which connects physical temperatures in the object to the brightness temperature. Three-dimensional weighting functions derived by the modal analysis and the FDTD method for a rectangular waveguide antenna coupled to a four-layered lossy medium are discussed. Development of temperature retrieval procedures incorporating the 3-D weighting functions is an important and challenging task for future work in this field.
Masaki KYOSO Toshiaki TAKANE Akihiko UCHIYAMA
To make medical telemetry systems more reliable in severe electromagnetic environments, we applied spread spectrum communication to ECG data transmission. Spread spectrum communication systems have shown superior performance to other systems, especially with respect to anti-jamming, which allows them to share a frequency band with current telemetry systems. In this study, we show the characteristics of a spread spectrum transmitter when it is used in the same frequency band as a narrow-band transmitter. The result shows that the spread spectrum telemetry system can use the same frequency band permitted for medical telemetry systems.
In this paper we propose a congestion control method, operating at the DQDB gateway, for interconnecting connectionless MANs with ATM networks. Since connectionless traffic is loss-sensitive data, it should experience a small cell loss rate. Congestion control in the gateway prevents network overload, which would cause not only serious cell loss at the remote destination gateway but also many undesirable retransmissions and time delays in the ATM network. The method requires neither modifying the DQDB slot format nor increasing the overhead, so the implementation is simple and cost effective. Performance results are also provided to verify its effectiveness.
Hisaya HADAMA Ken-ichi SATO Ikuo TOKIZAWA
This paper presents a newly developed analytical method which evaluates the virtual path bandwidth control effects for a general topology ATM (Asynchronous Transfer Mode) transport network. The virtual path concept can enhance the controllability of path bandwidth. The link capacity required to attain a specified call blocking probability can be reduced by applying virtual path bandwidth control. This paper proposes an analytical method to evaluate the call blocking probability of a general topology ATM network that includes many virtual paths and uses virtual path bandwidth control. A method for designing the link capacities of the network is also proposed. These methods make it possible to design an optimum transport network with path bandwidth control. Finally, analytical results on the effects of dynamic path bandwidth control, obtained with a newly developed approximation technique, are provided to demonstrate its effectiveness.
Yoichi HIGASHIYAMA Hiromu ARIYOSHI Miro KRAETZL
The previous literature on consecutive k-out-of-r-from-n: F systems gives recursive equations for the system reliability only for the special case when all component probabilities are equal. This paper deals with the problem of calculating the reliability of a (linear or circular) consecutive 2-out-of-r-from-n: F system with unequal component probabilities. We provide two new algorithms, for the linear and circular systems, which have time complexities of O(n) and O(nr), respectively. The results of some computational experiments are also described.
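For readers unfamiliar with the system model, the sketch below computes the reliability by exhaustive enumeration. It is a brute-force reference for small n under the usual definition (the system fails when at least k of some r consecutive components fail), not the O(n) or O(nr) algorithms of the paper. The function names and the requirement r <= n are assumptions made for illustration.

```python
from itertools import product


def system_works(states, k, r, circular=False):
    """Return True unless some window of r consecutive components has >= k failures.

    states[i] is True when component i works.
    """
    n = len(states)
    starts = range(n) if circular else range(n - r + 1)
    for s in starts:
        failed = sum(1 for j in range(r) if not states[(s + j) % n])
        if failed >= k:
            return False
    return True


def reliability(p, k, r, circular=False):
    """Sum the probabilities of all working states; p[i] is the working probability of component i.

    Exponential in len(p); intended only as a check for small systems.
    """
    n = len(p)
    if not 1 <= r <= n:
        raise ValueError("this sketch assumes 1 <= r <= n")
    total = 0.0
    for states in product([True, False], repeat=n):
        prob = 1.0
        for works, p_i in zip(states, p):
            prob *= p_i if works else 1.0 - p_i
        if system_works(states, k, r, circular):
            total += prob
    return total


if __name__ == "__main__":
    p = [0.9, 0.8, 0.7]  # unequal component probabilities
    print(reliability(p, k=2, r=2))
    print(reliability(p, k=2, r=2, circular=True))
```

Because the enumeration visits all 2^n component states, it is useful only for validating a faster algorithm on small instances.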
Hiroshi NOGAMI Gordon L. STÜBER
A new upper bound on the error probability for maximum likelihood sequence estimation of digital signaling on intersymbol interference channels with additive white Gaussian noise is presented. The basic idea is to exclude all parallel error sequences and to exclude some of the overlapping error events from the union bound. It is shown that the new upper bound can be easily and efficiently computed by using a properly labeled error-state diagram and a one-directional stack algorithm. Several examples are presented that compare the new upper bound with bounds previously reported in the literature.
Toshiro WATANABE Shinji HAYASHI
We propose an objective measure for assessing low-rate coded speech. The model for this objective measure, in which several known features of the perceptual processing of speech sounds by the human ear are emulated, is based on the Hertz-to-Bark transformation, critical-band filtering with preemphasis to boost higher frequencies, nonlinear conversion for subjective loudness, and temporal (forward) masking. The effectiveness of the measure, called the Bark spectral distortion rating (BSDR), was validated by second-order polynomial regression analysis between the computed BSDR values and subjective MOS ratings obtained for a large number of utterances coded by several versions of CELP coders and one VSELP coder under three degradation conditions: input speech levels, transmission error rates, and background noise levels. The BSDR values correspond better to MOS ratings than several commonly used measures. Thus, BSDR can be used to accurately predict subjective scores.
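The first stage named above, the Hertz-to-Bark transformation, can be sketched as follows. The abstract does not say which formula the authors use, so this example assumes the widely cited Zwicker and Terhardt approximation; the function name `hz_to_bark` is hypothetical.

```python
import math


def hz_to_bark(freq_hz):
    """Map frequency in Hz to critical-band rate in Bark.

    Assumed formula (Zwicker and Terhardt approximation):
        z = 13 * atan(0.00076 * f) + 3.5 * atan((f / 7500) ** 2)
    """
    if freq_hz < 0:
        raise ValueError("frequency must be non-negative")
    return 13.0 * math.atan(0.00076 * freq_hz) + 3.5 * math.atan((freq_hz / 7500.0) ** 2)


if __name__ == "__main__":
    for f in (100, 500, 1000, 2000, 4000):
        print(f, round(hz_to_bark(f), 2))
```

The mapping is roughly linear at low frequencies and compressive at high frequencies, which is why spectral distortion measured on the Bark scale weights low-frequency errors more heavily than a linear-frequency measure would.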
Katsumi FURUYA Lira HAMADA Koichi ITO Haruo KASAI
We have developed a new muscle-equivalent phantom which is composed of water, polyethylene powder, NaCl, agar and so on. This phantom can maintain its shape by itself and it is easy to make and handle. The complex permittivity of the phantom is very similar to that of the actual muscle tissue at UHF. The specific heat and thermal conductivity are suitable for the thermographic method.
Akira ADACHI Ken'ichi OKAJIMA Youichi TAKADA Saburo TANAKA Hideo ITOZAKI Haruhisa TOYODA Hisashi KADO
This study shows that using the direct offset integration technique (DOIT) and additional positive feedback (APF) in a high-Tc dc superconducting quantum interference device (SQUID) improves the effective flux-to-voltage transfer function and reduces the flux noise of a magnetometer, thus improving the magnetic field noise. The effective flux-to-voltage transfer function and the flux noise with APF were measured at different values of the positive feedback parameter βa, which depends on the resistance of the APF circuit. These quantities were also compared between conditions with and without APF. This investigation showed that there is a value of βa that minimizes the flux noise of a magnetometer with APF, namely βa=0.77. The effective flux-to-voltage transfer function with APF is about three times what it is without APF (93 µV/Φ0 vs. 32 µV/Φ0). The magnetic field noise of a magnetometer with APF is improved by a factor of about 3 (242 fT/Hz vs. 738 fT/Hz).
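The two "about three times" figures quoted above follow directly from the reported values, as this short check shows. The variable names are illustrative; the numbers are taken from the abstract as printed.

```python
# Values as reported in the abstract.
transfer_with_apf = 93.0     # effective flux-to-voltage transfer with APF, µV/Φ0
transfer_without_apf = 32.0  # effective flux-to-voltage transfer without APF, µV/Φ0
noise_with_apf = 242.0       # magnetic field noise with APF, units as printed
noise_without_apf = 738.0    # magnetic field noise without APF, units as printed

transfer_ratio = transfer_with_apf / transfer_without_apf
noise_improvement = noise_without_apf / noise_with_apf

print(round(transfer_ratio, 2))     # transfer function gain factor
print(round(noise_improvement, 2))  # noise reduction factor
```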
Eriko TAKEDA Toshikazu NISHINO Kazumasa TAKAGI
The relationship between the flux-trapping phenomenon and the device structure of a SQUID has been studied using three types of SQUIDs: a SQUID with a guard-ring, a SQUID with a moat, and a SQUID without these structures. The change in the voltage-flux characteristics of the SQUIDs due to flux-trapping is measured. For the measurements, flux-trapping is accelerated by applying a magnetic field during cooling of the SQUIDs. The measured results show that the SQUID with the guard-ring and that with the moat can reject the external magnetic field more effectively than the SQUID without these structures. The reason for the difference in the rejection of the external magnetic field is thought to be the existence of superconducting closed loops. However, flux-trapping occurs more easily in the SQUID with the guard-ring and that with the moat than in the SQUID without these structures when they are cooled under a finite magnetic field. Therefore, the moat structure and the guard-ring structure need higher-grade magnetic shielding for practical use.
Hiroto KAWAKAMI Yutaka MIYAMOTO Tomoyoshi KATAOKA Kazuo HAGIMOTO
This paper discusses an all-optical tank circuit that uses the comb-shaped gain spectrum generated by a Brillouin amplifier. The theory of timing clock extraction is shown for two cases: with two gains and with three gains. In both cases, the waveform of the extracted timing clock is simulated. According to the simulation, unlike an ordinary tank circuit, the amplitude of the extracted clock is not constant even though the quality factor (Q) is infinite. The extracted clock is clearly influenced by the pattern of the original data stream if the Brillouin gain is finite. The ratio of the maximum extracted clock amplitude to the minimum extracted amplitude is calculated as a function of Brillouin gain. The detuning of the pump light frequency is also discussed. It induces not only changes in the Brillouin gain but also a phase shift in the amplified light. The relation between the frequency drift of the pump lights and the jitter of the extracted timing clock is shown for both cases, with two pump lights and with three pump lights. It is numerically shown that when all the pump lights have the same frequency drift, i.e., their frequency separation is constant, the phase of the extracted clock is not influenced by the frequency drift of the pump lights. The operation principle is demonstrated at 5 Gbit/s, 2.5 Gbit/s, and 2 Gbit/s using two pumping techniques. The quality factor and the suppression ratio in the baseband domain are measured. Q and the suppression ratio are found to be 160 and 28 dB, respectively.
Shin'ichi SATOH Hiroshi MO Masao SAKAUCHI
This letter presents a new method to efficiently extract closed loops as primitive symbols in line drawings. Our method uses a graph search technique for efficiency and exhaustiveness, and also incorporates feasibility criteria for symbols. Experiments clearly demonstrated the method's effectiveness.
Chang-Yu SUN Qi-Hu LI Takashi SOMA
This paper introduces a noise cancelling sonar-ranging system based on the adaptive filtering technique, which can automatically adapt itself to changes in the environmental noise field and improve passive sonar-ranging/goniometric precision. The software and hardware design principles of the system, which uses high speed VLSI (Very Large Scale Integration) DSP (Digital Signal Processing) chips, and the practical test results are also presented. In comparison with the traditional ranging system, the system not only markedly improves the ranging precision but also offers further advantages such as simple structure, rapid operation, large data-storage volume, easy programming, high reliability and so on.
In this study, a ring of simple chaotic circuits coupled by inductors is investigated. An extremely simple three-dimensional autonomous circuit is used as the chaotic subcircuit. Circuit experiments and computer calculations for the cases of two, three, and four subcircuits confirm that various synchronization phenomena of chaos are stably generated. In the three-subcircuit case, two different synchronization modes coexist, namely an in-phase synchronization mode and a three-phase synchronization mode. By investigating Poincar
Hisashi KADO Gen UEHARA Hisanao OGATA Hideo ITOZAKI
This paper describes a SQUID magnetometer and the measurement of small signals. It also describes the current state of SQUID technology developed in the SSL project.
Jianming LU Takashi YAHAGI Jianting CAO
This letter presents a new estimation algorithm for ARMAX systems which do not always satisfy the strictly positive real (SPR) condition. We show how the estimated parameters can converge to their true values based on the overparameterized system. Finally, the results of computer simulation are presented to illustrate the effectiveness of the proposed method.