IEICE global.ieice.org Site

Keyword Search Result

[Keyword] Ti(30728hit)

30201-30220hit(30728hit)

A Unification-Based Japanese Parser for Speech-to-Speech Translation
Masaaki NAGATA Tsuyoshi MORIMOTO

PAPER

Vol: E76-D No:1, Page(s): 51-61
A unification-based Japanese parser has been implemented for an experimental Japanese-to-English spoken language translation system (SL-TRANS). The parser consists of a unification-based spoken-style Japanese grammar and an active chart parser. The grammar handles the syntactic, semantic, and pragmatic constraints in an integrated fashion using an HPSG-based framework in order to cope with speech recognition errors. The parser takes multiple sentential candidates from the HMM-LR speech recognizer, and produces a semantic representation associated with the best scoring parse based on acoustic and linguistic plausibility. The unification-based parser has been tested using 12 dialogues in the conference registration domain, which include 261 sentences uttered by one male speaker. The sentence recognition accuracy of the underlying speech recognizer is 73.6% for the top candidate, and 83.5% for the top three candidates, where the test-set perplexity of the CFG grammar is 65. By ruling out erroneous speech recognition results using various linguistic constraints, the parser improves the sentence recognition accuracy up to 81.6% for the top candidate, and 85.8% for the top three candidates. From the experimental results, we found that the combination of syntactic restriction, selectional restriction and coordinate structure restriction can provide a sufficient restriction to rule out the recognition errors between case-marking particles with the same vowel, which are the type of errors most likely to occur. However, we also found that it is necessary to use pragmatic information, such as topic, presupposition, and discourse structure, to rule out the recognition errors involved with topicalizing particles and sentence final particles.
Phrase Recognition in Conversational Speech Using Prosodic and Phonemic Information
Shigeki OKAWA Takashi ENDO Tetsunori KOBAYASHI Katsuhiko SHIRAI

PAPER

Vol: E76-D No:1, Page(s): 44-50
In this paper, a new scheme for phrase recognition in conversational speech is proposed, in which prosodic and phonemic information processing are usefully combined. This approach is employed both to produce candidates of phrase boundaries and to discriminate phonemes. The fundamental frequency patterns of continuous utterances are statistically analyzed and the likelihood of the occurrence of a phrase boundary is calculated for every frame. At the same time, the likelihood of phonemic characteristics of each frame can be obtained using a hierarchical clustering method. These two scores, along with lexical and grammatical constraints, can be effectively utilized to develop possible word sequences or word lattices which correspond to the continuous speech utterances. Our preliminary experiment shows the feasibility of applying prosody to continuous speech recognition, especially for conversational style utterances.
Photonic LSI--Merging the Optical Technology into LSI--
Yoshihiko MIZUSHIMA

INVITED PAPER-Key Paper

Vol: E76-C No:1, Page(s): 4-12
The future trends of optical technologies combined with LSI are reviewed. Present problems of LSI, and the possible solutions to these problems through the merger of optical technology into LSI are discussed. One of the present trends in interconnection between LSI components is the time-serial approach, originally developed for optical communication. This method is capable of high speed data transfer. The other is a space-parallel approach, arising from the two-dimensional nature of light propagation. This approach has the capability of performing parallel processing. A hybrid OEIC, possibly on GaAs, is discussed as an example of future photonic LSI. The lack of key devices is a fundamental barrier to the future improvement of photonic LSI. Direct interaction between photons and electrons is a promising approach. Some of the author's ideas to promote the merger of photonics and LSI are proposed.
LR Parsing with a Category Reachability Test Applied to Speech Recognition
Kenji KITA Tsuyoshi MORIMOTO Shigeki SAGAYAMA

PAPER

Vol: E76-D No:1, Page(s): 23-28
In this paper, we propose an extended LR parsing algorithm, called LR parsing with a category reachability test (the LR-CRT algorithm). The LR-CRT algorithm enables a parser to efficiently recognize those sentences that belong to a specified grammatical category. The key point of the algorithm is to use an augmented LR parsing table in which each action entry contains a set of reachable categories. When executing a shift or reduce action, the parser checks whether the action can reach a given category using the augmented table. We apply the LR-CRT algorithm to improve a speech recognition system based on two-level LR parsing. This system uses two kinds of grammars, inter- and intra-phrase grammars, to recognize Japanese sentential speech. Two-level LR parsing guides the search of speech recognition through two-level symbol prediction, phrase category prediction and phone prediction, based on these grammars. The LR-CRT algorithm makes efficient phone prediction possible based on the phrase category prediction. The system was evaluated using sentential speech data uttered phrase by phrase, and attained a word accuracy of 97.5% and a sentence accuracy of 91.2%.
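The table lookup described in this abstract can be pictured with a small sketch. Everything in it is an assumption made for illustration: the table layout, the state numbers, the phone symbols, and the category names are invented, and the abstract does not say how the reachable-category sets are actually built from the grammar.

```python
from typing import Dict, FrozenSet, List, Optional, Tuple

# Hypothetical augmented LR table: (state, next symbol) -> (action, reachable categories).
# A real table would be generated from the grammar; these entries are made up.
AugmentedTable = Dict[Tuple[int, str], Tuple[str, FrozenSet[str]]]

TABLE: AugmentedTable = {
    (0, "k"): ("shift 1", frozenset({"NP", "VP"})),
    (0, "t"): ("shift 2", frozenset({"VP"})),
}

def allowed_action(table: AugmentedTable, state: int, symbol: str,
                   target: str) -> Optional[str]:
    """Return the action only if it can still lead to the target category."""
    entry = table.get((state, symbol))
    if entry is None:
        return None
    action, reachable = entry
    return action if target in reachable else None

def predict_symbols(table: AugmentedTable, state: int, target: str) -> List[str]:
    """List the next symbols whose actions can reach the target category."""
    return sorted(sym for (st, sym), (_, reach) in table.items()
                  if st == state and target in reach)
```

With this toy table, asking for symbols that can reach the category "NP" from state 0 keeps only "k", which is the kind of pruning that lets a predicted phrase category narrow the phone prediction.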
How Might One Comfortably Converse with a Machine ?
Yasuhisa NIIMI

INVITED PAPER

Vol: E76-D No:1, Page(s): 9-16
Progress in speech recognition based on the hidden Markov model has made it possible to realize man-machine dialogue systems capable of operating in real time. In spite of considerable effort, however, few systems have been successfully developed because of the lack of appropriate dialogue models. This paper reports on some of the technology necessary to develop a dialogue system with which one can converse comfortably. The emphasis is placed on the following three points: how a human converses with a machine; how errors of speech recognition can be recovered through conversation; and what it means for a machine to be cooperative. We examine the first problem by investigating dialogues between human speakers, and dialogues between a human speaker and a simulated machine. As a consideration in the design of dialogue control, we discuss the relation between efficiency and cooperativeness of dialogue, the method for confirming what the machine has recognized, and dynamic adaptation of the machine. Thirdly, we review the research on the friendliness of a natural language interface, mainly concerning the exchange of initiative, corrective and suggestive answers, and indirect questions. Lastly, we describe briefly the current state of the art in speech recognition and synthesis, and suggest what should be done for acceptance of spontaneous speech and production of a voice suitable to the output of a dialogue system.
Practical Consequences of the Discrepancy between Zero-Knowledge Protocols and Their Parallel Execution
Kouichi SAKURAI Toshiya ITOH

PAPER

Vol: E76-A No:1, Page(s): 14-22
In this paper, we investigate the discrepancy between a serial version and a parallel version of zero-knowledge protocols, and clarify the information "leaked" in the parallel version, which is not zero-knowledge unlike the case of the serial version. We consider two sides: one negative and the other positive in the parallel version of zero-knowledge protocols, especially of the Fiat-Shamir scheme.
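For orientation, the difference between the serial and parallel versions is one of message ordering, which a toy sketch of the textbook Fiat-Shamir identification round can show. The modulus, the secret, and the function names below are assumptions chosen for illustration (the numbers are far too small to be secure), and the sketch does not model the information leakage analyzed in the paper.

```python
import secrets

# Toy parameters (assumptions for illustration only; not secure).
N = 61 * 53          # public modulus, product of two secret primes
S = 123              # prover's secret, coprime to N
V = pow(S, 2, N)     # public value v = s^2 mod N

def commit(n):
    """Prover picks a random r and sends x = r^2 mod n."""
    r = secrets.randbelow(n - 2) + 2
    return r, pow(r, 2, n)

def respond(n, r, s, e):
    """Prover answers challenge bit e with y = r * s^e mod n."""
    return (r * pow(s, e, n)) % n

def verify(n, v, x, e, y):
    """Verifier accepts iff y^2 == x * v^e (mod n)."""
    return pow(y, 2, n) == (x * pow(v, e, n)) % n

def run_serial(n, s, v, rounds):
    """Serial version: each round finishes before the next one starts."""
    for _ in range(rounds):
        r, x = commit(n)
        e = secrets.randbelow(2)          # challenge chosen after seeing x
        if not verify(n, v, x, e, respond(n, r, s, e)):
            return False
    return True

def run_parallel(n, s, v, rounds):
    """Parallel version: all commitments, then all challenges, then all responses."""
    commits = [commit(n) for _ in range(rounds)]
    challenges = [secrets.randbelow(2) for _ in range(rounds)]
    responses = [respond(n, r, s, e) for (r, _), e in zip(commits, challenges)]
    return all(verify(n, v, x, e, y)
               for (_, x), e, y in zip(commits, challenges, responses))
```

An honest prover passes both versions. In the parallel version the verifier sees every commitment before choosing any challenge, which is the structural change behind the discrepancy the abstract refers to.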
A Real-Time Speech Dialogue System Using Spontaneous Speech Understanding
Yoichi TAKEBAYASHI Hiroyuki TSUBOI Hiroshi KANAZAWA Yoichi SADAMOTO Hideki HASHIMOTO Hideaki SHINCHI

PAPER

Vol: E76-D No:1, Page(s): 112-120
This paper describes a task-oriented speech dialogue system based on spontaneous speech understanding and response generation (TOSBURG). The system has been developed for a fast food ordering task using speaker-independent keyword-based spontaneous speech understanding. To understand the user's intention from spontaneous speech, the system consists of a noise-robust keyword-spotter, a semantic keyword lattice parser, a user-initiated dialogue manager and a multimodal response generator. After noise-immune keyword spotting is performed, the spotted keyword candidates are analyzed by a keyword lattice parser to extract the semantic content of the input speech. Then, referring to the dialogue history and context, the dialogue manager interprets the semantic content of the input speech. In cases where the interpretation is ambiguous or uncertain, the dialogue manager invites the user to confirm verbally the system's understanding of the speech input. The system's response to the user throughout the dialogue is multimodal; that is, several modes of communication (synthesized speech, text, animated facial expressions and ordered food items) are used to convey the system's state to the user. The object here is to emulate the multimodal interaction that occurs between humans, and so achieve more natural and efficient human-computer interaction. The real-time dialogue system has been constructed using two general purpose workstations and four DSP accelerators (520MFLOPS). Experimental results have shown the effectiveness of the newly developed speech dialogue system.
Methods to Securely Realize Caller-Authenticated and Callee-Specified Telephone Calls
Tomoyuki ASANO Tsutomu MATSUMOTO Hideki IMAI

PAPER

Vol: E76-A No:1, Page(s): 88-95
This paper presents two methods for securely realizing caller-authenticated and callee-specified calls over telecommunication networks with terminals that accept IC cards having KPS-based cryptographic functions. In the proposed protocols, users can verify that the partner is the proper owner of a certain ID or a certain pen name. Users' privacy is protected even if they make caller-authenticated and callee-specified calls and do not pay their telephone charges in advance.
Application of Photoexcited Reaction to VLSI Process
Yasuhiro HORIIKE

INVITED PAPER-Opto-Electronics Technology for LSIs

Vol: E76-C No:1, Page(s): 32-40
Recent progress in Japan on applications of photoexcited processes to the fabrication of VLSI and flat panel devices is reviewed. The excimer laser melt technique makes it possible to form large-grain poly-Si film on a glass substrate, improving TFT electrical characteristics, and to fill metals into high-aspect-ratio contact holes in VLSI metallization. Scanning of a CW laser in poly-Si film led to growth of a single-crystal Si layer on SiO2 to fabricate 3-D (three-dimensional) devices successfully. Direct writing with pyrolytic reaction was put into practice for interconnection restructuring. In the photochemical process, lower temperature epitaxial growth of Si and dry cleaning of a Si wafer employing Hg lamp irradiation were noted. Directional etching was performed by sidewall film formation, while resolution of better than 0.5 µm was difficult to obtain due to the diffraction limit. It was proposed that higher resolution would be obtained by introduction of a nonlinear process which enhanced pattern contrast.
The Effect of Varying Routing Probability in Two Parallel Queues with Dynamic Routing under a Threshold-Type Scheduling
Ivo J. B. F. ADAN Jaap WESSELS W. Henk M. ZIJM

LETTER-Communication Networks and Service

Vol: E76-B No:1, Page(s): 29-31
In the paper entitled "The effect of varying routing probability in two parallel queues with dynamic routing under a threshold-type scheduling", Kojima et al. derive an expression in the form of a product of powers for the state probabilities of a threshold-type shortest queue problem. In this note it is demonstrated that this expression is essentially more complicated and has the form of an infinite sum of products of powers. In fact, Kojima et al. find the first term in this infinite sum only.
How to Strengthen DES-like Cryptosystems against Differential Cryptanalysis
Kenji KOYAMA Routo TERADA

PAPER

Vol: E76-A No:1, Page(s): 63-69
We propose a new randomized version of DES in which a key-dependent swapping is used to strengthen DES and DES-like cryptosystems against differential cryptanalysis. This new scheme, called RDES, decreases the probability of success in differential attack by decreasing the characteristic probability. The characteristic is the effect of particular differences in plaintext pairs on the differences in the resultant ciphertext pairs. The characteristic probability for the n-round RDES is 2^(-n+1) times that for the n-round DES. As for the differential cryptanalysis based on characteristics, the 16-round RDES is about as strong as a 20-round DES. Encryption/decryption speed of n-round RDES is almost the same as that of the n-round DES.
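As a small worked example of the stated ratio, reading the factor in the abstract as 2 raised to the power (-n+1), the reduction for a given round count can be computed exactly. The function name is hypothetical and the sketch only evaluates the ratio; it says nothing about how the key-dependent swapping itself works.

```python
from fractions import Fraction

def rdes_characteristic_factor(n_rounds: int) -> Fraction:
    """Ratio of n-round RDES to n-round DES characteristic probability,
    assuming the factor 2^(-n+1) quoted in the abstract."""
    if n_rounds < 1:
        raise ValueError("n_rounds must be at least 1")
    return Fraction(1, 2 ** (n_rounds - 1))

# For the 16-round case the factor is 2^(-15), i.e. 1/32768.
factor_16 = rdes_characteristic_factor(16)
```

Under this reading, a 1-round cipher sees no reduction (factor 1), and each additional round halves the ratio.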
A Complementary Optical Interconnection for Inter-Chip Networks
Hideto FURUYAMA Masaru NAKAMURA

PAPER-Integration of Opto-Electronics and LSI Technologies

Vol: E76-C No:1, Page(s): 112-117
A new optical interconnection system suitable for high-speed ICs using a novel complementary optical interconnection technique has been developed. This system uses paired light sources and photodetectors for optical complementary operation, and greatly lowers the power consumption compared with conventional systems. Analyses and experimental results indicate that this system can operate in the gigabit range, and reduces power consumption to less than 20% of that in conventional systems at 1 Gb/s.
Three Different LR Parsing Algorithms for Phoneme-Context-Dependent HMM-Based Continuous Speech Recognition
Akito NAGAI Shigeki SAGAYAMA Kenji KITA Hideaki KIKUCHI

PAPER

Vol: E76-D No:1, Page(s): 29-37
This paper discusses three approaches for combining an efficient LR parser and phoneme-context-dependent HMMs and compares them through continuous speech recognition experiments. In continuous speech recognition, phoneme-context-dependent allophonic models are considered very helpful for enhancing the recognition accuracy. They precisely represent allophonic variations caused by the difference in phoneme-contexts. With grammatical constraints based on a context free grammar (CFG), a generalized LR parser is one of the most efficient parsing algorithms for speech recognition. Therefore, the combination of allophonic models and a generalized LR parser is a powerful scheme enabling accurate and efficient speech recognition. In this paper, three phoneme-context-dependent LR parsing algorithms are proposed, which make it possible to drive allophonic HMMs. The algorithms are outlined as follows: (1) Algorithm for predicting the phonemic context dynamically in the LR parser using a phoneme-context-independent LR table. (2) Algorithm for converting an LR table into a phoneme-context-dependent LR table. (3) Algorithm for converting a CFG into a phoneme-context-dependent CFG. This paper also includes discussion of the results of recognition experiments, and a comparison of performance and efficiency of these three algorithms.
Optical Semiconductor Devices for Interconnection Approach from Optical Transmission Scheme
Hajime IMAI

INVITED PAPER-Integration of Opto-Electronics and LSI Technologies

Vol: E76-C No:1, Page(s): 100-105
Optical interconnection is a rapidly expanding field of optical signal transmission, but it places some stringent requirements on optical devices. This paper introduces the current device characteristics of lasers and photodiodes and discusses the possibility of intra/inter wafer optical interconnection.
On the Complexity of Composite Numbers
Toshiya ITOH Kenji HORIKAWA

PAPER

Vol: E76-A No:1, Page(s): 23-30
Given an integer N, it is easy to determine whether or not N is prime, because a set of primes is in LPP. Then given a composite number N, is it easy to determine whether or not N is of a specified form? In this paper, we consider a subset of odd composite numbers +1MOD4 (resp. +3MOD4), which is a subset of odd composite numbers consisting of prime factors congruent to 1 (resp. 3) modulo 4, and show that (1) there exists a four move (blackbox simulation) perfect ZKIP for the complement of +1MOD4 without any unproven assumption; (2) there exists a five move (blackbox simulation) perfect ZKIP for +1MOD4 without any unproven assumption; (3) there exists a four move (blackbox simulation) perfect ZKIP for +3MOD4 without any unproven assumption; and (4) there exists a five move (blackbox simulation) statistical ZKIP for the complement of +3MOD4 without any unproven assumption. To the best of our knowledge, these are the first results for a language L that seems to be not random self-reducible but has a constant move blackbox simulation perfect or statistical ZKIP for L and without any unproven assumption.
Real-Time Feed-Forward Control LSIs for a Direct Wafer Exposure Electron Beam System
Hironori YAMAUCHI Tetsuo MOROSAWA Takashi WATANABE Atsushi IWATA Tsutomu HOSAKA

PAPER-Integrated Electronics

Vol: E76-C No:1, Page(s): 124-135
Three custom LSIs for EB60, a direct wafer exposure electron beam system, have been developed using 0.8 µm BiCMOS and SST bipolar technologies. The three LSIs are i) a shot cycle control LSI for controlling each exposure cycle time, ii) a linear matrix computation LSI for coordinate modification of the exposure pattern data, and iii) a position calculation LSI for determining the precise position of the wafer. These LSIs allow the deflection corrector block of the revised EB60 to be realized on a single board. A new adaptive pipeline control technique which optimizes each shot period according to the exposure data is implemented in the shot cycle control LSI. The position calculation LSI implements a new, highly effective 2-level pipeline exposure technique; the levels refer to major-field deflection and minor-field deflection. The linear matrix computation LSI is designed not only for the EB60 but also for a wide variety of parallel digital processing applications.
Sub-Half Micron Exposure System with Optimized Illumination
Akiyoshi SUZUKI Miyoko NOGUCHI

INVITED PAPER-Opto-Electronics Technology for LSIs

Vol: E76-C No:1, Page(s): 13-18
A new illumination principle for photolithography is investigated. As optical microlithography approaches its limit, it becomes apparent that simple extrapolation of the present technology is not sufficient for future demands. This paper introduces a new imaging technology that overcomes this boundary. First, the basic imaging formulae are analyzed and the illumination light is classified into 4 cases. The 3-beam case and the 2-beam case carry the object information, and the comparison of these 2 cases is carried out theoretically. It can be shown that the 2-beam case has greater depth of focus than the 3-beam case, though it has inferior contrast at the best focus. Since this degradation, however, has little effect, the enlargement of the depth of focus can be achieved. In reality, 2-dimensional imaging must be considered. The quadrupole effect can be deduced from the results of the analysis. It shows great improvement in the depth of focus near the resolution limit. As it can be applied to conventional masks, it can be a promising candidate for future lithography. Experimental results are also shown to demonstrate the analysis.
A Linguistic Procedure for an Extension Number Guidance System
Naomi INOUE Izuru NOGAITO Masahiko TAKAHASHI

PAPER

Vol: E76-D No:1, Page(s): 106-111
This paper describes the linguistic procedure of our speech dialogue system. The procedure is composed of two processes: syntactic analysis using a finite state network, and discourse analysis using a plan recognition model. The finite state network is compiled from a regular grammar. The regular grammar is written to accept sentences with various styles, for example ellipsis and inversion. The regular grammar is automatically generated from the skeleton of the grammar. The discourse analysis module understands the utterance, generates the next question for users and also predicts words which will be in the next utterance. For an extension number guidance task, we obtained correct recognition results for 93% of input sentences without word prediction and for 98% if prediction results include proper words.
On the Complexity of Constant Round ZKIP of Possession of Knowledge
Toshiya ITOH Kouichi SAKURAI

PAPER

Vol: E76-A No:1, Page(s): 31-39
In this paper, we investigate the round complexity of zero-knowledge interactive proof systems of possession of knowledge, and mainly show that if a relation R has a three move blackbox simulation zero-knowledge interactive proof system of possession of knowledge, then there exists a probabilistic polynomial time algorithm that on input x ∈ {0,1}*, outputs y such that (x,y) ∈ R with overwhelming probability if x ∈ dom R, and outputs "⊥" with probability 1 if x ∉ dom R. The result above cannot be generalized to zero-knowledge interactive proof systems of possession of knowledge with more than four moves, because it is known that there exists a "four" move blackbox simulation perfect zero-knowledge interactive proof system of possession of knowledge for a nontrivial relation R.
The Sibling Intractable Function Family (SIFF): Notion, Construction and Applications
Yuliang ZHENG Thomas HARDJONO Josef PIEPRZYK

PAPER

Vol: E76-A No:1, Page(s): 4-13
This paper presents a new concept in cryptography called the sibling intractable function family (SIFF) which has the property that given a set of initial strings colliding with one another, it is computationally infeasible to find another string that would collide with the initial strings. The various concepts behind SIFF are presented together with a construction of SIFF from any one-way function. Applications of SIFF to many practical problems are also discussed. These include the hierarchical access control problem which is a long-standing open problem induced by a paper of Akl and Taylor about ten years ago, the shared mail box problem, access control in distributed systems and the multiple message authentication problem.
