IEICE global.ieice.org Site

Author Search Result

[Author] Akio OGIHARA(24hit)

1-20hit(24hit)

An Improvement of the Pseudoinverse Rule with Diagonal Elements
Hiroshi UEDA Masaya OHTA Akio OGIHARA Kunio FUKUNAGA

PAPER-Neural Networks

Vol:
E77-A No:6
Page(s):
1007-1014
A pseudoinverse rule, one of major rule to determine a weight matrix for associative memory, has large capacity comparing with other determining rules. However, it is wellknown that the rule has small domains of attraction of memory vectors on account of many spurious states. In this paper, we try to improve the problem by means of subtracting a constant from all diagonal elements of a weight matrix. By this method, many spurious states disappear and eigenvectors with negative eigenvalues are introduced for the orthocomplement of the subspace spanned by memory vectors. This method can be applied to two types of networks: binary network and analog network. Some computer simulations are performed for both two models. The results of the simulations show our improvement is effective to extend error correcting ability for both networks.
An Autocorrelation Associative Neural Network with Self-Feedbacks
Hiroshi UEDA Masaya OHTA Akio OGIHARA Kunio FUKUNAGA

LETTER

Vol:
E76-A No:12
Page(s):
2072-2075
In this article, the autocorrelation associative neural network that is one of well-known applications of neural networks is improved to extend its capacity and error correcting ability. Our approach of the improvement is based on the consideration that negative self-feedbacks remove spurious states. Therefore, we propose a method to determine the self-feedbacks as small as possible within the range that all stored patterns are stable. A state transition rule that enables to escape oscillation is also presented because the method has a possibility of falling into oscillation. The efficiency of the method is confirmed by means of some computer simulations.
A Continuous Speech Recognition Algorithm Utilizing Island-Driven A^* Search
Yoshikazu YAMAGUCHI Akio OGIHARA Yasuhisa HAYASHI Nobuyuki TAKASU Kunio FUKUNAGA

LETTER

Vol:
E76-A No:7
Page(s):
1184-1186
We propose a continuous speech recognition algorithm utilizing island-driven A* search. Conventional left-to-right A* search is probable to lose the optimal solution from a finite stack if some obscurities appear at the start of an input speech. Proposed island-driven A* search proceeds searching forward and backward from the clearest part of an input speech, and thus can avoid to lose the optimal solution from a finite stack.
Switched-Capacitor Neural Networks and Their Application to Character Recognition
Yoichiro ANZAI Koichi MATSUMOTO Shojiro YONEDA Akio OGIHARA

PAPER-Neural Networks

Vol:
E73-E No:12
Page(s):
1932-1939
Recently, the hardware realizations of the neural networks for specially-purposed-use have been in focus. In this paper, two kinds of networks, a two-layer network and the Boltzmann machine, using the switched-capacitor circuit are proposed. The variable synaptic weights of neural circuit are realized by through the programmable capacitor array (PCA) in the switched-capacitor variable-coefficients multiplier. As a result, the recognition system of the handwritten character using a two-layer neural network is constructed by the discrete electronic elements and its desirable effects are shown by the experimental results. The stochastic operation in the processing element (PE) of the Boltzmann machine is realized by using the generation of noise voltage with the random number and is also confirmed by teh experimental results using the discrete electronic elements. Furthermore, the operations of the PE have been also confirmed by using the simulation of Traveling-Salesman Problem.
An Isolated Word Speech Recognition Based on Fusion of Visual and Auditory Information Usisng 30-frame/s and 24-bit Color Image
Akio OGIHARA Shinobu ASAO

PAPER

Vol:
E80-A No:8
Page(s):
1417-1422
In the field of speech recognition, many researchers have proposed speech recognition methods using auditory information like acoustic signal or visual information like shape and motion of lips. Auditory information has valid features for speech recognition, but it is difficult to accomplish speech recognition in noisy environment. On the other side, visual information has advantage to accomplish speech recognition in noisy environment, but it is difficult to extract effective features for speech recognition. Thus, in case of using either auditory information or visual information, it is difficult to accomplish speech recognition perfectly. In this paper, we propose a method to fuse auditory information and visual information in order to realize more accurate speech recognition. The proposed method consists of two processes: (1) two probabilities for auditory information and visual information are calculated by HMM, (2) these probabilities are fused by using linear combination. We have performed speech recognition experiments of isolated words, whose auditory information (22.05kHz sampling, 8-bit quantization) and visual information (30-frame/s sampling, 24-bit quantization) are captured with multi-media personal computer, and have confirmed the validity of the proposed method.
An Analysis on Minimum Searching Principle of Chaotic Neural Network
Masaya OHTA Kazumichi MATSUMIYA Akio OGIHARA Shinobu TAKAMATSU Kunio FUKUNAGA

PAPER

Vol:
E79-A No:3
Page(s):
363-369
This article analyzes dynamics of the chaotic neural network and minimum searching principle of this network. First it is indicated that the dynamics of the chaotic newral network is described like a gradient decent, and the chaotic neural network can roughly find out a local minimum point of a quadratic function using its attractor. Secondly It is guaranteed that the vertex corresponding a local minimum point derived from the chaotic neural network has a lower value of the objective function. Then it is confirmed that the chaotic neural network can escape an invalid local minimum and find out a reasonable one.
Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Framse Color Image
Satoru IGAWA Akio OGIHARA Akira SHINTANI Shinobu TAKAMATSU

LETTER

Vol:
E79-A No:11
Page(s):
1836-1840
We propose a method to fuse auditory information and visual information for accurate speech recognition. This method fuses two kinds of information by using Iinear combination after calculating two kinds of probabilities by HMM for each word. In addition, we use full-frame color image as visual information in order to improve the accuracy of the proposed speech recognition system. We have performed experiments comparing the proposed method with the method using either auditory information or visual information, and confirmed the validity of the proposed method.
Sound Quality Evaluation for Audio Watermarking Based on Phase Shift Keying Using BCH Code
Harumi MURATA Akio OGIHARA Masaki UESAKA

LETTER

Vol:
E98-D No:1
Page(s):
89-94
Yajima et al. proposed a method based on amplitude and phase coding of audio signals. This method has relatively high sound quality because human auditory property is considered for embedding. However, in this method, the tolerance to attacks tends to be weak. Hence, we propose a high-tolerance watermarking method using BCH code which is one of error correcting code. This paper evaluates whether our method preserves the sound quality while ensuring high tolerance.
Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information
Akira SHINTANI Akio OGIHARA Yoshikazu YAMAGUCHI Yasuhisa HAYASHI Kunio FUKUNAGA

LETTER

Vol:
E77-A No:11
Page(s):
1875-1878
We propose two methods to fuse auditory information and visual information for accurate sppech recognition. The first method fuses two kinds of information by using linear combination after calculating two kinds of probabilities by HMM for each word. The second method fuses two kinds of information by using the histogram which expresses the correlation of them. We have performed experiments comparing the proposed methods with the conventional method and confirmed the validity of the proposed methods.
Recognition of Isolated Digits Using Fuzzy Matrix Quantization
Satoshi KONDO Akio OGIHARA Shojiro YONEDA

PAPER-Speech and Image Processing

Vol:
E74-A No:10
Page(s):
3034-3040
This study proposes fuzzy matrix quantization (FMQ) which is a new coding technique developed to obtain discrete symbols employed for hidden Markov models (HMM's). FMQ is a coding technique combining fuzzy vector quantization with matrix quantization. The validity of FMQ is evaluated by a speaker-independent isolated word recognition task. First, the effect of FMQ is examined when FMQ is applied to the training phase and/or recognition phase. The effects of number of training data, codebook size and codeword matrix size for recognition accuracy are investigated. And the results of the speech recognition based on HMM recognizer using FMQ technique is compared with HMM recognizers using conventional quantization methods, vector quantization and fuzzy vector quantization. As a result, FMQ is the effective coding technique for isolated word recognition on condition that codebook size is large, above all, when FMQ is applied to the training phase and training data set is small.
Correlation-Based Video Watermarking Method Using Inter-Frame Similarity
Motoo YAMAMOTO Akira SHIOZAKI Motoi IWATA Akio OGIHARA

PAPER-Information Hiding

Vol:
E89-A No:1
Page(s):
186-193
This paper presents a correlation-based watermarking method for video using the similarity of adjacent frames. In general, the adjacent frames of a video sequence is very similar. In the proposed scheme, we use an adjoining frame in detection process instead of an original image in the watermarking scheme of Cox et al. So the proposed method does not need an original video sequence in detection process. When a watermarked video sequence is attacked by overwriting copy or frame dropping, the pair of the frames that is not adjoining in an original video sequence is used in detection process. However, since a watermark is embedded in a part of each frame and embedding positions are different for each frame in the proposed method, we can detect the watermark even from an overwriting-copied video sequence and a frame-dropped video sequence. Experimental results show that the proposed method is robust against overwriting copy and frame dropping. Moreover, it is shown from experimental results that the method has robustness to low bitrate MPEG compression and StirMark attack.
Tolerance Evaluation of Audio Watermarking Method Based on Modification of Sound Pressure Level between Channels
Harumi MURATA Akio OGIHARA Shigetoshi HAYASHI

LETTER

Pubricized:
2017/10/16
Vol:
E101-D No:1
Page(s):
68-71
We have proposed an audio watermarking method based on modification of sound pressure level between channels. This method is focused on the invariability of sound localization against sound processing like MP3 and the imperceptibility about slightly change of sound localization. In this paper, we investigate about tolerance evaluation against various attacks in reference to IHC criteria.
Asymmetric Neural Network and Its Application to Knapsack Problem
Akira YAMAMOTO Masaya OHTA Hiroshi UEDA Akio OGIHARA Kunio FUKUNAGA

PAPER-Neural Networks

Vol:
E78-A No:3
Page(s):
300-305
We propose an asymmetric neural network which can solve inequality-constrained combinatorial optimization problems that are difficult to solve using symmetric neural networks. In this article, a knapsack problem that is one of such the problem is solved using the proposed network. Additionally, we study condition for obtaining a valid solution. In computer simulations, we show that the condition is correct and that the proposed network produces better solutions than the simple greedy algorithm.
A Neural Network with a Function of lnhibiting Subtours on TSP
Akira YAMAMOTO Masaya OHTA Hiroshi UEDA Akio OGIHARA Kunio FUKUNAGA

LETTER

Vol:
E76-A No:12
Page(s):
2068-2071
The Traveling Salesman Problem (TSP) can be solved by a neural network using the coding scheme based on the adjacency of city in the tour. Using this coding scheme, the neural network generates a better solution than that using other coding schemes. We, however, often get the invalid solution consisting of some subtours. In this article, we propose a method of eliminating subtours using additional neurons. On the computer simulation it is shown that we get the optimum solution by means of taking only O(n2) additional neurons and trials.
Binary Neural Network with Negative Self-Feedback and Its Application to N-Queens Problem
Masaya OHTA Akio OGIHARA Kunio FUKUNAGA

PAPER-Network Synthesis

Vol:
E77-D No:4
Page(s):
459-465
This article deals with the binary neural network with negative self-feedback connections as a method for solving combinational optimization problems. Although the binary neural network has a high convergence speed, it hardly searches out the optimum solution, because the neuron is selected randomly at each state update. In thie article, an improvement using the negative self-feedback is proposed. First it is shown that the negative self-feedback can make some local minimums be unstable. Second a selection rule is proposed and its property is analyzed in detail. In the binary neural network with negative self-feedback, this selection rule is effective to escape a local minimum. In order to comfirm the effectiveness of this selection rule, some computer simulations are carried out for the N-Queens problem. For N=256, the network is not caught in any local minimum and provides the optimum solution within 2654 steps (about 10 minutes).
A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information
Naoshi DOI Akira SHINTANI Yasuhisa HAYASHI Akio OGIHARA Shinobu TAKAMATSU

LETTER

Vol:
E78-A No:11
Page(s):
1548-1552
Recently, some speech recognition methods using fusion of visual and auditory information have been researched. In this paper, a study on the mouth shape image suitable for fusion of visual and auditory information has been described. Features of mouth shape which are extracted from gray level image and binary image are adopted, and speech recognition using linear combination method has been performed. From results of speech recognition, the studies on the mouth shape features which are effective in fusion of visual and auditory information have been performed. And the effectiveness of using two kinds of mouth shape features also has been confirmed.
Associative Neural Network Models Based on a Measure of Manhattan Length
Hiroshi UEDA Yoichiro ANZAI Masaya OHTA Shojiro YONEDA Akio OGIHARA

PAPER

Vol:
E76-A No:3
Page(s):
277-283
In this paper, two models for associative memory based on a measure of manhattan length are proposed. First, we propose the two-layered model which has an advantage to its implementation by using PDN. We also refer to the way to improve the recalling ability of this model against noisy input patterns. Secondly, we propose the other model which always recalls the nearest memory pattern in a measure of manhattan length by lateral inhibition. Even if a noise of input pattern is so large that the first model can not recall, this model can recall correctly against such a noisy pattern. We also confirm the performance of the two models by computer simulations.
A Theoretical Analysis of Neural Networks with Nonzero Diagonal Elements
Masaya OHTA Yoichiro ANZAI Shojiro YONEDA Akio OGIHARA

PAPER

Vol:
E76-A No:3
Page(s):
284-291
This article analyzes the property of the fully interconnected neural networks as a method of solving combinatorial optimization problems in general. In particular, in order to escape local minimums in this model, we analyze theoretically the relation between the diagonal elements of the connection matrix and the stability of the networks. It is shown that the position of the global minimum point of the energy function on the hyper sphere in n dimensional space is given by the eigen vector corresponding the maximum eigen value of the connection matrix. Then it is shown that the diagonal elements of the connection matrix can be improved without loss of generality. The equilibrium points of the improved networks are classified according to their properties, and their stability is investigated. In order to show that the change of the diagonal elements improves the potential for the global minimum search, computer simulations are carried out by using the theoretical values. In according to the simulation result on 10 neurons, the success rate to get the optimum solution is 97.5%. The result shows that the improvement of the diagonal elements has potential for minimum search.
Discrimination Method of Synthetic Speech Using Pitch Frequency against Synthetic Speech Falsification
Akio OGIHARA Hitoshi UNNO Akira SHIOZAKI

PAPER-Biometrics

Vol:
E88-A No:1
Page(s):
280-286
We propose discrimination method of synthetic speech using pitch pattern of speech signal. By applying the proposed synthetic speech discrimination system as pre-process before the conventional HMM speaker verification system, we can improve the safety of conventional speaker verification system against imposture using synthetic speech. The proposed method distinguishes between synthetic speech and natural speech according to the pitch pattern which is distribution of value of normalized short-range autocorrelation function. We performed the experiment of user verification, and confirmed the validity of the proposed method.
A Continuous Speech Recognition Model Utilizing Subject Transition in Sentences
Nobuyuki TAKASU Akio OGIHARA Satoshi KONDO Shojiro YONEDA

LETTER-Speech Recognition

Vol:
E74-A No:5
Page(s):
1031-1033
The authors propose a model of the top down parser for continuous speech recognition. It utilizes a subject of an input sentence for its top down process and a preceding transition among subjects for the determination of a new subject. A task, a washing machine operation, which has five subjects are examined.

1-20hit(24hit)

Author Search Result

[Author] Akio OGIHARA(24hit)

An Improvement of the Pseudoinverse Rule with Diagonal Elements

An Autocorrelation Associative Neural Network with Self-Feedbacks

A Continuous Speech Recognition Algorithm Utilizing Island-Driven A^* Search

Switched-Capacitor Neural Networks and Their Application to Character Recognition

An Isolated Word Speech Recognition Based on Fusion of Visual and Auditory Information Usisng 30-frame/s and 24-bit Color Image

An Analysis on Minimum Searching Principle of Chaotic Neural Network

Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Framse Color Image

Sound Quality Evaluation for Audio Watermarking Based on Phase Shift Keying Using BCH Code

Speech Recognition Using HMM Based on Fusion of Visual and Auditory Information

Recognition of Isolated Digits Using Fuzzy Matrix Quantization

Correlation-Based Video Watermarking Method Using Inter-Frame Similarity

Tolerance Evaluation of Audio Watermarking Method Based on Modification of Sound Pressure Level between Channels

Asymmetric Neural Network and Its Application to Knapsack Problem

A Neural Network with a Function of lnhibiting Subtours on TSP

Binary Neural Network with Negative Self-Feedback and Its Application to N-Queens Problem

A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information

Associative Neural Network Models Based on a Measure of Manhattan Length

A Theoretical Analysis of Neural Networks with Nonzero Diagonal Elements

Discrimination Method of Synthetic Speech Using Pitch Frequency against Synthetic Speech Falsification

A Continuous Speech Recognition Model Utilizing Subject Transition in Sentences

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles