The search functionality is under construction.
The search functionality is under construction.

Keyword Search Result

[Keyword] statistical model(19hit)

1-19hit
  • Prioritization of Lane-Specific Traffic Jam Detection for Automotive Navigation Framework Utilizing Suddenness Index and Automatic Threshold Determination

    Aki HAYASHI  Yuki YOKOHATA  Takahiro HATA  Kouhei MORI  Masato KAMIYA  

     
    PAPER

      Pubricized:
    2023/02/03
      Vol:
    E106-D No:5
      Page(s):
    895-903

    Car navigation systems provide traffic jam information. In this study, we attempt to provide more detailed traffic jam information that considers the lane in which a traffic jam is in. This makes it possible for users to avoid long waits in queued traffic going toward an unintended destination. Lane-specific traffic jam detection utilizes image processing, which incurs long processing time and high cost. To reduce these, we propose a “suddenness index (SI)” to categorize candidate areas as sudden or periodic. Sudden traffic jams are prioritized as they may lead to accidents. This technology aggregates the number of connected cars for each mesh on a map and quantifies the degree of deviation from the ordinary state. In this paper, we evaluate the proposed method using actual global positioning system (GPS) data and found that the proposed index can cover 100% of sudden lane-specific traffic jams while excluding 82.2% of traffic jam candidates. We also demonstrate the effectiveness of time savings by integrating the proposed method into a demonstration framework. In addition, we improved the proposed method's ability to automatically determine the SI threshold to select the appropriate traffic jam candidates to avoid manual parameter settings.

  • Does Student-Submission Allocation Affect Peer Assessment Accuracy?

    Hideaki OHASHI  Toshiyuki SHIMIZU  Masatoshi YOSHIKAWA  

     
    PAPER

      Pubricized:
    2022/01/05
      Vol:
    E105-D No:5
      Page(s):
    888-897

    Peer assessment in education has pedagogical benefits and is a promising method for grading a large number of submissions. At the same time, student reliability has been regarded as a problem; consequently, various methods of estimating highly reliable grades from scores given by multiple students have been proposed. Under most of the existing methods, a nonadaptive allocation pattern, which performs allocation in advance, is assumed. In this study, we analyze the effect of student-submission allocation on score estimation in peer assessment under a nonadaptive allocation setting. We examine three types of nonadaptive allocation methods, random allocation, circular allocation and group allocation, which are considered the commonly used approaches among the existing nonadaptive peer assessment methods. Through simulation experiments, we show that circular allocation and group allocation tend to yield lower accuracy than random allocation. Then, we utilize this result to improve the existing adaptive allocation method, which performs allocation and assessment in parallel and tends to make similar allocation result to circular allocation. We propose the method to replace part of the allocation with random allocation, and show that the method is effective through experiments.

  • Statistical Model Using Geometrical-Optical Space Classification: Expansion of Applicable Frequencies to the 5 GHz Band

    Takahiro HASHIMOTO  Takayuki NAKANISHI  Yoshio INASAWA  Yasuhiro NISHIOKA  Hiroaki MIYASHITA  

     
    BRIEF PAPER-Electromagnetic Theory

      Vol:
    E101-C No:2
      Page(s):
    135-138

    The method for estimating propagation loss that classifies receiving points into multiple groups by focusing on the number of reflections and diffractions, and applies a separate statistical model to each group was extended from only 2.4 GHz band to both 2.4 GHz and 5 GHz band. The extended statistical model was created from received power measurements. First, an appropriate grouping method was investigated based on the fitting error of statistical model. Non-line-of-sight (NLOS) receiving points were grouped in order of points that a wave reflected one time reaches, points that a wave reflected two times reaches, and points that a wave diffracted one time reaches. Next, the effectiveness of the proposed method was verified by comparison with conventional statistical models (one-slope, dual-slope, multi-wall, partitioned) on three office floors that differ from the environment used to create the statistical model. The average NLOS estimation error for the three evaluation environments was 4.9 dB, demonstrating that the proposed method has accuracy equal to or better than that of conventional methods.

  • Distance Estimation Based on Statistical Models of Received Signal Strength

    Masahiro FUJII  Yuma HIROTA  Hiroyuki HATANO  Atsushi ITO  Yu WATANABE  

     
    LETTER

      Vol:
    E99-A No:1
      Page(s):
    199-203

    In this letter, we propose a new distance estimation method based on statistical models of a Received Signal Strength (RSS) at the receiver. The conventional distance estimator estimates the distance between the transmitter and the receiver based on the statistical average of the RSS when the receiver obtains instantaneous RSS and an estimate of the hyperparameters which consists of the path loss exponent and so on. However, it is well-known that instantaneous RSS does not always correspond to the average RSS because the RSS varies in accordance with a statistical model. Although the statistical model has been introduced for the hyperparameters estimation and the localization system, the conventional distance estimator has not yet utilized it. We introduce the statistical model to the distance estimator whose expected value of the estimate corresponds to true distance. Our theoretical analysis establishes that the proposed distance estimator is preferable to the conventional one in order to improve accuracy in the expected value of the distance estimate. Moreover, we evaluate the Mean Square Error (MSE) between true distance and the estimate. We provide evidence that the MSE is always proportional to the square of the distance if the estimate of the hyperparameters is ideally obtained.

  • Speech Enhancement Combining NMF Weighted by Speech Presence Probability and Statistical Model

    Yonggang HU  Xiongwei ZHANG  Xia ZOU  Gang MIN  Meng SUN  Yunfei ZHENG  

     
    LETTER-Speech and Hearing

      Vol:
    E98-A No:12
      Page(s):
    2701-2704

    The conventional non-negative matrix factorization (NMF)-based speech enhancement is accomplished by updating iteratively with the prior knowledge of the clean speech and noise spectra bases. With the probabilistic estimation of whether the speech is present or not in a certain frame, this letter proposes a speech enhancement algorithm incorporating the speech presence probability (SPP) obtained via noise estimation to the NMF process. To take advantage of both the NMF-based and statistical model-based approaches, the final enhanced speech is achieved by applying a statistical model-based filter to the output of the SPP weighted NMF. Objective evaluations using perceptual evaluation of speech quality (PESQ) on TIMIT with 20 noise types at various signal-to-noise ratio (SNR) levels demonstrate the superiority of the proposed algorithm over the conventional NMF and statistical model-based baselines.

  • A Statistical Model-Based Speech Enhancement Using Acoustic Noise Classification for Robust Speech Communication

    Jae-Hun CHOI  Joon-Hyuk CHANG  

     
    LETTER-Multimedia Systems for Communications

      Vol:
    E95-B No:7
      Page(s):
    2513-2516

    In this paper, we present a speech enhancement technique based on the ambient noise classification that incorporates the Gaussian mixture model (GMM). The principal parameters of the statistical model-based speech enhancement algorithm such as the weighting parameter in the decision-directed (DD) method and the long-term smoothing parameter of the noise estimation, are set according to the classified context to ensure best performance under each noise. For real-time context awareness, the noise classification is performed on a frame-by-frame basis using the GMM with the soft decision framework. The speech absence probability (SAP) is used in detecting the speech absence periods and updating the likelihood of the GMM.

  • DOA Estimation of Multiple Speech Sources from a Stereophonic Mixture in Underdetermined Case

    Ning DING  Nozomu HAMADA  

     
    PAPER-Engineering Acoustics

      Vol:
    E95-A No:4
      Page(s):
    735-744

    This paper proposes a direction-of-arrival (DOA) estimation method of multiple speech sources from a stereophonic mixture in an underdetermined case where the number of sources exceeds the number of sensors. The method relies on the sparseness of speech signals in time-frequency (T-F) domain representation which means multiple independent speakers have a small overlap. At first, a selection of T-F cells bearing reliable spatial information is proposed by an introduced reliability index which is defined by the estimated interaural phase difference at each T-F cell. Then, a statistical error propagation model between the phase difference at T-F cell and its consequent DOA is introduced. By employing this model and the sparseness in T-F domain the DOA estimation problem is altered to obtaining local peaks of probability density function of DOA. Finally the kernel density estimator approach based on the proposed statistical model is applied. The performance of the proposed method is assessed by conducted experiments. Our method outperforms others both in accuracy for real observed data and in robustness for simulation with additional diffused noise.

  • Noise Robust Voice Activity Detection Based on Switching Kalman Filter

    Masakiyo FUJIMOTO  Kentaro ISHIZUKA  

     
    PAPER-Voice Activity Detection

      Vol:
    E91-D No:3
      Page(s):
    467-477

    This paper addresses the problem of voice activity detection (VAD) in noisy environments. The VAD method proposed in this paper is based on a statistical model approach, and estimates statistical models sequentially without a priori knowledge of noise. Namely, the proposed method constructs a clean speech / silence state transition model beforehand, and sequentially adapts the model to the noisy environment by using a switching Kalman filter when a signal is observed. In this paper, we carried out two evaluations. In the first, we observed that the proposed method significantly outperforms conventional methods as regards voice activity detection accuracy in simulated noise environments. Second, we evaluated the proposed method on a VAD evaluation framework, CENSREC-1-C. The evaluation results revealed that the proposed method significantly outperforms the baseline results of CENSREC-1-C as regards VAD accuracy in real environments. In addition, we confirmed that the proposed method helps to improve the accuracy of concatenated speech recognition in real environments.

  • Theoretical Modeling of Inter-Frame Prediction Error for High Frame-Rate Video Signal

    Yukihiro BANDOH  Kazuya HAYASE  Seishi TAKAMURA  Kazuto KAMIKURA  Yoshiyuki YASHIMA  

     
    PAPER-Image Processing

      Vol:
    E91-A No:3
      Page(s):
    730-739

    Realistic representations using extremely high quality images are becoming increasingly popular. For example, digital cinemas can now display moving pictures composed of high-resolution digital images. Although these applications focus on increasing the spatial resolution only, higher frame-rates are being considered to achieve more realistic representations. Since increasing the frame-rate increases the total amount of information, efficient coding methods are required. However, its statistical properties are not clarified. This paper establishes for high frame-rate video a mathematical model of the relationship between frame-rate and bit-rate. A coding experiment confirms the validity of the mathematical model.

  • Training Augmented Models Using SVMs

    Mark J.F. GALES  Martin I. LAYTON  

     
    INVITED PAPER

      Vol:
    E89-D No:3
      Page(s):
    892-899

    There has been significant interest in developing new forms of acoustic model, in particular models which allow additional dependencies to be represented than those contained within a standard hidden Markov model (HMM). This paper discusses one such class of models, augmented statistical models. Here, a local exponential approximation is made about some point on a base model. This allows additional dependencies within the data to be modelled than are represented in the base distribution. Augmented models based on Gaussian mixture models (GMMs) and HMMs are briefly described. These augmented models are then related to generative kernels, one approach used for allowing support vector machines (SVMs) to be applied to variable length data. The training of augmented statistical models within an SVM, generative kernel, framework is then discussed. This may be viewed as using maximum margin training to estimate statistical models. Augmented Gaussian mixture models are then evaluated using rescoring on a large vocabulary speech recognition task.

  • Generating F0 Contours by Statistical Manipulation of Natural F0 Shapes

    Takashi SAITO  

     
    PAPER-Speech Analysis

      Vol:
    E89-D No:3
      Page(s):
    1100-1106

    This paper describes a method of generating F0 contours from natural F0 segmental shapes for speech synthesis. The extracted shapes of the F0 units are basically held invariant by eliminating any averaging operations in the analysis phase and by minimizing modification operations in the synthesis phase. The use of natural F0 shapes has great potential to cover a wide variety of speaking styles with the same framework, including not only read-aloud speech, but also dialogues and emotional speech. A linear-regression statistical model is used to "manipulate" the stored raw F0 shapes to build them up into a sentential F0 contour. Through experimental evaluations, the proposed model is shown to provide stable and robust F0 contour prediction for various speakers. By using this model, linguistically derived information about a sentence can be directly mapped, in a purely data-driven manner, to acoustic F0 values of the sentential intonation contour for a given target speaker.

  • A Statistical Model Based on the Three Head Words for Detecting Article Errors

    Ryo NAGATA  Tatsuya IGUCHI  Fumito MASUI  Atsuo KAWAI  Naoki ISU  

     
    PAPER-Educational Technology

      Vol:
    E88-D No:7
      Page(s):
    1700-1706

    In this paper, we propose a statistical model for detecting article errors, which Japanese learners of English often make in English writing. It is based on the three head words--the verb head, the preposition, and the noun head. To overcome the data sparseness problem, we apply the backed-off estimate to it. Experiments show that its performance (F-measure=0.70) is better than that of other methods. Apart from the performance, it has two advantages: (i) Rules for detecting article errors are automatically generated as conditional probabilities once a corpus is given; (ii) Its recall and precision rates are adjustable.

  • A Statistical Model-Based V/UV Decision under Background Noise Environments

    Joon-Hyuk CHANG  Nam Soo KIM  Sanjit K. MITRA  

     
    LETTER-Speech and Hearing

      Vol:
    E87-D No:12
      Page(s):
    2885-2887

    In this letter, we propose an approach to incorporate a statistical model for the voiced/unvoiced (V/UV) speech decision under background noise environments. Our approach consists of splitting the input noisy speech into two separate bands and applying a statistical model for each band. We compute and compare the likelihood ratio (LR) for each band based on the statistical model and estimated noise statistics for the V/UV decision. According to the simulation test, the proposed V/UV decision shows a better performance compared with the selectable mode vocoder (SMV) V/UV decision algorithm, particularly in clean and white noise environments.

  • Statistical Modelling of Speech Segment Duration by Constrained Tree Regression

    Naoto IWAHASHI  Yoshinori SAGISAKA  

     
    PAPER-Speech and Hearing

      Vol:
    E83-D No:7
      Page(s):
    1550-1559

    This paper presents a new method for statistical modelling of prosody control in speech synthesis. The proposed method, which is referred to as Constrained Tree Regression (CTR), can make suitable representation of complex effects of control factors for prosody with a moderate amount of learning data. It is based on recursive splits of predictor variable spaces and partial imposition of constraints of linear independence among predictor variables. It incorporates both linear and tree regressions with categorical predictor variables, which have been conventionally used for prosody control, and extends them to more general models. In addition, a hierarchical error function is presented to consider hierarchical structure in prosody control. This new method is applied to modelling of speech segmental duration. Experimental results show that better duration models are obtained by using the proposed regression method compared with linear and tree regressions using the same number of free parameters. It is also shown that the hierarchical structure of phoneme and syllable durations can be represented efficiently using the hierarchical error function.

  • Estimation of the AR Order of an Inhomogeneous AR Model with Input Expanded by a Set of Basis

    Yukiko YOKOYAMA  Mineo KUMAZAWA  Naoki MIKAMI  

     
    LETTER-Digital Signal Processing

      Vol:
    E83-A No:3
      Page(s):
    551-557

    We proposed a new model for non-stationary time series analysis based on an inhomogeneous AR (autoregressive) equation. Time series data is regarded as white noise plus output of an AR system excited by non-stationary input sequence represented in terms of a set of basis. A method of model parameter estimation was presented when the set of basis and the AR order are given. In order to extend the method, we present a method of parameter estimation when the AR order is unknown: we set two new criteria 1) minimize the root mean square error of the output sequence, and 2) minimize scattering of estimated frequencies. Then, we derive a procedure for the estimation of the AR order and the other unknown parameters.

  • Parameter Estimation of Inhomogeneous AR Model Expanded with Unknown Basis

    Yukiko YOKOYAMA  Mineo KUMAZAWA  Naoki MIKAMI  

     
    LETTER

      Vol:
    E82-A No:8
      Page(s):
    1582-1587

    We proposed a new model for non-stationary time series analysis based on the IAR (inhomogeneous autoregressive) model, and a method for model parameter estimation when the set of basis is given. In this paper, we further propose a method for parameter estimation including that of basis set: we set a new condition that power of the input sequence is concentrated in low-frequency domain, and developed an iterative estimation method. We firstly select an initial set of basis, from which new sets are created in order to minimize the difference between the model and data. Among new sets of basis, we select a good one that gives minimum standard deviation of estimated frequencies.

  • Modeling of Microwave Oven Interference Using Class-A Impulsive Noise and Optimum Reception

    Hideki KANEMOTO  Shinichi MIYAMOTO  Norihiko MORINAGA  

     
    PAPER

      Vol:
    E80-B No:5
      Page(s):
    670-677

    Microwave oven interference much degrades the performance of digital radio communication systems, and, in order to obtain a good error performance under microwave oven interference environment, the digital radio communication systems should be newly designed for microwave oven interference environment. In this paper, using the Middleton's canonical class-A impulsive noise model, we propose a statistical model of microwave oven interference and discuss the performance improvement achieved by an optimum reception based on this statistical model. As the results, although the first order statistic of microwave oven interference can be modeled by class-A impulsive noise, because of the burst high level interference, the performance of optimum receiver designed for class-A noise cannot achieve a good error performance under microwave oven interference environment. In order to eliminate the effect of burst high level interference, we introduce sample interleave scheme and show that the performance of optimum receiver can be much improved by using sample interleave scheme.

  • Optimal Line Fitting and Reliability Evaluation

    Yasushi KANAZAWA  Kenichi KANATANI  

     
    PAPER-Image Processing,Computer Graphics and Pattern Recognition

      Vol:
    E79-D No:9
      Page(s):
    1317-1322

    Introducing a mathematical model of image noise, we formalize the problem of fitting a line to point data as statistical estimation. It is shown that the reliability of the fitted line can be evaluated quantitatively in the form of the covariance matrix of the parameters. We present a numerical scheme called renormalization for computing an optimal fit and at the same time evaluating its reliability. We also present a scheme for visualizing the reliability of the fit by means of the primary deviation pair and derive an analytical expression for the reliability of a line fitted to an edge segment by using an asymptotic approximation. Our method is illustrated by showing simulations and real-image examples.

  • Optimal Conic Fitting and Reliability Evaluation

    Yasushi KANAZAWA  Kenichi KANATANI  

     
    PAPER-Image Processing,Computer Graphics and Pattern Recognition

      Vol:
    E79-D No:9
      Page(s):
    1323-1328

    Introducing a mathematical model of image noise, we formalize the problem of fitting a conic to point data as statistical estimation. It is shown that the reliability of the fitted conic can be evaluated quantitatively in the form of the covariance tensor. We present a numerical scheme called renormalization for computing an optimal fit and at the same time evaluating its reliability. We also present a scheme for visualizing the reliability of the fit by means of the primary deviation pair. Our method is illustrated by showing simulations and real-image examples.