Superfast-Trainable Multi-Class Probabilistic Classifier by Least-Squares Posterior Fitting

Masashi SUGIYAMA

doi:10.1587/transinf.E93.D.2690

IEICE TRANSACTIONS on Information

Superfast-Trainable Multi-Class Probabilistic Classifier by Least-Squares Posterior Fitting

Masashi SUGIYAMA

Full Text Views

0

Cite this

Summary :

Kernel logistic regression (KLR) is a powerful and flexible classification algorithm, which possesses an ability to provide the confidence of class prediction. However, its training--typically carried out by (quasi-)Newton methods--is rather time-consuming. In this paper, we propose an alternative probabilistic classification algorithm called Least-Squares Probabilistic Classifier (LSPC). KLR models the class-posterior probability by the log-linear combination of kernel functions and its parameters are learned by (regularized) maximum likelihood. In contrast, LSPC employs the linear combination of kernel functions and its parameters are learned by regularized least-squares fitting of the true class-posterior probability. Thanks to this linear regularized least-squares formulation, the solution of LSPC can be computed analytically just by solving a regularized system of linear equations in a class-wise manner. Thus LSPC is computationally very efficient and numerically stable. Through experiments, we show that the computation time of LSPC is faster than that of KLR by two orders of magnitude, with comparable classification accuracy.

Publication: IEICE TRANSACTIONS on Information Vol.E93-D No.10 pp.2690-2701

Publication Date: 2010/10/01

Publicized

Online ISSN: 1745-1361

DOI: 10.1587/transinf.E93.D.2690

Type of Manuscript: Special Section PAPER (Special Section on Data Mining and Statistical Science)

Category

Cite this

Copy

Masashi SUGIYAMA, "Superfast-Trainable Multi-Class Probabilistic Classifier by Least-Squares Posterior Fitting" in IEICE TRANSACTIONS on Information, vol. E93-D, no. 10, pp. 2690-2701, October 2010, doi: 10.1587/transinf.E93.D.2690.
Abstract: Kernel logistic regression (KLR) is a powerful and flexible classification algorithm, which possesses an ability to provide the confidence of class prediction. However, its training--typically carried out by (quasi-)Newton methods--is rather time-consuming. In this paper, we propose an alternative probabilistic classification algorithm called Least-Squares Probabilistic Classifier (LSPC). KLR models the class-posterior probability by the log-linear combination of kernel functions and its parameters are learned by (regularized) maximum likelihood. In contrast, LSPC employs the linear combination of kernel functions and its parameters are learned by regularized least-squares fitting of the true class-posterior probability. Thanks to this linear regularized least-squares formulation, the solution of LSPC can be computed analytically just by solving a regularized system of linear equations in a class-wise manner. Thus LSPC is computationally very efficient and numerically stable. Through experiments, we show that the computation time of LSPC is faster than that of KLR by two orders of magnitude, with comparable classification accuracy.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.E93.D.2690/_p

Copy

@ARTICLE{e93-d_10_2690,
author={Masashi SUGIYAMA, },
journal={IEICE TRANSACTIONS on Information},
title={Superfast-Trainable Multi-Class Probabilistic Classifier by Least-Squares Posterior Fitting},
year={2010},
volume={E93-D},
number={10},
pages={2690-2701},
abstract={Kernel logistic regression (KLR) is a powerful and flexible classification algorithm, which possesses an ability to provide the confidence of class prediction. However, its training--typically carried out by (quasi-)Newton methods--is rather time-consuming. In this paper, we propose an alternative probabilistic classification algorithm called Least-Squares Probabilistic Classifier (LSPC). KLR models the class-posterior probability by the log-linear combination of kernel functions and its parameters are learned by (regularized) maximum likelihood. In contrast, LSPC employs the linear combination of kernel functions and its parameters are learned by regularized least-squares fitting of the true class-posterior probability. Thanks to this linear regularized least-squares formulation, the solution of LSPC can be computed analytically just by solving a regularized system of linear equations in a class-wise manner. Thus LSPC is computationally very efficient and numerically stable. Through experiments, we show that the computation time of LSPC is faster than that of KLR by two orders of magnitude, with comparable classification accuracy.},
keywords={},
doi={10.1587/transinf.E93.D.2690},
ISSN={1745-1361},
month={October},}

Copy

TY - JOUR
TI - Superfast-Trainable Multi-Class Probabilistic Classifier by Least-Squares Posterior Fitting
T2 - IEICE TRANSACTIONS on Information
SP - 2690
EP - 2701
AU - Masashi SUGIYAMA
PY - 2010
DO - 10.1587/transinf.E93.D.2690
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E93-D
IS - 10
JA - IEICE TRANSACTIONS on Information
Y1 - October 2010
AB - Kernel logistic regression (KLR) is a powerful and flexible classification algorithm, which possesses an ability to provide the confidence of class prediction. However, its training--typically carried out by (quasi-)Newton methods--is rather time-consuming. In this paper, we propose an alternative probabilistic classification algorithm called Least-Squares Probabilistic Classifier (LSPC). KLR models the class-posterior probability by the log-linear combination of kernel functions and its parameters are learned by (regularized) maximum likelihood. In contrast, LSPC employs the linear combination of kernel functions and its parameters are learned by regularized least-squares fitting of the true class-posterior probability. Thanks to this linear regularized least-squares formulation, the solution of LSPC can be computed analytically just by solving a regularized system of linear equations in a class-wise manner. Thus LSPC is computationally very efficient and numerically stable. Through experiments, we show that the computation time of LSPC is faster than that of KLR by two orders of magnitude, with comparable classification accuracy.
ER -

IEICE TRANSACTIONS on Information

Superfast-Trainable Multi-Class Probabilistic Classifier by Least-Squares Posterior Fitting

Summary :

Authors

Keyword

Latest Issue

Contents

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles

IEICE TRANSACTIONS on Information

Superfast-Trainable Multi-Class Probabilistic Classifier by Least-Squares Posterior Fitting

Summary :

Authors

Keyword

Latest Issue

Contents

Copyrights notice of machine-translated contents

Cite this

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles