IEICE global.ieice.org Site

Author Search Result

[Author] Xirong MA(2hit)

1-2hit

Auto-Tuning of Thread Assignment for Matrix-Vector Multiplication on GPUs
Jinwei WANG Xirong MA Yuanping ZHU Jizhou SUN

PAPER-Fundamentals of Information Systems

Vol:
E96-D No:11
Page(s):
2319-2326
Modern GPUs have evolved to become a more general processor capable of executing scientific and engineering computations. It provides a highly parallel computing environment due to its large number of computing cores, which are suitable for numerous data parallel arithmetic computations, particularly linear algebra operations. The matrix-vector multiplication is one of the most important dense linear algebraic operations. It is applied to a diverse set of applications in many fields and must therefore be fully optimized to achieve a high-performance. In this paper, we proposed a novel auto-tuning method for matrix-vector multiplication on GPUs, where the number of assigned threads that are used to compute one element of the result vector can be auto-tuned according to the size of matrix. On the Nvidia's GPU GTX 650 with the most recent Kepler architecture, we developed an auto-tuner that can automatically select the optimal number of assigned threads for calculation. Based on the auto-tuner's result, we developed a versatile generic matrix-vector multiplication kernel with the CUDA programming model. A series of experiments on different shapes and sizes of matrices were conducted for comparing the performance of our kernel with that of the kernels from CUBLAS 5.0, MAGMA 1.3 and a warp method. The experiments results show that the performance of our matrix-vector multiplication kernel is close to the optimal behavior with increasing of the size of the matrix and has very little dependency on the shape of the matrix, which is a significant improvement compared to the other three kernels that exhibit unstable performance behavior for different shapes of matrices.
Indoor Scene Classification Based on the Bag-of-Words Model of Local Feature Information Gain
Rong WANG Zhiliang WANG Xirong MA

LETTER-Image Recognition, Computer Vision

Vol:
E96-D No:4
Page(s):
984-987
For the problem of Indoor Home Scene Classification, this paper proposes the BOW Model of Local Feature Information Gain. The experimental results show that not only the performance is improved but also the computation is reduced. Consequently this method out performs the state-of-the-art approach.

Author Search Result

[Author] Xirong MA(2hit)

Auto-Tuning of Thread Assignment for Matrix-Vector Multiplication on GPUs

Indoor Scene Classification Based on the Bag-of-Words Model of Local Feature Information Gain

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles