IEICE global.ieice.org Site

Keyword Search Result

[Keyword] SPAR(322hit)

21-40hit(322hit)

FPGA Implementation of 3-Bit Quantized Multi-Task CNN for Contour Detection and Disparity Estimation
Masayuki MIYAMA

PAPER-Image Recognition, Computer Vision

Publicized:
2021/10/26
Vol:
E105-D No:2
Page(s):
406-414
Object contour detection is the task of extracting the shape created by the boundaries between objects in an image. Conventional methods limit the detection targets to specific categories, or misdetect edges of patterns inside an object. We propose a new method to represent a contour image where the pixel value is the distance to the boundary. Contour detection becomes a regression problem that estimates this contour image. A deep convolutional network for contour estimation is combined with stereo vision to detect unspecified object contours. Furthermore, thanks to similar inference targets and a common network structure, we propose a network that simultaneously estimates both contour and disparity with fully shared weights. In experiments, the multi-task network drew a good precision-recall curve, and the F-measure was about 0.833 for the FlyingThings3D dataset. The L1 loss of disparity estimation for the dataset was 2.571. This network reduces the amount of calculation and memory capacity by half, and the accuracy drop compared to the dedicated networks is slight. We then quantize both weights and activations of the network to 3 bits. We devise a dedicated hardware architecture for the quantized CNN and implement it on an FPGA. This circuit uses only internal memory to perform forward propagation calculations, which eliminates high-power external memory accesses. The circuit is a stall-free pixel-by-pixel pipeline, and performs 8 rows, 16 input channels, 16 output channels, 3 by 3 pixels convolution calculations in parallel. The convolution calculation performance at the operating frequency of 250 MHz is 9 TOPS.
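The throughput figure at the end of this abstract can be checked from the stated parallelism. The short calculation below is only a sanity check, not the authors' derivation; it assumes that each multiply-accumulate is counted as two operations, which the abstract does not state.

```python
# Parallelism figures taken from the abstract.
rows = 8
in_channels = 16
out_channels = 16
kernel_taps = 3 * 3
clock_hz = 250e6

# Assumption (not stated in the abstract): one multiply-accumulate = 2 operations.
ops_per_mac = 2

macs_per_cycle = rows * in_channels * out_channels * kernel_taps
ops_per_second = macs_per_cycle * ops_per_mac * clock_hz

print(macs_per_cycle)          # multiply-accumulates issued per clock cycle
print(ops_per_second / 1e12)   # tera-operations per second
```

Under that assumption the result is about 9.2 tera-operations per second, consistent with the rounded figure quoted in the abstract.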
A Novel Transferable Sparse Regression Method for Cross-Database Facial Expression Recognition
Wenjing ZHANG Peng SONG Wenming ZHENG

LETTER-Image Recognition, Computer Vision

Publicized:
2021/10/12
Vol:
E105-D No:1
Page(s):
184-188
In this letter, we propose a novel transferable sparse regression (TSR) method for cross-database facial expression recognition (FER). In TSR, we first present a novel regression function to regress the data into a latent representation space instead of a strict binary label space. To further alleviate the influence of outliers and overfitting, we impose a row sparsity constraint on the regression term. A pairwise relation term is also introduced to guide the feature transfer learning. Second, we design a global graph to transfer knowledge, which can well preserve the cross-database manifold structure. Moreover, we introduce a low-rank constraint on the graph regularization term to uncover additional structural information. Finally, several experiments are conducted on three popular facial expression databases, and the results validate that the proposed TSR method is superior to other non-deep and deep transfer learning methods.
Device-Free Localization via Sparse Coding with a Generalized Thresholding Algorithm
Qin CHENG Linghua ZHANG Bo XUE Feng SHU Yang YU

PAPER-Wireless Communication Technologies

Publicized:
2021/08/05
Vol:
E105-B No:1
Page(s):
58-66
As an emerging technology, device-free localization (DFL), which uses wireless sensor networks to detect targets not carrying any electronic devices, has spawned extensive applications, such as security safeguards and smart homes or hospitals. Previous studies formulate DFL as a classification problem, but there are still some challenges in terms of accuracy and robustness. In this paper, we exploit a generalized thresholding algorithm with parameter p as a penalty function to solve inverse problems with sparsity constraints for DFL. The function applies less bias to the large coefficients and penalizes small coefficients by reducing the value of p. By exploiting the distinctive capability of the p thresholding function to measure sparsity, the proposed approach can achieve accurate and robust localization performance in challenging environments. Extensive experiments show that the algorithm outperforms current alternatives.
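The abstract does not give the thresholding function itself. As an illustration only, the sketch below uses one widely known generalized thresholding rule, the p-shrinkage operator, which reduces to soft thresholding at p = 1 and biases large coefficients less as p decreases. Whether the paper uses exactly this form is an assumption, and the names `p_shrink`, `lam` and `p` are hypothetical.

```python
import numpy as np

def p_shrink(x, lam, p):
    """p-shrinkage: sign(x) * max(|x| - lam**(2 - p) * |x|**(p - 1), 0).

    Assumed form for illustration; p = 1 gives ordinary soft thresholding.
    """
    x = np.asarray(x, dtype=float)
    mag = np.abs(x)
    out = np.zeros_like(x)
    nz = mag > 0  # skip exact zeros so |x|**(p - 1) is well defined for p < 1
    shrunk = mag[nz] - lam ** (2.0 - p) * mag[nz] ** (p - 1.0)
    out[nz] = np.sign(x[nz]) * np.maximum(shrunk, 0.0)
    return out

coeffs = np.array([4.0, -4.0, 0.5, 0.0])
soft = p_shrink(coeffs, lam=1.0, p=1.0)   # soft thresholding
half = p_shrink(coeffs, lam=1.0, p=0.5)   # smaller p: less bias on large entries
print(soft, half)
```

With these inputs the large coefficients are shrunk less at p = 0.5 than at p = 1, while the small coefficient is still set to zero, which matches the behavior the abstract describes.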
Weight Sparseness for a Feature-Map-Split-CNN Toward Low-Cost Embedded FPGAs
Akira JINGUJI Shimpei SATO Hiroki NAKAHARA

PAPER

Publicized:
2021/09/27
Vol:
E104-D No:12
Page(s):
2040-2047
Convolutional neural networks (CNNs) have a high recognition rate in image recognition and are used in embedded systems such as smartphones, robots and self-driving cars. Low-end FPGAs are candidates for embedded image recognition platforms because they achieve real-time performance at a low cost. However, a CNN has a large number of parameters called weights and internal data called feature maps, which pose a challenge for FPGAs in terms of performance and memory capacity. To solve these problems, we exploit a split-CNN and weight sparseness. The split-CNN reduces the memory footprint by splitting the feature map into smaller patches and allows the feature map to be stored in the FPGA's high-throughput on-chip memory. Weight sparseness reduces computational costs and achieves even higher performance. We designed a dedicated architecture for a sparse CNN and a memory buffering schedule for a split-CNN, and implemented them on the PYNQ-Z1 FPGA board with a low-end FPGA. An experiment on classification using VGG16 shows that our implementation is 3.1 times faster than the GPU, and 5.4 times faster than an existing FPGA implementation.
Constrained Design of FIR Filters with Sparse Coefficients
Tatsuki ITASAKA Ryo MATSUOKA Masahiro OKUDA

PAPER

Publicized:
2021/05/13
Vol:
E104-A No:11
Page(s):
1499-1508
We propose an algorithm for the constrained design of FIR filters with sparse coefficients. In general filter design approaches, as the length of the filter increases, the number of multipliers used to construct the filter increases. This is a serious problem, especially in two-dimensional FIR filter designs. The FIR filter coefficients designed by the least-squares method with peak error constraint are optimal in the sense of least-squares within a given order, but not necessarily optimal in terms of constructing a filter that meets the design specification under the constraints on the number of coefficients. That is, a higher-order filter with several zero coefficients can meet the specification with a smaller number of multipliers. We propose a two-step approach to design constrained sparse FIR filters. Our method minimizes the number of non-zero coefficients while the frequency response of the filter meets the design specification. It achieves better performance in terms of peak error than conventional constrained least-squares designs with the same or higher number of multipliers in both one-dimensional and two-dimensional filter designs.
Matrix Factorization Based Recommendation Algorithm for Sharing Patent Resource
Xueqing ZHANG Xiaoxia LIU Jun GUO Wenlei BAI Daguang GAN

PAPER

Publicized:
2021/04/26
Vol:
E104-D No:8
Page(s):
1250-1257
As scientific and technological resources are experiencing information overload, it is quite expensive to find exactly the resources that users are interested in. The personalized recommendation system is a good candidate to solve this problem, but data sparseness and the cold-start problem still prevent the application of the recommendation system. Sparse data affects the quality of the similarity measurement and consequently the quality of the recommender system. In this paper, we propose a matrix factorization recommendation algorithm based on similarity calculation (SCMF), which introduces potential similarity relationships to solve the problem of data sparseness. A penalty factor is adopted in the latent item similarity matrix calculation to further capture more real relationships. We compared our approach with six other recommendation algorithms and conducted experiments on five public data sets. According to the experimental results, the recommendation precision improves by 2% to 9% over the best traditional algorithm. For sparse data sets, the prediction accuracy also improves by 0.17% to 18%. In addition, our approach was applied to patent resource exploitation provided by the Wanfang patents retrieval system. Experimental results show that our method performs better than commonly used algorithms, especially under the cold-start condition.
Minimax Design of Sparse IIR Filters Using Sparse Linear Programming (Open Access)
Masayoshi NAKAMOTO Naoyuki AIKAWA

PAPER-Digital Signal Processing

Publicized:
2021/02/15
Vol:
E104-A No:8
Page(s):
1006-1018
Recent trends in designing filters involve development of sparse filters with coefficients that not only have real but also zero values. These sparse filters can achieve a high performance through optimizing the selection of the zero coefficients and computing the real (non-zero) coefficients. Designing an infinite impulse response (IIR) sparse filter is more challenging than designing a finite impulse response (FIR) sparse filter. Therefore, studies on the design of IIR sparse filters have been rare. In this study, we consider IIR filters whose coefficients include zero values, called sparse IIR filters. First, we formulate the design problem as a linear programming problem without imposing any stability condition. Subsequently, we reformulate the design problem by altering the error function and prepare several possible denominator polynomials with stable poles. Finally, by incorporating these methods into successive thinning algorithms, we develop a new design algorithm for the filters. To demonstrate the effectiveness of the proposed method, its performance is compared with that of other existing methods.
Efficient Hardware Accelerator for Compressed Sparse Deep Neural Network
Hao XIAO Kaikai ZHAO Guangzhu LIU

LETTER-Computer System

Publicized:
2021/02/19
Vol:
E104-D No:5
Page(s):
772-775
This work presents a DNN accelerator architecture specifically designed for performing efficient inference on compressed and sparse DNN models. Leveraging the data sparsity, a runtime processing scheme is proposed to deal with the encoded weights and activations directly in the compressed domain without decompressing. Furthermore, a new data flow is proposed to facilitate the reuse of input activations across the fully-connected (FC) layers. The proposed design is implemented and verified using the Xilinx Virtex-7 FPGA. Experimental results show that, running AlexNet, it is 1.99× and 1.95× faster, and 20.38× and 3.04× more energy efficient, than CPU and mGPU platforms, respectively.
Optimization and Hole Interpolation of 2-D Sparse Arrays for Accurate Direction-of-Arrival Estimation
Shogo NAKAMURA Sho IWAZAKI Koichi ICHIGE

PAPER-Antennas and Propagation

Publicized:
2020/10/21
Vol:
E104-B No:4
Page(s):
401-409
This paper presents a method to optimize 2-D sparse array configurations along with a technique to interpolate holes to accurately estimate the direction of arrival (DOA). Conventional 2-D sparse arrays are often defined using a closed-form representation and have the property that they can create hole-free difference co-arrays that can estimate DOAs of incident signals that outnumber the physical elements. However, this property restricts the array configuration to a limited structure and results in a significant mutual coupling effect between consecutive sensors. In this paper, we introduce an optimization-based method for designing 2-D sparse arrays that enhances flexibility of array configuration as well as DOA estimation accuracy. We also propose a method to interpolate holes in 2-D co-arrays by nuclear norm minimization (NNM) that permits holes and to extend array aperture to further enhance DOA estimation accuracy. The performance of the proposed optimum arrays is evaluated through numerical examples.
Multiclass Dictionary-Based Statistical Iterative Reconstruction for Low-Dose CT
Hiryu KAMOSHITA Daichi KITAHARA Ken'ichi FUJIMOTO Laurent CONDAT Akira HIRABAYASHI

PAPER-Numerical Analysis and Optimization

Publicized:
2020/10/06
Vol:
E104-A No:4
Page(s):
702-713
This paper proposes a high-quality computed tomography (CT) image reconstruction method from low-dose X-ray projection data. A state-of-the-art method, proposed by Xu et al., exploits dictionary learning for image patches. This method generates an overcomplete dictionary from patches of standard-dose CT images and reconstructs low-dose CT images by minimizing the sum of a data fidelity and a regularization term based on sparse representations with the dictionary. However, this method does not take characteristics of each patch, such as textures or edges, into account. In this paper, we propose to classify all patches into several classes and utilize an individual dictionary with an individual regularization parameter for each class. Furthermore, for fast computation, we introduce the orthogonality to column vectors of each dictionary. Since similar patches are collected in the same cluster, accuracy degradation by the orthogonality hardly occurs. Our simulations show that the proposed method outperforms the state-of-the-art in terms of both accuracy and speed.
Radio Techniques Incorporating Sparse Modeling (Open Access)
Toshihiko NISHIMURA Yasutaka OGAWA Takeo OHGANE Junichiro HAGIWARA

INVITED SURVEY PAPER-Digital Signal Processing

Publicized:
2020/09/01
Vol:
E104-A No:3
Page(s):
591-603
Sparse modeling is one of the most active research areas in engineering and science. The technique provides solutions from far fewer samples exploiting sparsity, that is, the majority of the data are zero. This paper reviews sparse modeling in radio techniques. The first half of this paper introduces direction-of-arrival (DOA) estimation from signals received by multiple antennas. The estimation is carried out using compressed sensing, an effective tool for the sparse modeling, which produces solutions to an underdetermined linear system with a sparse regularization term. The DOA estimation performance is compared among three compressed sensing algorithms. The second half reviews channel state information (CSI) acquisitions in multiple-input multiple-output (MIMO) systems. In time-varying environments, CSI estimated with pilot symbols may be outdated at the actual transmission time. We describe CSI prediction based on sparse DOA estimation, and show excellent precoding performance when using the CSI prediction. The other topic in the second half is sparse Bayesian learning (SBL)-based channel estimation. A base station (BS) has many antennas in a massive MIMO system. A major obstacle for using the massive MIMO system in frequency-division duplex mode is an overhead for downlink CSI acquisition because we need to send many pilot symbols from the BS and to get the feedback from user equipment. An SBL-based channel estimation method can mitigate this issue. In this paper, we describe the outline of the method, and show that the technique can reduce the downlink pilot symbols.
Expectation-Propagation Detection for Generalized Spatial Modulation with Sparse Orthogonal Precoding
Tatsuya SUGIYAMA Keigo TAKEUCHI

LETTER-Communication Theory and Signals

Publicized:
2020/09/11
Vol:
E104-A No:3
Page(s):
661-664
Sparse orthogonal matrices are proposed to improve the convergence property of expectation propagation (EP) for sparse signal recovery from compressed linear measurements subject to known dense and ill-conditioned multiplicative noise. As a typical problem, this letter addresses generalized spatial modulation (GSM) in over-loaded and spatially correlated multiple-input multiple-output (MIMO) systems. The proposed sparse orthogonal matrices are used in precoding and constructed efficiently via a generalization of the fast Walsh-Hadamard transform. Numerical simulations show that the proposed sparse orthogonal precoding improves the convergence property of EP in over-loaded GSM MIMO systems with known spatially correlated channel matrices.
An Acceleration Method of Sparse Diffusion LMS based on Message Propagation
Ayano NAKAI-KASAI Kazunori HAYASHI

PAPER-Fundamental Theories for Communications

Publicized:
2020/08/06
Vol:
E104-B No:2
Page(s):
141-148
Diffusion least-mean-square (LMS) is a method to estimate and track an unknown parameter at multiple nodes in a network. When the unknown vector is sparse, the sparsity-promoting version of diffusion LMS, which utilizes a sparse regularization term in the cost function, is known to show better convergence performance than the original diffusion LMS. This paper proposes a novel choice of the coefficients involved in the updates of sparse diffusion LMS using the idea of message propagation. Moreover, we optimize the proposed coefficients with respect to mean-square-deviation at the steady-state. Simulation results demonstrate that the proposed method outperforms conventional methods in terms of the convergence performance.
Transparent Glass Quartz Antennas on the Windows of 5G-Millimeter-Wave-Connected Cars
Osamu KAGAYA Yasuo MORIMOTO Takeshi MOTEGI Minoru INOMATA

PAPER-Antennas and Propagation

Publicized:
2020/07/14
Vol:
E104-B No:1
Page(s):
64-72
This paper proposes a transparent glass quartz antenna for 5G-millimeter-wave-connected vehicles and clarifies the characteristics of signal reception when the glass antennas are placed on the windows of a vehicle traveling in an urban environment. Synthetic fused quartz is a material particularly suited for millimeter-wave devices owing to its excellent low transmission loss. Realizing synthetic fused quartz devices requires accurate micromachining technology specialized for the material coupled with the material technology. This paper presents a transparent antenna comprising a thin mesh pattern on a quartz substrate for installation on a vehicle window. A comparison of distributed transparent antennas and an omnidirectional antenna shows that the relative received power of the distributed antenna system is higher than that of the omnidirectional antenna. In addition, results show that the power received is similar when using vertically and horizontally polarized antennas. The design is verified in a field test using transparent antennas on the windows of a real vehicle.
Diversity Reception and Interference Cancellation for Receivers Using Antenna with Periodically Variable Antenna Pattern (Open Access)
Nobuhide KINJO Masato SAITO

PAPER

Vol:
E104-A No:1
Page(s):
253-262
In this paper, we propose a model of a diversity receiver which uses an antenna whose antenna pattern can periodically change. We also propose a minimum mean square error (MMSE) based interference cancellation method for the receiver, which, in principle, can suffer from interference in neighboring frequency bands. Since the antenna pattern changes according to the sum of sinusoidal waveforms with different frequencies, the signals are received at the carrier frequency and at the frequencies shifted from the carrier frequency by the frequencies of the sinusoidal waveforms. The proposed diversity scheme combines the components in the frequency domain to maximize the signal-to-noise power ratio (SNR) and to maximize the diversity gain. We confirm that the bit error rate (BER) of the proposed receiver improves with an increase in the number of arrival paths, which yields path diversity gain. We also confirm that the proposed MMSE based interference canceller works well when interference signals exist and achieves better BER performance than the conventional diversity receiver with maximum ratio combining.
Expectation Propagation Decoding for Sparse Superposition Codes (Open Access)
Hiroki MAYUMI Keigo TAKEUCHI

LETTER-Coding Theory

Publicized:
2020/07/06
Vol:
E103-A No:12
Page(s):
1666-1669
Expectation propagation (EP) decoding is proposed for sparse superposition coding in orthogonal frequency division multiplexing (OFDM) systems. When a randomized discrete Fourier transform (DFT) dictionary matrix is used, the EP decoding has the same complexity as approximate message-passing (AMP) decoding, which is a low-complexity and powerful decoding algorithm for the additive white Gaussian noise (AWGN) channel. Numerical simulations show that the EP decoding achieves comparable performance to AMP decoding for the AWGN channel. For OFDM systems, on the other hand, the EP decoding is much superior to the AMP decoding, while the AMP decoding has an error floor in the high signal-to-noise ratio regime.
L0 Norm Optimization in Scrambled Sparse Representation Domain and Its Application to EtC System
Takayuki NAKACHI Hitoshi KIYA

PAPER-Cryptography and Information Security

Vol:
E103-A No:12
Page(s):
1589-1598
In this paper, we propose L0 norm optimization in a scrambled sparse representation domain and its application to an Encryption-then-Compression (EtC) system. We design a random unitary transform that conserves L0 norm isometry. The resulting encryption method provides a practical orthogonal matching pursuit (OMP) algorithm that allows computation in the encrypted domain. We prove that the proposed method theoretically has exactly the same estimation performance as the nonencrypted variant of the OMP algorithm. In addition, we demonstrate the security strength of the proposed secure sparse representation when applied to the EtC system. Even if the dictionary information is leaked, the proposed scheme protects the privacy information of observed signals.
SENTEI: Filter-Wise Pruning with Distillation towards Efficient Sparse Convolutional Neural Network Accelerators
Masayuki SHIMODA Youki SADA Ryosuke KURAMOCHI Shimpei SATO Hiroki NAKAHARA

PAPER-Computer System

Publicized:
2020/08/03
Vol:
E103-D No:12
Page(s):
2463-2470
In the realization of convolutional neural networks (CNNs) in resource-constrained embedded hardware, the memory footprint of weights is one of the primary problems. Pruning techniques are often used to reduce the number of weights. However, the distribution of nonzero weights is highly skewed, which makes it more difficult to utilize the underlying parallelism. To address this problem, we present SENTEI*, filter-wise pruning with distillation, to realize hardware-aware network architecture with comparable accuracy. The filter-wise pruning eliminates weights such that each filter has the same number of nonzero weights, and retraining with distillation retains the accuracy. Further, we develop a zero-weight skipping inter-layer pipelined accelerator on an FPGA. The equalization enables inter-filter parallelism, where a processing block for a layer executes filters concurrently with straightforward architecture. Our evaluation of semantic-segmentation tasks indicates that the resulting mIoU only decreased by 0.4 points. Additionally, the speedup and power efficiency of our FPGA implementation were 33.2× and 87.9× higher than those of the mobile GPU. Therefore, our technique realizes hardware-aware network with comparable accuracy.
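The equalization step described in this abstract (every filter keeps the same number of nonzero weights) can be illustrated with a short NumPy sketch. It is a minimal example, not the SENTEI implementation: the selection criterion (largest magnitude) is an assumption the abstract does not confirm, distillation-based retraining is omitted, and the name `filterwise_prune` is hypothetical.

```python
import numpy as np

def filterwise_prune(weights, keep):
    """Zero all but the `keep` largest-magnitude weights in each filter.

    `weights` has shape (num_filters, ...); axis 0 indexes the filters.
    Magnitude-based selection is an assumption made for this sketch.
    """
    w = np.asarray(weights, dtype=float)
    flat = w.reshape(w.shape[0], -1)
    # Column indices of the `keep` largest magnitudes in each row (filter).
    idx = np.argpartition(np.abs(flat), -keep, axis=1)[:, -keep:]
    mask = np.zeros(flat.shape, dtype=bool)
    np.put_along_axis(mask, idx, True, axis=1)
    return (flat * mask).reshape(w.shape)

rng = np.random.default_rng(0)
conv_weights = rng.normal(size=(4, 3, 3, 3))   # 4 filters, 3 channels, 3x3 kernels
pruned = filterwise_prune(conv_weights, keep=5)
nonzeros_per_filter = np.count_nonzero(pruned.reshape(4, -1), axis=1)
print(nonzeros_per_filter)
```

Because every filter ends up with the same nonzero count, each filter needs the same number of multiply-accumulate steps, which is the property the abstract relies on for inter-filter parallelism.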
A Social Collaborative Filtering Method to Alleviate Data Sparsity Based on Graph Convolutional Networks
Haitao XIE Qingtao FAN Qian XIAO

PAPER-Artificial Intelligence, Data Mining

Publicized:
2020/08/28
Vol:
E103-D No:12
Page(s):
2611-2619
Nowadays, recommender systems (RS) keep drawing attention from academia, and collaborative filtering (CF) is the most successful technique for building RS. To overcome the inherent limitation of CF, which is referred to as data sparsity, various solutions have been proposed to incorporate additional social information, such as trust networks, into recommendation processes. However, existing methods struggle with multi-source data integration (i.e., fusion of social information and ratings), which is the basis for similarity calculation of user preferences. To this end, we propose a social collaborative filtering method based on novel trust metrics. Firstly, we use Graph Convolutional Networks (GCNs) to learn the associations between social information and user ratings while considering the underlying social network structures. Secondly, we measure the direct-trust values between neighbors by representing multi-source data as user ratings on popular items, and then calculate the indirect-trust values based on trust propagation. Thirdly, we employ all trust values to create a social regularization in user-item rating matrix factorization in order to avoid overfitting. The experiments on real datasets show that our approach outperforms the other state-of-the-art methods in the use of multi-source data to alleviate data sparsity.
Optimization of Deterministic Pilot Pattern Placement Based on Quantum Genetic Algorithm for Sparse Channel Estimation in OFDM Systems
Yang NIE Xinle YU

PAPER-Wireless Communication Technologies

Publicized:
2020/04/21
Vol:
E103-B No:10
Page(s):
1164-1171
This paper proposes a deterministic pilot pattern placement optimization scheme based on the quantum genetic algorithm (QGA), which aims to improve the performance of sparse channel estimation in orthogonal frequency division multiplexing (OFDM) systems. By minimizing the mutual incoherence property (MIP) of the sensing matrix, the pilot pattern placement optimization is modeled as the solution of a combinatorial optimization problem. QGA is used to solve the optimization problem and generate an optimized pilot pattern while effectively avoiding local optima traps. The simulation results demonstrate that the proposed method can generate a sensing matrix with a smaller MIP than a random search or the genetic algorithm (GA), and the optimized pilot pattern performs well for sparse channel estimation in OFDM systems.
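The objective being minimized can be sketched as follows. This is a minimal illustration that assumes the sensing matrix is the partial DFT matrix formed by the pilot subcarrier rows and the first channel taps, and that the MIP value is the usual mutual coherence (largest normalized inner product between distinct columns). The abstract does not spell out either definition, and the function name and parameters are hypothetical. The QGA search itself is not shown.

```python
import numpy as np

def mutual_coherence(pilots, n_subcarriers, n_taps):
    """Mutual coherence of an assumed partial-DFT sensing matrix.

    Rows correspond to pilot subcarrier indices, columns to channel taps.
    """
    k = np.asarray(pilots).reshape(-1, 1)
    l = np.arange(n_taps).reshape(1, -1)
    A = np.exp(-2j * np.pi * k * l / n_subcarriers)
    A = A / np.linalg.norm(A, axis=0, keepdims=True)  # unit-norm columns
    G = np.abs(A.conj().T @ A)                        # Gram matrix magnitudes
    np.fill_diagonal(G, 0.0)                          # ignore self-correlation
    return G.max()

# Equally spaced pilots: orthogonal columns for 4 taps, aliasing at 5 taps.
good = mutual_coherence([0, 4, 8, 12], n_subcarriers=16, n_taps=4)
bad = mutual_coherence([0, 4, 8, 12], n_subcarriers=16, n_taps=5)
print(good, bad)
```

A placement search such as the one described would evaluate a function like this for each candidate pilot set and prefer the candidates with the smaller value.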
