Zhu LI Kojiro TOMOTSUNE Yoichi TOMIOKA Hitoshi KITAZAWA
Template matching for image sequences captured with a moving camera is very important for several applications such as Robot Vision, SLAM, ITS, and video surveillance systems. However, it is difficult to realize accurate template matching using only visual feature information such as HSV histograms, edge histograms, HOG histograms, and SIFT features, because it is affected by several phenomena such as illumination change, viewpoint change, size change, and noise. In order to realize robust tracking, structure information such as the relative position of each part of the object should be considered. In this paper, we propose a method that considers both visual feature information and structure information. Experiments show that the proposed method realizes robust tracking and determine the relationships between object parts in the scenes and those in the template.
Maiko HATANO Norimasa YAFUNE Hirokuni TOKUDA Yoshiyuki YAMAMOTO Shin HASHIMOTO Katsushi AKITA Masaaki KUZUHARA
This paper describes high-temperature electron transport properties of AlGaN-channel HEMT fabricated on a free-standing AlN substrate, estimated at temperatures between 25 and 300. The AlGaN-channel HEMT exhibited significantly reduced temperature dependence in DC and RF device characteristics, as compared to those for the conventional AlGaN/GaN HEMT, resulting in larger values in both saturated drain current and current gain cutoff frequency at 300. Delay time analyses suggested that the temperature dependence of the AlGaN-channel HEMT was primarily dominated by the effective electron velocity in the AlGaN channel. These results indicate that an AlGaN-channel HEMT fabricated on an AlN substrate is promising for high-performance device applications at high temperatures.
Seok-Min CHAE Sung-Hak LEE Hyuk-Ju KWON Kyu-Ik SOHNG
Recently, a new image appearance model, named iCAM06, was developed for High-Dynamic-Range (HDR) image rendering. The dynamic range of a HDR image needs to be mapped onto the range of the output device where it will be displayed, this is called tone reproduction or tone mapping. iCAM06, a representative HDR rendering algorithm also uses tone compression for image reproduction on the dynamic range of output devices. However, iCAM06 causes a white point shift during its tone compression process. Therefore, we propose a compensation method for white point shifts using corrected channel gain. Experiment results show that the proposed method has better performance than iCAM06.
Nikhil JOSHI Adrish BANERJEE Jeong Woo LEE
The convergence behavior of turbo APPM (TAPPM) decoding is analyzed by using a three-dimensional extrinsic information transfer (EXIT) chart and the decoding trajectory. The signal-to-noise ratio (SNR) threshold, below which iterative decoding fails to converge, is predicted by using the 3-D EXIT chart analysis. Bit error rate performances of TAPPM schemes validate the EXIT-chart-based SNR threshold predictions. Outer constituent codes of TAPPM are chosen to show the lowest SNR threshold with the aid of EXIT chart analysis.
Our research is focused on examining a stereoscopic quality assessment model for stereoscopic images with disparate quality in left and right images for glasses-free stereo vision. In this paper, we examine the objective assessment model of 3-D images, considering the difference in image quality between each view-point generated by the disparity-compensated coding. A overall stereoscopic image quality can be estimated by using only predicted values of left and right 2-D image qualities based on the MPEG-7 descriptor information without using any disparity information. As a result, the stereoscopic still image quality is assessed with high prediction accuracy with correlation coefficient=0.98 and average error=0.17.
Chen LIU Xin JIN Tianruo ZHANG Satoshi GOTO
High-definition (HD) videos become more and more popular on portable devices these years. Due to the resolution mismatch between the HD video sources and the relative low-resolution screens of portable devices, the HD videos are usually fully decoded and then down-sampled (FDDS) for the displays, which not only increase the cost of both computational power and memory bandwidth, but also lose the details of video contents. In this paper, an encoder-unconstrained partial decoding scheme for H.264/AVC is presented to solve the problem by only decoding the object of interest (OOI) related region, which is defined by users. A simplified compression domain tracking method is utilized to ensure that the OOI locates in the center of the display area. The decoded partial area (DPA) adaptation, the reference block relocation (RBR) and co-located temporal Intra prediction (CTIP) methods are proposed to improve the visual quality for the DPA with low complexity. The simulation results show that the proposed partial decoding scheme provides an average of 50.16% decoding time reduction comparing to the fully decoding process. The displayed region also presents the original HD granularity of OOI. The proposed partial decoding scheme is especially useful for displaying HD video on the devices of which the battery life is a crucial factor.
Daisuke KANEMOTO Toru IDO Kenji TANIGUCHI
A low power and high performance with third order delta-sigma modulator for audio applications, fabricated in a 0.18 µm CMOS process, is presented. The modulator utilizes a third order noise shaping with only one opamp by using an opamp sharing technique. The opamp sharing among three integrator stages is achieved through the optimal operation timing, which makes use of the load capacitance differences between the three integrator stages. The designed modulator achieves 101.1 dB signal-to-noise ratio (A-weighted) and 101.5 dB dynamic range (A-weighted) with 7.5 mW power consumption from a 3.3 V supply. The die area is 1.27 mm2. The fabricated delta-sigma modulator achieves the highest figure-of-merit among published high performance low power audio delta-sigma modulators.
Lifeng HE Yuyan CHAO Kenji SUZUKI
This paper proposes a new first-scan method for two-scan labeling algorithms. In the first scan, our proposed method first scans every fourth image line, and processes the scan line and its two neighbor lines. Then, it processes the remaining lines from top to bottom one by one. Our method decreases the average number of times that must be checked to process a foreground pixel will; thus, the efficiency of labeling can be improved.
Masayuki ABE Noriaki KOGUSHI Kian Siong ANG René HOFSTETTER Kumar MANOJ Louis Nicholas RETNAM Hong WANG Geok Ing NG Chon JIN Dimitris PAVLIDIS
Novel thermopiles based on modulation doped AlGaAs/InGaAs and AlGaN/GaN heterostructures are proposed and developed for the first time, for uncooled infrared FPA (Focal Plane Array) image sensor application. The high responsivity with the high speed response time are designed to 4,900 V/W with 110 µs for AlGaAs/InGaAs, and to 460 V/W with 9 µs for AlGaN/GaN thermopiles, respectively. Based on integrated HEMT-MEMS technology, the AlGaAs/InGaAs 3232 matrix FPAs are fabricated to demonstrate its enhanced performances by black body measurement. The technology presented here demonstrates the potential of this approach for low-cost uncooled infrared FPA image sensor application.
Chul Bum KIM Doo Hyung WOO Hee Chul LEE
This paper presents a novel CMOS readout circuit for satellite infrared time delay and integration (TDI) arrays. An integrate-while-read method is adopted, and a dead-pixel-elimination circuit for solving a critical problem of the TDI scheme is integrated within a chip. In addition, an adaptive charge capacity control method is proposed to improve the signal-to-noise ratio (SNR) for low-temperature targets. The readout circuit was fabricated with a 0.35-µm CMOS process for a 5004 mid-wavelength infrared (MWIR) HgCdTe detector array. Using the circuit, a 90% background-limited infrared photodetection (BLIP) is satisfied over a wide input range (∼200–330 K), and the SNR is improved by 11 dB for the target temperature of 200 K.
Jing PENG Falin WU Ming ZHU Feixue WANG Kefei ZHANG
In this paper, an improved GPS/RFID integration method based on Sequential Iterated Reduced Sigma Point Kalman Filter (SIRSPKF) is proposed for vehicle navigation applications. It is applied to improve the accuracy, reliability and availability of satellite positioning in the areas where the satellite visibility is limited. An RFID system is employed to assist the GPS system in achieving high accuracy positioning. Further, to reduce the measurement noise and decrease the computational complexity caused by the integrated GPS/RFID, SIRSPKF is investigated as the dominant filter for the proposed integration. Performances and computational complexities of different integration scenarios with different filters are compared in this paper. A field experiment shows that both accuracy and availability of positioning can be improved significantly by this low-cost GPS/RFID integration method with the reduced computational load.
Koji TAKEDA Tomonari SATO Takaaki KAKITSUKA Akihiko SHINYA Kengo NOZAKI Chin-Hui CHEN Hideaki TANIYAMA Masaya NOTOMI Shinji MATSUO
To meet the demand for light sources for on-chip optical interconnections, we demonstrate the continuous-wave (CW) operation of photonic-crystal (PhC) nanocavity lasers at up to 89.8 by using InP buried heterostructures (BH). The wavelength of a PhC laser can be precisely designed over a wide range exceeding 100 nm by controlling the lattice constant of the PhC. The dynamic responses of the PhC laser are also demonstrated with a 3-dB bandwidth of over 7.0 GHz at 66.2. These results reveal the laser's availability for application to wavelength division multiplexed (WDM) optical interconnection on CMOS chips. We discuss the total bandwidths of future on-chip optical interconnections, and report the capabilities of PhC lasers.
Bo LIU Peng CAO Min ZHU Jun YANG Leibo LIU Shaojun WEI Longxing SHI
This paper presents a novel architecture design to optimize the reconfiguration process of a coarse-grained reconfigurable architecture (CGRA) called Reconfigurable Multimedia System II ( REMUS-II ). In REMUS-II, the tasks in multi-media applications are divided into two parts: computing-intensive tasks and control-intensive tasks. Two Reconfigurable Processor Units (RPUs) for accelerating computing-intensive tasks and a Micro-Processor Unit (µPU) for accelerating control-intensive tasks are contained in REMUS-II. As a large-scale CGRA, REMUS-II can provide satisfying solutions in terms of both efficiency and flexibility. This feature makes REMUS-II well-suited for video processing, where higher flexibility requirements are posed and a lot of computation tasks are involved. To meet the high requirement of the dynamic reconfiguration performance for multimedia applications, the reconfiguration architecture of REMUS-II should be well designed. To optimize the reconfiguration architecture of REMUS-II, a hierarchical configuration storage structure and a 3-stage reconfiguration processing structure are proposed. Furthermore, several optimization methods for configuration reusing are also introduced, to further improve the performance of reconfiguration process. The optimization methods include two aspects: the multi-target reconfiguration method and the configuration caching strategies. Experimental results showed that, with the reconfiguration architecture proposed, the performance of reconfiguration process will be improved by 4 times. Based on RTL simulation, REMUS-II can support the 1080p@32 fps of H.264 HiP@Level4 and 1080p@40 fps High-level MPEG-2 stream decoding at the clock frequency of 200 MHz. The proposed REMUS-II system has been implemented on a TSMC 65 nm process. The die size is 23.7 mm2 and the estimated on-chip dynamic power is 620 mW.
Jixin CHEN Wei HONG Hongjun TANG Pinpin YAN Li ZHANG Guangqi YANG Debin HOU Ke WU
In this paper, the research advances in silicon based millimeter wave and THz ICs in the State Key Laboratory of Millimeter Waves is reviewed, which consists of millimeter wave amplifiers, mixers, oscillators at Q, V and W and D band based on CMOS technology, and several research approaches of THz passive ICs including cavity and filter structures using SIW-like (Substrate Integrated Waveguide-like) guided wave structures based on CMOS and MEMs process. The design and performance of these components and devices are presented.
Satoshi YOSHIDA Shoichi TANIFUJI Suguru KAMEDA Noriharu SUEMATSU Tadashi TAKAGI Kazuo TSUBOUCHI
In order to realize millimeter-wave (MMW) 3-D system-in-package (SiP) front-end modules, we propose a 60-GHz band copper ball vertical interconnection structure, which interconnects between vertically stacked substrates. The structure enables ICs to be placed between the vertically stacked substrates. Since the diameter of the copper balls must exceed the thickness of the ICs, the distance between the substrates in the modules is larger than that of the flip-chip interconnection widely used in the MMW-band. Therefore, the conventional flip-chip interconnection does not scale for the interconnection between the substrates in MMW 3-D SiP front-end modules. The layout of grounded copper balls and the patterns of inner ground layers in the upper/lower substrates are designed using 3-D electromagnetic field simulation. The designed structure allows less than 1 dB transmission loss up to 71.1 GHz, compared with a through transmission line. The result is verified with fabrication and measurement and confirms the feasibility of MMW 3-D SiP front-end modules.
Ryeong-hee GWEON Yung-Lyul LEE
The next generation video coding standard HEVC shows high coding performance compared with the H.264/AVC standard, but the computational complexity of the HEVC encoder (HM3.0) is significantly higher. In this letter, the early termination of the CU encoding algorithm is proposed to reduce the computational complexity in the HEVC encoder. The proposed method reduces the encoder complexity by 58.7%, while maintaining the same level of coding efficiency.
Kensaku FUJII Kenji KASHIHARA Isao WAKABAYASHI Mitsuji MUNEYASU Masakazu MORIMOTO
In this paper, we propose a method capable of shortening the distance from a noise detection microphone to a loudspeaker in active noise control system with non-minimum phase secondary path. The distance can be basically shortened by forming the noise control filter, which produces the secondary noise provided by the loudspeaker, with the cascade connection of a non-recursive filter and a recursive filter. The output of the recursive filter, however, diverges even when the secondary path includes only a minimum phase component. In this paper, we prevent the divergence by utilizing MINT (multi-input/output inverse theorem) method increasing the number of secondary paths than that of primary paths. MINT method, however, requires a large scale inverse matrix operation, which increases the processing cost. We hence propose a method reducing the processing cost. Actually, MINT method has only to be applied to the non-minimum phase components of the secondary paths. We hence extract the non-minimum phase components and then apply MINT method only to those. The order of the inverse matrix thereby decreases and the processing cost can be reduced. We finally show a simulation result demonstrating that the proposed method successfully works.
Lu GAN Xiao Qing WANG Hong Shu LIAO
In this letter, a new method is proposed to solve the direction-of-arrivals (DOAs) estimation problem of coherently distributed sources based on the block-sparse signal model of compressed sensing (CS) and the convex optimization theory. We make use of a certain number of point sources and the CS array architecture to establish the compressive version of the discrete model of coherently distributed sources. The central DOA and the angular spread can be estimated simultaneously by solving a convex optimization problem which employs a joint norm constraint. As a result we can avoid the two-dimensional search used in conventional algorithms. Furthermore, the multiple-measurement-vectors (MMV) scenario is also considered to achieve robust estimation. The effectiveness of our method is confirmed by simulation results.
A highly stable microwave exciter system has been developed for 87Sr+ ion microwave frequency standards. The controller was built to optimize the transfer function of the phase-locked loop. The upper limit of the frequency tracking error achieved was 7.7 10-15 at τ = 1 s. A phase frequency discriminator using an FPGA was also made and applied to a phase-locked loop. This paper reports on the design of and results obtained from the microwave exciter for Sr+ ion microwave frequency standards.
Bobo ZENG Guijin WANG Xinggang LIN Chunxiao LIU
This work presents a real-time human detection system for VGA (Video Graphics Array, 640480) video, which well suits visual surveillance applications. To achieve high running speed and accuracy, firstly we design multiple fast scalar feature types on the gradient channels, and experimentally identify that NOGCF (Normalized Oriented Gradient Channel Feature) has better performance with Gentle AdaBoost in cascaded classifiers. A confidence measure for cascaded classifiers is developed and utilized in the subsequent tracking stage. Secondly, we propose to use speedup techniques including a detector pyramid for multi-scale detection and channel compression for integral channel calculation respectively. Thirdly, by integrating the detector's discrete detected humans and continuous detection confidence map, we employ a two-layer tracking by detection algorithm for further speedup and accuracy improvement. Compared with other methods, experiments show the system is significantly faster with 20 fps running speed in VGA video and has better accuracy as well.