Jangwon CHOI Yoonsik CHOE Yong-Goo KIM
This letter proposes a novel depth-guided inpainting scheme for the high quality hole-filling in 2D-to-3D video conversion. The proposed scheme detects and removes foreground depth layers in an image patch, enabling appropriate patch formation using only disoccluded background information. This background only patch formation helps to avoid the propagation of wrong depths over hole area, and thus improve the overall quality of converted 3D video experience. Experimental results demonstrate the proposed scheme provides visually much more pleasing inpainting results with better preserved object edges compared to the state-of-the-art depth-guided inpainting schemes.
Yasuhiro HINOKUMA Zhipeng YUEN Teppei FUKUDA Takahira MITOMI Kiichi HAMAMOTO
1 × N active multi-mode interferometer laser diode (MMI LD) is proposed and demonstrated to realize single-wavelength edge-emitter without using grating configuration. As the 1 × N active-MMI LDs are based on longitudinal mode interference, they have a potential of single-wavelength emission without incorporating any grating layer on/beneath active layer. The fabricated devices showed single-wavelength emission with a side mode suppression ratio (SMSR) of 12dB at a wavelength of 1.57µm.
In 2001, Boneh and Franklin realized the first Identity-Based Encryption (IBE), and at the same time they proposed a simple way to revoke users from the system. Later, Boldyreva et al. pointed out that Boneh-Franklin's revocation method is not scalable well and they proposed the first IBE scheme with efficient revocation. Recently, Tseng and Tsai [Computer Journal, Vol.55 No.4, page 475-486, 2012] claimed that Boldyreva et al.'s scheme requires a secure channel between each user and the key generation center in the key update phase, and proposed a new revocable IBE (RIBE) with a public channel by extending the Boneh-Franklin scheme. In this paper, we revisit Tseng and Tsai's result; we first point out that secure channels (except for the initial key setup) are not mandatory in the definition of RIBE scheme formalized by Boldyreva et al. Next, we show that Boldyreva et al.'s scheme does not require any secure channels (except for the initial key setup), which is different from what Tseng and Tsai claimed and so invalidates their contribution of the first RIBE with a public channel. Moreover, we point out that there are simple techniques to remove secure channels from the Boneh-Franklin RIBE. Interestingly, we show that the secure-channel-free Boneh-Franklin RIBE scheme is secure against decryption key exposure, whereas the Tseng-Tsai RIBE scheme is vulnerable to this attack.
Jin-Ping HE Kun GAO Guo-Qiang NI Guang-Da SU Jian-Sheng CHEN
Considering the real existent fact of the ideal edge and the learning style of image analogy without reference parameters, a blind image recovery algorithm using a self-adaptive learning method is proposed in this paper. We show that a specific local image patch with degradation characteristic can be utilized for restoring the whole image. In the training process, a clear counterpart of the local image patch is constructed based on the ideal edge assumption so that identification of the Point Spread Function is no longer needed. Experiments demonstrate the effectiveness of the proposed method on remote sensing images.
Trung-Nghia PHUNG Thanh-Son PHAN Thang Tat VU Mai Chi LUONG Masato AKAGI
The most important advantage of HMM-based TTS is its highly intelligible. However, speech synthesized by HMM-based TTS is muffled and far from natural, especially under limited data conditions, which is mainly caused by its over-smoothness. Therefore, the motivation for this paper is to improve the naturalness of HMM-based TTS trained under limited data conditions while preserving its intelligibility. To achieve this motivation, a hybrid TTS between HMM-based TTS and the modified restricted Temporal Decomposition (MRTD), named HTD in this paper, was proposed. Here, TD is an interpolation model of decomposing a spectral or prosodic sequence of speech into sparse event targets and dynamic event functions, and MRTD is one simplified version of TD. With a determination of event functions close to the concept of co-articulation in speech, MRTD can synthesize smooth speech and the smoothness in synthesized speech can be adjusted by manipulating event targets of MRTD. Previous studies have also found that event functions of MRTD can represent linguistic information of speech, which is important to perceive speech intelligibility, while sparse event targets can convey the non-linguistics information, which is important to perceive the naturalness of speech. Therefore, prosodic trajectories and MRTD event functions of the spectral trajectory generated by HMM-based TTS were kept unchanged to preserve the high and stable intelligibility of HMM-based TTS. Whereas MRTD event targets of the spectral trajectory generated by HMM-based TTS were rendered with an original speech database to enhance the naturalness of synthesized speech. Experimental results with small Vietnamese datasets revealed that the proposed HTD was equivalent to HMM-based TTS in terms of intelligibility but was superior to it in terms of naturalness. Further discussions show that HTD had a small footprint. Therefore, the proposed HTD showed its strong efficiency under limited data conditions.
Zhong ZHENG Zhiying WANG Li SHEN
Power consumption has become a critical factor for embedded systems, especially for battery powered ones. Caches in these systems consume a large portion of the whole chip power. Embedded systems usually adopt set-associative caches to get better performance. However, parallel accessed cache ways incur more energy dissipation. This paper proposed a region-based way-partitioning scheme to reduce cache way access, and without sacrificing performance, to reduce the cache power consumption. The stack accesses and non-stack accesses are isolated and redirected to different ways of the L1 data cache. Under way-partitioning, cache way accesses are reduced, as well as the memory reference interference. Experimental results show that the proposed approach could save around 27.5% of L1 data cache energy on average, without significant performance degradation.
Yuki KIMURA Sakuyoshi SAITO Yuichi KIMURA
This paper presents design and radiation properties of a radial line microstrip antenna array (RL-MSAA) for linear polarization. A stacked circular microstrip antenna (C-MSA) is used as a radiation element for the RL-MSAA. Radiation phase of the stacked C-MSA is controlled by tuning radii of the lower and upper patches, therefore, the desired phase distribution of the RL-MSAA can be designed. In this paper, a linearly polarized RL-MSAA with three concentric rows of the stacked C-MSAs at a spacing of 0.65 wavelengths for uniform aperture distribution is designed and tested in 12GHz. The experimental results reveal that validity of the linearly polarized RL-MSAA with the stacked C-MSAs for radiation phase control is demonstrated.
To develop an envelope-tracking (ET) amplifier for next-generation cellular base stations, we propose a wideband envelope modulator, consisting of a linear-mode class-B amplifier and a switch-mode class-D amplifier. The function of the modulator is to track the envelope signal and supply voltage to an RF amplifier. To meet the requirements of a large-current and high-voltage supply that can handle a wideband signal, an “Alexander current-feedback amplifier topology” is applied to the linear-mode class-B amplifier. The Alexander topology not only boosts the voltage but also enhances the current capacity of a commercial high-speed operational amplifier (op-amp) by means of a push-pull stage with current mirrors and a buffer amplifier at the output of the op-amp. With this topology, a linear-mode amplifier can provide several-ampere-level current to a 11-Ω load. A prototype of the wideband envelope modulator is shown to achieve the efficiency of 71% with a 20-MHz WiMAX envelope signal at output power of 72W.
Masaaki FUJII Ji-Yun SEOL TaeYoung KIM JaeWeon CHO
A subarray signal processing scheme is described for a large-scale two-dimensional analog-digital hybrid beamformer to be used in quasi-millimeter-wave-band mobile communication systems. Multiple analog phased arrays direct their respective beams to multiple users, enabling space-division multiple access (SDMA). An iterative soft-input soft-output (SISO) multi-user detector recovers multi-user signals from subarray output signals corrupted by inter-user interference (IUI). In addition, a phased-array directivity control algorithm is derived based on inter-subarray signal phase-difference estimation from inter-beam-interference (IBI)-cancelled subarray output signals. Simulation results demonstrate that our proposed scheme achieves reduced hardware complexity, IUI-resistant multi-user signal detection, and IBI-resistant multi-user-tracking phased-array directivity control.
Ryoichi TAKASHIMA Tetsuya TAKIGUCHI Yasuo ARIKI
This paper presents a voice conversion (VC) technique for noisy environments, where parallel exemplars are introduced to encode the source speech signal and synthesize the target speech signal. The parallel exemplars (dictionary) consist of the source exemplars and target exemplars, having the same texts uttered by the source and target speakers. The input source signal is decomposed into the source exemplars, noise exemplars and their weights (activities). Then, by using the weights of the source exemplars, the converted signal is constructed from the target exemplars. We carried out speaker conversion tasks using clean speech data and noise-added speech data. The effectiveness of this method was confirmed by comparing its effectiveness with that of a conventional Gaussian Mixture Model (GMM)-based method.
Yuta KOBAYASHI Satoshi YOSHIDA Zen-ichi YAMAMOTO Shigeo KAWASAKI
An S-band GaN on Si based 1kW-class SSPA system for space wireless applications is proposed. Since high-efficiency and high-reliability amplifier is one of the most important technologies for power and communication systems in a future space base station on a planet, compact, high-power, and high-efficiency SSPA is strongly requested instead of TWTA. Thus, we adopt GaN on Si based amplifier due to its remarkable material properties. At the beginning, thermal vacuum and radiation test of GaN on Si are conducted so as to confirm the space applicability. Fabricated SSPA system consists of eight 200W HPAs and coaxial waveguide power combiner. It achieves high efficiency such as 57% of drain efficiency and 87% of combining efficiency when RF output power achieves more than 60dBm. Furthermore, long-term stable operation and good phase noise characteristics are also confirmed.
Kentaro SAITO Koshiro KITAO Tetsuro IMAI Yukihiko OKUMURA
MIMO transmission technologies have become an essential component of cellular systems such as Long Term Evolution (LTE) and LTE-Advanced. Recently, evaluating the communication performance of mobile users in cellular MIMO systems has become an urgent requirement. In this paper, we propose dynamic MIMO channel modeling for the urban environment. Our proposal is based on Geometry-based Stochastic Channel Modeling (GSCM). The cluster parameters such as the local scatterer locations around the measurement course are estimated by applying the particle filtering to measured data. We carried out radio propagation measurements in an urban environment at 3.35GHz band, and generated the dynamic channel from the measured data. The experiments showed that both the spreads and auto-correlation of Time of Arrival (ToA), Angle of Arrival (AoA) and Angle of Departure (AoD) were reconstructed within the acceptable error range in our dynamic channel model.
Hirofumi TSUZUKI Mauricio KUGLER Susumu KUROYANAGI Akira IWATA
This paper presents a Complex-Valued Neural Network-based sound localization method. The proposed approach uses two microphones to localize sound sources in the whole horizontal plane. The method uses time delay and amplitude difference to generate a set of features which are then classified by a Complex-Valued Multi-Layer Perceptron. The advantage of using complex values is that the amplitude information can naturally masks the phase information. The proposed method is analyzed experimentally with regard to the spectral characteristics of the target sounds and its tolerance to noise. The obtained results emphasize and confirm the advantages of using Complex-Valued Neural Networks for the sound localization problem in comparison to the traditional Real-Valued Neural Network model.
Kazumitsu SAKAMOTO Ken HIRAGA Tomohiro SEKI Tadao NAKAGAWA Kazuhiro UEHARA
A Simple decoding method for short-range MIMO (SR-MIMO) transmission can reduce the power consumption for MIMO decoding, but the distance between the transceivers requires millimeter-order accuracy in order to satisfy the required transmission quality. In this paper, we propose a phase difference control method between each propagation channel to alleviate the requirements for the transmission distance accuracy. In the proposed method, the phase difference between each propagation channel is controlled by changing the transmission (or received) power ratio of each element of sub-array antennas. In millimeter-wave broadband transmission simulation, we clarified that when sub-array antenna spacing is set to 6.6 mm and element spacing of sub-array antenna is set to 2.48mm, the proposed method can extend the transmission distance range satisfying the required transmission quality, which is that bit error rate (BER) before error correction is less than 10-2 from 9∼29mm to 0∼50mm in QPSK, from 15∼19mm to 0∼30mm in 16QAM, and from only 15mm to 4∼22mm in 64QAM.
Tuan Thanh TA Shoichi TANIFUJI Suguru KAMEDA Noriharu SUEMATSU Tadashi TAKAGI Kazuo TSUBOUCHI
In this paper, we propose a novel baseband (BB) phase shifter (PS) using a fixed-gain-amplifier (FGA) matrix. The proposed BB PS consists of 5 stages of a vector synthesis type FGA matrix with in-phase/quadrature-phase (I/Q) input/output interfaces. In order to achieve low gain variation between phase shift states, 3rd to 5th stages are designed to have a phase shift of +φi and -φi (i=3,4,5). To change between +φi and -φi phase shift states, two FGAs with DC bias in-phase/out-phase switches are used. The two FGAs have the same gain, therefore ideally no gain variation can be achieved. Using this configuration, phase shift error and gain variation caused by process mismatch and temperature variation can be reduced. Fabricated 5-bit BB PS has 3-dB bandwidth of 1.05GHz, root-mean-square (rms) phase errors lower than 2.2°, rms gain variations lower than 0.42dB. Power consumption of the PS core and output buffer are 4.9mW and 14.3mW, respectively. 1-dB compression output power is -12.5dBm. The fabricated PS shows that the total phase shift error and gain variation are within the required accuracy of a 5-bit PS with no requirement of calibration.
Yasutaka OGAWA Kanako YAMAGUCHI Huu Phu BUI Toshihiko NISHIMURA Takeo OHGANE
We evaluated the behavior of a multi-user multiple-input multiple-output (MIMO) system in time-varying channels using measured data. A base station for downlink or broadcast transmission requires downlink channel state information (CSI), which is outdated in time-varying environments and we encounter degraded performance due to interference. One of the countermeasures against time-variant environments is predicting channels with an autoregressive (AR) model-based method. We modified the AR prediction for a time division duplex system. We conducted measurement campaigns in indoor environments to verify the performance of the scheme of channel prediction in an actual environment and measured channel data. We obtained the bit-error rate (BER) using these data. The AR-model-based technique of prediction assuming the Jakes' model was found to reduce BER. Also, the optimum AR-model order was investigated by using the channel data we measured.
In this paper, we present a real-time hand pose recognition method to provide an intuitive user interface through hand poses or gestures without a keyboard and a mouse. For this, the areas of right and left hands are segmented from the depth camera image, and noise compensation is performed. Then, the rotation angle and the centroid point of each hand area are calculated. Subsequently, joint points and end points of a finger are detected by expanding a circle at regular intervals from a centroid point of the hand. Lastly, the hand pose is recognized by matching between the current hand information and the hand model of previous frame and the hand model is updated for the next frame. This method enables users to predict the hidden fingers through the hand model information of the previous frame using temporal coherence in consecutive frames. As a result of the experiment on various hand poses with the hidden fingers using both hands, the accuracy showed over 95% and the performance indicated over 32fps. The proposed method can be used as a contactless input interface in presentation, advertisement, education, and game applications.
Kensuke SAITO Daijiro ISHIBASHI Nobuhiro KUGA
In this letter, we propose a partial impedance-matching method using a two-strip resonator for noncontact Passive Intermodulation (PIM) measurements using a coaxial tube. It is shown that the strip closer to the inner tube of the coaxial tube is dominant in the observed PIM characteristics while both strips are excited equally. The ideal efficiency of power to each strip is 50%, which is a significant improvement in comparison with conventional methods.
Song JIA Li LIU Xiayu LI Fengfeng WU Yuan WANG Ganggang ZHANG
Information security has been seriously threatened by the differential power analysis (DPA). Delay-based dual-rail precharge logic (DDPL) is an effective solution to resist these attacks. However, conventional DDPL convertors have some shortcomings. In this paper, we propose improved convertor pairs based on dynamic logic and a sense amplifier (SA). Compared with the reference CMOS-to-DDPL convertor, our scheme could save 69% power consumption. As to the comparison of DDPL-to-CMOS convertor, the speed and power performances could be improved by 39% and 54%, respectively.
This paper presents the opportunity-based software rejuvenation policy and the optimization problem of software rejuvenation trigger time maximizing the system performance index. Our model is based on a basic semi-Markov software rejuvenation model by Dohi et al. 2000 under the environment where possible time, called opportunity, to execute software rejuvenation is limited. In the paper, we consider two stochastic point processes; renewal process and Markovian arrival process to represent the opportunity process. In particular, we derive the existence condition of the optimal trigger time under the two point processes analytically. In numerical examples, we illustrate the optimal design of the rejuvenation trigger schedule based on empirical data.