1-11hit |
Mengmeng ZHANG Zeliang ZHANG Yuan LI Ran CHENG Hongyuan JING Zhi LIU
Point cloud video contains not only color information but also spatial position information and usually has large volume of data. Typical rate distortion optimization algorithms based on Human Visual System only consider the color information, which limit the coding performance. In this paper, a Coding Tree Unit (CTU) level quantization parameter (QP) adjustment algorithm based on JND and spatial complexity is proposed to improve the subjective and objective quality of Video-Based Point Cloud Compression (V-PCC). Firstly, it is found that the JND model is degraded at CTU level for attribute video due to the pixel filling strategy of V-PCC, and an improved JND model is designed using the occupancy map. Secondly, a spatial complexity detection metric is designed to measure the visual importance of each CTU. Finally, a CTU-level QP adjustment scheme based on both JND levels and visual importance is proposed for geometry and attribute video. The experimental results show that, compared with the latest V-PCC (TMC2-18.0) anchors, the BD-rate is reduced by -2.8% and -3.2% for D1 and D2 metrics, respectively, and the subjective quality is improved significantly.
Kimiko KAWASHIMA Kazuhisa YAMAGISHI Takanori HAYASHI
Many subjective quality assessment methods have been standardized. Experimenters can select a method from these methods in accordance with the aim of the planned subjective assessment experiment. It is often argued that the results of subjective quality assessment are affected by range effects that are caused by the quality distribution of the assessment videos. However, there are no studies on the double stimulus continuous quality-scale (DSCQS) and absolute category rating with hidden reference (ACR-HR) methods that investigate range effects in the high-quality range. Therefore, we conduct experiments using high-quality assessment videos (high-quality experiment) and low-to-high-quality assessment videos (low-to-high-quality experiment) and compare the DSCQS and ACR-HR methods in terms of accuracy, stability, and discrimination ability. Regarding accuracy, we find that the mean opinion scores of the DSCQS and ACR-HR methods were marginally affected by range effects, although almost all common processed video sequences showed no significant difference for the high- and low-to-high-quality experiments. Second, the DSCQS and ACR-HR methods were equally stable in the low-to-high-quality experiment, whereas the DSCQS method was more stable than the ACR-HR method in the high-quality experiment. Finally, the DSCQS method had higher discrimination ability than the ACR-HR method in the low-to-high-quality experiment, whereas both methods had almost the same discrimination ability for the high-quality experiment. We thus determined that the DSCQS method is better at minimizing the range effects than the ACR-HR method in the high-quality range.
Huyen T. T. TRAN Cuong T. PHAM Nam PHAM NGOC Anh T. PHAM Truong Cong THANG
360 videos have recently become a popular virtual reality content type. However, a good quality metric for 360 videos is still an open issue. In this work, our goal is to identify appropriate objective quality metrics for 360 video communications. Especially, fourteen objective quality measures at different processing phases are considered. Also, a subjective test is conducted in this study. The relationship between objective quality and subjective quality is investigated. It is found that most of the PSNR-related quality measures are well correlated with subjective quality. However, for evaluating video quality across different contents, a content-based quality metric is needed.
Toshiko TOMINAGA Kanako SATO Noriko YOSHIMURA Masataka MASUDA Hitoshi AOKI Takanori HAYASHI
Web browsing services are expanding as smartphones are becoming increasingly popular worldwide. To provide customers with appropriate quality of web-browsing services, quality design and in-service quality management on the basis of quality of experience (QoE) is important. We propose a web-browsing QoE estimation model. The most important QoE factor for web-browsing is the waiting time for a web page to load. Next, the variation in the communication quality based on a mobile network should be considered. We developed a subjective quality assessment test to clarify QoE characteristics in terms of waiting time using 20 different types of web pages and constructed a web-page QoE estimation model. We then conducted a subjective quality assessment test of web-browsing to clarify the relationship between web-page QoE and web-browsing QoE for three web sites. We obtained the following two QoE characteristics. First, the main factor influencing web-browsing QoE is the average web-page QoE. Second, when web-page QoE variation occurs, a decrease in web-page QoE with a huge amplitude causes the web-browsing QoE to decrease. We used these characteristics in constructing our web-browsing QoE estimation model. The verification test results using non-training data indicate the accuracy of the model. We also show that our findings are applicable to web-browsing quality design and solving management issues on the basis of QoE.
Kazuhisa YAMAGISHI Taichi KAWANO Takanori HAYASHI Jiro KATTO
Three-dimensional (3D) video service is expected to be introduced as a next-generation television service. Stereoscopic video is composed of two 2D video signals for the left and right views, and these 2D video signals are encoded. Video quality between the left and right views is not always consistent because, for example, each view is encoded at a different bit rate. As a result, the video quality difference between the left and right views degrades the quality of stereoscopic video. However, these characteristics have not been thoroughly studied or modeled. Therefore, it is necessary to better understand how the video quality difference affects stereoscopic video quality and to model the video quality characteristics. To do that, we conducted subjective quality assessments to derive subjective video quality characteristics. The characteristics showed that 3D video quality was affected by the difference in video quality between the left and right views, and that when the difference was small, 3D video quality correlated with the highest 2D video quality of the two views. We modeled these characteristics as a subjective quality metric using a training data set. Finally, we verified the performance of our proposed model by applying it to unknown data sets.
Lasith YASAKETHU Steven ADEDOYIN Anil FERNANDO Ahmet M. KONDOZ
In this paper, we propose a rate control technique for H.264/AVC using subjective quality of video for off line video coding. We propose to use Video Quality Metric (VQM) with an evolution strategy algorithm, which is capable of identifying the best possible quantization parameters for each frame/macroblock to encode the video sequence such that it would maximize the subjective quality of the entire video sequence subjected to the target bit rate. Simulation results suggest that the proposed technique can improve the RD performance of the H.264/AVC codec significantly. With the proposed technique, up to 40% bit rate reduction can be achieved at the same video quality. Furthermore, results show that the proposed technique can improve the subjective quality of the encoded video significantly for video sequences especially with high motion.
Cheon Seog KIM Hosik SOHN Wesley De NEVE Yong Man RO
In this paper, we propose an Adaptation Decision-Taking Engine (ADTE) that targets the delivery of scalable video content in mobile usage environments. Our ADTE design relies on an objective perceptual quality metric in order to achieve video adaptation according to human visual perception, thus allowing to maximize the Quality of Service (QoS). To describe the characteristics of a particular usage environment, as well as the properties of the scalable video content, MPEG-21 Digital Item Adaptation (DIA) is used. Our experimental results show that the proposed ADTE design provides video content with a higher subjective quality than an ADTE using the conventional maximum-bit-allocation method.
Sylvain TOURANCHEAU Patrick LE CALLET Dominique BARBA
In this paper, the impact of display on quality assessment is addressed. Subjective quality assessment experiments have been performed on both LCD and CRT displays. Two sets of still images and two sets of moving pictures have been assessed using either an ACR or a SAMVIQ protocol. Altogether, eight experiments have been led. Results are presented and discussed, some differences are pointed out. Concerning moving pictures, these differences seem to be mainly due to LCD moving artefacts such as motion blur. LCD motion blur has been measured objectively and with psycho-physics experiments. A motion-blur metric based on the temporal characteristics of LCD can be defined. A prediction model have been then designed which predict the differences of perceived quality between CRT and LCD. This motion-blur-based model enables the estimation of perceived quality on LCD with respect to the perceived quality on CRT. Technical solutions to LCD motion blur can thus be evaluated on natural contents by this mean.
Noritsugu EGI Hitoshi AOKI Akira TAKAHASHI
We present a method for the objective quality evaluation of noise-reduced speech in wideband speech communication services, which utilize speech with a wider bandwidth (e.g., 7 kHz) than the usual telephone bandwidth. Experiments indicate that the amount of residual noise and the distortion of speech and noise, which are quality factors, influence the perceived quality degradation of noise-reduced speech. From the results, we observe the principal relationships between these quality factors and perceived speech quality. On the basis of these relationships, we propose a method that quantifies each quality factor in noise-reduced speech by analyzing signals that can be measured and assesses the overall perceived quality of noise-reduced speech using values of these quality factors. To verify the validity of the method, we perform a subjective listening test and compare subjective quality of noise-reduced speech with its estimation. In the test, we use various types of background noise and noise-reduction algorithms. The verification results indicate that the correlation between subjective quality and its objective estimation is sufficiently high regardless of the type of background noise and noise-reduction algorithm.
Akira TAKAHASHI Noritsugu EGI Atsuko KURASHIMA
VoIP is one of the key technologies for recent telecommunication services. In addition to the migration from the conventional PSTN to IP networks, mobile networks will follow the PSTN in moving to an IP-based infrastructure. Due to limited radio resources, the speech bitrate in mobile networks must be more strongly compressed than is true in PSTN. This will lead to a heterogeneous network environment, in which different speech codecs are employed in fixed and mobile networks. Therefore, from the viewpoint of designing and managing the QoE (Quality of Experience) of end-to-end telephony services, establishing a method to evaluate the quality of VoIP in such a heterogeneous network environment is very important. The quality of speech communication services should be discussed in subjective terms. Subjective quality assessment is time-consuming and expensive, however, so objective quality assessment which estimates subjective quality without carrying out subjective quality experiments is desirable. To establish an objective method to evaluate the end-to-end quality of speech in a heterogeneous network environment, this paper proposes a method for estimating the end-to-end listening quality based on the quality in each individual segment. This method is very important because conventional technologies such as the E-model, which was standardized as ITU-T Recommendation G.107, cannot accurately estimate overall quality based on segmental qualities. The experimentals show that the proposed method offers better performance in terms of quality estimation than the conventional method.
Akira TAKAHASHI Masataka MASUDA Atsuko KURASHIMA
VoIP is one of the key technologies for recent telecommunication services. The quality of its services should be discussed in subjective terms. Since subjective quality assessment is time-consuming and expensive, however, objective quality assessment which estimates subjective quality without carrying out subjective quality experiments is desirable. This paper discusses the performance of the objective quality measure that was standardized as ITU-T Recommendation P.862 and clarifies the quality factors that can be evaluated with satisfactory accuracy based on it. We found that P.862 can be applied to the evaluation of coding distortion, tandeming of codecs, transmission bit-errors, packet loss, and silence compression in a codec, at least for clean Japanese speech. In addition, we propose a method of estimating the subjective quality evaluation value from objective measurement results and show the validity of this method. We also evaluate the uniqueness of objective quality assessment based on P.862 from the viewpoints of the effect of measurement noise and the variation of test speech samples, and propose how to improve the reproducibility of objective quality assessment.