An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain

Jongho NANG; Seungwook HONG; Ohyeong KWON

An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain

Jongho NANG, Seungwook HONG, Ohyeong KWON

Full Text Views

0

Cite this

Summary :

The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.

Publication: IEICE TRANSACTIONS on Communications Vol.E84-B No.8 pp.2292-2300

Publication Date: 2001/08/01

Publicized

Online ISSN

DOI

Type of Manuscript: PAPER

Category: Multimedia Systems

Cite this

Copy

Jongho NANG, Seungwook HONG, Ohyeong KWON, "An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain" in IEICE TRANSACTIONS on Communications, vol. E84-B, no. 8, pp. 2292-2300, August 2001, doi: .
Abstract: The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.
URL: https://global.ieice.org/en_transactions/communications/10.1587/e84-b_8_2292/_p

Copy

@ARTICLE{e84-b_8_2292,
author={Jongho NANG, Seungwook HONG, Ohyeong KWON, },
journal={IEICE TRANSACTIONS on Communications},
title={An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain},
year={2001},
volume={E84-B},
number={8},
pages={2292-2300},
abstract={The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.},
keywords={},
doi={},
ISSN={},
month={August},}

Copy

TY - JOUR
TI - An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain
T2 - IEICE TRANSACTIONS on Communications
SP - 2292
EP - 2300
AU - Jongho NANG
AU - Seungwook HONG
AU - Ohyeong KWON
PY - 2001
DO -
JO - IEICE TRANSACTIONS on Communications
SN -
VL - E84-B
IS - 8
JA - IEICE TRANSACTIONS on Communications
Y1 - August 2001
AB - The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.
ER -