Multiple Object Segmentation in Videos Using Max-Flow Decomposition

Yihang BO; Hao JIANG

doi:10.1587/transfun.E99.A.2547

Multiple Object Segmentation in Videos Using Max-Flow Decomposition

Yihang BO, Hao JIANG

Full Text Views

0

Cite this

Summary :

In this paper, we propose a novel decomposition method to segment multiple object regions simultaneously in cluttered videos. This method formulates object regions segmentation as a labeling problem in which we assign object IDs to the superpixels in a sequence of video frames so that the unary color matching cost is low, the assignment induces compact segments, and the superpixel labeling is consistent through time. Multi-object segmentation in a video is a combinatorial problem. We propose a binary linear formulation. Since the integer linear programming is hard to solve directly, we relax it and further decompose the relaxation into a sequence of much simpler max-flow problems. The proposed method is guaranteed to converge in a finite number of steps to the global optimum of the relaxation. It also has a high chance to obtain all integer solution and therefore achieves the global optimum. The rounding of the relaxation result gives an N-approximation solution, where N is the number of objects. Comparing to directly solving the integer program, the novel decomposition method speeds up the computation by orders of magnitude. Our experiments show that the proposed method is robust against object pose variation, occlusion and is more accurate than the competing methods while at the same time maintains the efficiency.

Publication: IEICE TRANSACTIONS on Fundamentals Vol.E99-A No.12 pp.2547-2557

Publication Date: 2016/12/01

Publicized

Online ISSN: 1745-1337

DOI: 10.1587/transfun.E99.A.2547

Type of Manuscript: PAPER

Category: Vision

Authors

Yihang BO
the Beijing Film Academy
Hao JIANG
the Boston College

Keyword

video segmentation, linear programming, max-flow decomposition, multi-object

Cite this

Copy

Yihang BO, Hao JIANG, "Multiple Object Segmentation in Videos Using Max-Flow Decomposition" in IEICE TRANSACTIONS on Fundamentals, vol. E99-A, no. 12, pp. 2547-2557, December 2016, doi: 10.1587/transfun.E99.A.2547.
Abstract: In this paper, we propose a novel decomposition method to segment multiple object regions simultaneously in cluttered videos. This method formulates object regions segmentation as a labeling problem in which we assign object IDs to the superpixels in a sequence of video frames so that the unary color matching cost is low, the assignment induces compact segments, and the superpixel labeling is consistent through time. Multi-object segmentation in a video is a combinatorial problem. We propose a binary linear formulation. Since the integer linear programming is hard to solve directly, we relax it and further decompose the relaxation into a sequence of much simpler max-flow problems. The proposed method is guaranteed to converge in a finite number of steps to the global optimum of the relaxation. It also has a high chance to obtain all integer solution and therefore achieves the global optimum. The rounding of the relaxation result gives an N-approximation solution, where N is the number of objects. Comparing to directly solving the integer program, the novel decomposition method speeds up the computation by orders of magnitude. Our experiments show that the proposed method is robust against object pose variation, occlusion and is more accurate than the competing methods while at the same time maintains the efficiency.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/transfun.E99.A.2547/_p

Copy

@ARTICLE{e99-a_12_2547,
author={Yihang BO, Hao JIANG, },
journal={IEICE TRANSACTIONS on Fundamentals},
title={Multiple Object Segmentation in Videos Using Max-Flow Decomposition},
year={2016},
volume={E99-A},
number={12},
pages={2547-2557},
abstract={In this paper, we propose a novel decomposition method to segment multiple object regions simultaneously in cluttered videos. This method formulates object regions segmentation as a labeling problem in which we assign object IDs to the superpixels in a sequence of video frames so that the unary color matching cost is low, the assignment induces compact segments, and the superpixel labeling is consistent through time. Multi-object segmentation in a video is a combinatorial problem. We propose a binary linear formulation. Since the integer linear programming is hard to solve directly, we relax it and further decompose the relaxation into a sequence of much simpler max-flow problems. The proposed method is guaranteed to converge in a finite number of steps to the global optimum of the relaxation. It also has a high chance to obtain all integer solution and therefore achieves the global optimum. The rounding of the relaxation result gives an N-approximation solution, where N is the number of objects. Comparing to directly solving the integer program, the novel decomposition method speeds up the computation by orders of magnitude. Our experiments show that the proposed method is robust against object pose variation, occlusion and is more accurate than the competing methods while at the same time maintains the efficiency.},
keywords={},
doi={10.1587/transfun.E99.A.2547},
ISSN={1745-1337},
month={December},}

Copy

TY - JOUR
TI - Multiple Object Segmentation in Videos Using Max-Flow Decomposition
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 2547
EP - 2557
AU - Yihang BO
AU - Hao JIANG
PY - 2016
DO - 10.1587/transfun.E99.A.2547
JO - IEICE TRANSACTIONS on Fundamentals
SN - 1745-1337
VL - E99-A
IS - 12
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - December 2016
AB - In this paper, we propose a novel decomposition method to segment multiple object regions simultaneously in cluttered videos. This method formulates object regions segmentation as a labeling problem in which we assign object IDs to the superpixels in a sequence of video frames so that the unary color matching cost is low, the assignment induces compact segments, and the superpixel labeling is consistent through time. Multi-object segmentation in a video is a combinatorial problem. We propose a binary linear formulation. Since the integer linear programming is hard to solve directly, we relax it and further decompose the relaxation into a sequence of much simpler max-flow problems. The proposed method is guaranteed to converge in a finite number of steps to the global optimum of the relaxation. It also has a high chance to obtain all integer solution and therefore achieves the global optimum. The rounding of the relaxation result gives an N-approximation solution, where N is the number of objects. Comparing to directly solving the integer program, the novel decomposition method speeds up the computation by orders of magnitude. Our experiments show that the proposed method is robust against object pose variation, occlusion and is more accurate than the competing methods while at the same time maintains the efficiency.
ER -