Fast and Memory-Efficient Regular Expression Matching Using Transition Sharing

Shuzhuang ZHANG; Hao LUO; Binxing FANG; Xiaochun YUN

doi:10.1587/transinf.E92.D.1953

IEICE TRANSACTIONS on Information

Fast and Memory-Efficient Regular Expression Matching Using Transition Sharing

Shuzhuang ZHANG, Hao LUO, Binxing FANG, Xiaochun YUN

Full Text Views

0

Cite this

Summary :

Scanning packet payload at a high speed has become a crucial task in modern network management due to its wide variety applications on network security and application-specific services. Traditionally, Deterministic finite automatons (DFAs) are used to perform this operation in linear time. However, the memory requirements of DFAs are prohibitively high for patterns used in practical packet scanning, especially when many patterns are compiled into a single DFA. Existing solutions for memory blow-up are making a trade-off between memory requirement and memory access of processing per input character. In this paper we proposed a novel method to drastically reduce the memory requirements of DFAs while still maintain the high matching speed and provide worst-case guarantees. We removed the duplicate transitions between states by dividing all the DFA states into a number of groups and making each group of states share a merged transition table. We also proposed an efficient algorithm for transition sharing between states. The high efficiency in time and space made our approach adapted to frequently updated DFAs. We performed several experiments on real world rule sets. Overall, for all rule sets and approach evaluated, our approach offers the best memory versus run-time trade-offs.

Publication: IEICE TRANSACTIONS on Information Vol.E92-D No.10 pp.1953-1960

Publication Date: 2009/10/01

Publicized

Online ISSN: 1745-1361

DOI: 10.1587/transinf.E92.D.1953

Type of Manuscript: Special Section PAPER (Special Section on New Technologies and their Applications of the Internet)

Category: DRM and Security

Cite this

Copy

Shuzhuang ZHANG, Hao LUO, Binxing FANG, Xiaochun YUN, "Fast and Memory-Efficient Regular Expression Matching Using Transition Sharing" in IEICE TRANSACTIONS on Information, vol. E92-D, no. 10, pp. 1953-1960, October 2009, doi: 10.1587/transinf.E92.D.1953.
Abstract: Scanning packet payload at a high speed has become a crucial task in modern network management due to its wide variety applications on network security and application-specific services. Traditionally, Deterministic finite automatons (DFAs) are used to perform this operation in linear time. However, the memory requirements of DFAs are prohibitively high for patterns used in practical packet scanning, especially when many patterns are compiled into a single DFA. Existing solutions for memory blow-up are making a trade-off between memory requirement and memory access of processing per input character. In this paper we proposed a novel method to drastically reduce the memory requirements of DFAs while still maintain the high matching speed and provide worst-case guarantees. We removed the duplicate transitions between states by dividing all the DFA states into a number of groups and making each group of states share a merged transition table. We also proposed an efficient algorithm for transition sharing between states. The high efficiency in time and space made our approach adapted to frequently updated DFAs. We performed several experiments on real world rule sets. Overall, for all rule sets and approach evaluated, our approach offers the best memory versus run-time trade-offs.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.E92.D.1953/_p

Copy

@ARTICLE{e92-d_10_1953,
author={Shuzhuang ZHANG, Hao LUO, Binxing FANG, Xiaochun YUN, },
journal={IEICE TRANSACTIONS on Information},
title={Fast and Memory-Efficient Regular Expression Matching Using Transition Sharing},
year={2009},
volume={E92-D},
number={10},
pages={1953-1960},
abstract={Scanning packet payload at a high speed has become a crucial task in modern network management due to its wide variety applications on network security and application-specific services. Traditionally, Deterministic finite automatons (DFAs) are used to perform this operation in linear time. However, the memory requirements of DFAs are prohibitively high for patterns used in practical packet scanning, especially when many patterns are compiled into a single DFA. Existing solutions for memory blow-up are making a trade-off between memory requirement and memory access of processing per input character. In this paper we proposed a novel method to drastically reduce the memory requirements of DFAs while still maintain the high matching speed and provide worst-case guarantees. We removed the duplicate transitions between states by dividing all the DFA states into a number of groups and making each group of states share a merged transition table. We also proposed an efficient algorithm for transition sharing between states. The high efficiency in time and space made our approach adapted to frequently updated DFAs. We performed several experiments on real world rule sets. Overall, for all rule sets and approach evaluated, our approach offers the best memory versus run-time trade-offs.},
keywords={},
doi={10.1587/transinf.E92.D.1953},
ISSN={1745-1361},
month={October},}

Copy

TY - JOUR
TI - Fast and Memory-Efficient Regular Expression Matching Using Transition Sharing
T2 - IEICE TRANSACTIONS on Information
SP - 1953
EP - 1960
AU - Shuzhuang ZHANG
AU - Hao LUO
AU - Binxing FANG
AU - Xiaochun YUN
PY - 2009
DO - 10.1587/transinf.E92.D.1953
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E92-D
IS - 10
JA - IEICE TRANSACTIONS on Information
Y1 - October 2009
AB - Scanning packet payload at a high speed has become a crucial task in modern network management due to its wide variety applications on network security and application-specific services. Traditionally, Deterministic finite automatons (DFAs) are used to perform this operation in linear time. However, the memory requirements of DFAs are prohibitively high for patterns used in practical packet scanning, especially when many patterns are compiled into a single DFA. Existing solutions for memory blow-up are making a trade-off between memory requirement and memory access of processing per input character. In this paper we proposed a novel method to drastically reduce the memory requirements of DFAs while still maintain the high matching speed and provide worst-case guarantees. We removed the duplicate transitions between states by dividing all the DFA states into a number of groups and making each group of states share a merged transition table. We also proposed an efficient algorithm for transition sharing between states. The high efficiency in time and space made our approach adapted to frequently updated DFAs. We performed several experiments on real world rule sets. Overall, for all rule sets and approach evaluated, our approach offers the best memory versus run-time trade-offs.
ER -

IEICE TRANSACTIONS on Information

Fast and Memory-Efficient Regular Expression Matching Using Transition Sharing

Summary :

Authors

Keyword

Latest Issue

Contents

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles

IEICE TRANSACTIONS on Information

Fast and Memory-Efficient Regular Expression Matching Using Transition Sharing

Summary :

Authors

Keyword

Latest Issue

Contents

Copyrights notice of machine-translated contents

Cite this

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles