LaFA: Lookahead finite automata for scalable regular expression detection

Masanori Bando, N. Sertac Artan, H. Jonathan Chao

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Although Regular Expressions (RegExes) have been widely used in network security applications, their inherent complexity often limits the total number of RegExes that can be detected using a single chip for a reasonable throughput. This limit on the number of RegExes impairs the scalability of today's RegEx detection systems. The scalability of existing schemes is generally limited by the traditional per character state processing and state transition detection paradigm. The main focus of existing schemes is in optimizing the number of states and the required transitions, but not the suboptimal character-based detection method. Furthermore, the potential benefits of reduced number of operations and states using out-of-sequence detection methods have not been explored. In this paper, we propose Looka-head Finite Automata (LaFA) to perform scalable RegEx detection using very small amount of memory. LaFA's memory requirement is very small due to the following three areas of effort described in this paper: (1) Different parts of a RegEx, namely RegEx components, are detected using different detectors, each of which is specialized and optimized for the detection of a certain RegEx component. (2) We systematically reorder the RegEx component detection sequence, which provides us with new possibilities for memory optimization. (3) Many redundant states in classical finite automata are identified and eliminated in LaFA. Our simulations show that LaFA requires an order of magnitude less memory compared to today's state-of-the-art RegEx detection systems. A single commodity Field Programmable Gate Array (FPGA) chip can accommodate up to twenty-five thousand (25k) RegExes. Based on the throughput of our LaFA prototype on FPGA, we estimated that a 34-Gbps throughput can be achieved.

Original languageEnglish (US)
Title of host publicationANCS'09
Subtitle of host publicationSymposium on Architecture for Networking and Communications Systems
Pages40-49
Number of pages10
DOIs
StatePublished - 2009
Event2009 Symposium on Architecture for Networking and Communications Systems, ANCS'09 - Princeton, NJ, United States
Duration: Oct 19 2009Oct 20 2009

Publication series

NameANCS'09: Symposium on Architecture for Networking and Communications Systems

Other

Other2009 Symposium on Architecture for Networking and Communications Systems, ANCS'09
Country/TerritoryUnited States
CityPrinceton, NJ
Period10/19/0910/20/09

Keywords

  • FPGA
  • LaFA
  • deep packet inspection
  • finite automation
  • network intrusion detection system
  • regular expressions

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'LaFA: Lookahead finite automata for scalable regular expression detection'. Together they form a unique fingerprint.

Cite this