Skip to main content
Top

2015 | OriginalPaper | Chapter

Accelerating DFA Construction by Parallelizing Subset Construction

Authors : Yan Shao, Yanbing Liu, Jianlong Tan

Published in: Trustworthy Computing and Services

Publisher: Springer Berlin Heidelberg

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Recently there have been increasing research interests on regular expression matching for its widely application in network systems to recognize predefined signatures in suspicious network traffic. Deterministic Finite Automaton (DFA) is the basic data structure for regular expression matching. Though DFA satisfies the need of real-time processing of network traffic in on-line systems, the construction of DFA is very time-consuming that prevents it from being applied on large sets of signature. In this article, we propose two approaches to parallelize the traditional sequential subset construction algorithm, which is used to convert a non-deterministic finite automaton (NFA) to an equivalent DFA, in order to accelerate DFA construction. The first proposed algorithm PRW is based on fine-grained locks on shared data structures to ensure multiple worker threads construct a DFA in parallel safely. The second proposed algorithm ARW splits the read and write accesses on shared data structures, and distributes the read operations and write operations to the multi-thread stage and the single-thread stage respectively. Experiment demonstrates the efficiency of our algorithms on real signatures of open source systems, and the speed-up ratio of PRW and ARW is up to 1.72 and 2.71 respectively with 4 worker threads.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aho, V., Sethi, R., Ullman, J.D.: Compilers: Principles, Techniques, and Tools. Addison-Wesley Publishing Co., Boston (1986) Aho, V., Sethi, R., Ullman, J.D.: Compilers: Principles, Techniques, and Tools. Addison-Wesley Publishing Co., Boston (1986)
2.
go back to reference Maged, M.M., Michael, L.S.: Simple, fast, and practical non-blocking and blocking concurrent queue algorithms. In: Proceedings of the 15th annual ACM symposium on Principles of distributed computing, pp. 267 − 275 (1996) Maged, M.M., Michael, L.S.: Simple, fast, and practical non-blocking and blocking concurrent queue algorithms. In: Proceedings of the 15th annual ACM symposium on Principles of distributed computing, pp. 267 − 275 (1996)
3.
go back to reference Liu, Y., Guo, L., Guo, M., Liu, P.: Accelerating DFA Construction by Hierarchical Merging. In: Proceedings of the 2011 IEEE Ninth International Symposium on Parallel and Distributed Processing with Applications (ISPA), pp. 1 − 6 (2011) Liu, Y., Guo, L., Guo, M., Liu, P.: Accelerating DFA Construction by Hierarchical Merging. In: Proceedings of the 2011 IEEE Ninth International Symposium on Parallel and Distributed Processing with Applications (ISPA), pp. 1 − 6 (2011)
4.
go back to reference Becchi, M., Crowley, P.: An improved algorithm to accelerate regular expression evaluation. In: Proceedings of the 3rd ACM/IEEE Symposium on Architecture for networking and communications systems, pp. 145–154 (2007) Becchi, M., Crowley, P.: An improved algorithm to accelerate regular expression evaluation. In: Proceedings of the 3rd ACM/IEEE Symposium on Architecture for networking and communications systems, pp. 145–154 (2007)
5.
go back to reference Hopcroft, J.E., Motwani, R., Ullman, J.D.: Introduction to automata theory, languages, and computation, 2nd edn. Addison Wesley, Boston (2000) Hopcroft, J.E., Motwani, R., Ullman, J.D.: Introduction to automata theory, languages, and computation, 2nd edn. Addison Wesley, Boston (2000)
6.
go back to reference Leiss, E.: Constructing a finite automaton for a given regular expression. ACM SIGACT News 12(3), 81–87 (1980)CrossRef Leiss, E.: Constructing a finite automaton for a given regular expression. ACM SIGACT News 12(3), 81–87 (1980)CrossRef
7.
go back to reference Leslie, T.: Efficient approaches to subset construction. Master thesis, the University of Waterloo, Canada (1995) Leslie, T.: Efficient approaches to subset construction. Master thesis, the University of Waterloo, Canada (1995)
8.
go back to reference Chang, C.H., Paige, R.: From regular expressions to DFA’s using compressed NFA’s. In: Apostolico, A., Crochemore, M., Galil, Z., Manber, U. (eds.) Combinatorial Pattern Matching, vol. 644, pp. 90–110. Springer, Heidelberg (1992)CrossRef Chang, C.H., Paige, R.: From regular expressions to DFA’s using compressed NFA’s. In: Apostolico, A., Crochemore, M., Galil, Z., Manber, U. (eds.) Combinatorial Pattern Matching, vol. 644, pp. 90–110. Springer, Heidelberg (1992)CrossRef
9.
go back to reference Chen, S., Su, J.: Protocol identification research based on content analysis. J. Nat. Univ. Def. Technol. 30(4), 82–87 (2008)MathSciNet Chen, S., Su, J.: Protocol identification research based on content analysis. J. Nat. Univ. Def. Technol. 30(4), 82–87 (2008)MathSciNet
10.
go back to reference Choi, H., Burgstaller, B.: Non-blocking Parallel Subset construction on shared-memory multicore architectures. In: proceedings of the 11th Australasian symposium on parallel and distributed computing (AusPDC 2013), pp. 13 − 20 (2013) Choi, H., Burgstaller, B.: Non-blocking Parallel Subset construction on shared-memory multicore architectures. In: proceedings of the 11th Australasian symposium on parallel and distributed computing (AusPDC 2013), pp. 13 − 20 (2013)
Metadata
Title
Accelerating DFA Construction by Parallelizing Subset Construction
Authors
Yan Shao
Yanbing Liu
Jianlong Tan
Copyright Year
2015
Publisher
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-662-47401-3_3

Premium Partner