Hostname: page-component-76fb5796d-9pm4c Total loading time: 0 Render date: 2024-04-27T17:27:53.666Z Has data issue: false hasContentIssue false

Regular expressions for language engineering

Published online by Cambridge University Press:  01 December 1996

L. KARTTUNEN
Affiliation:
Rank Xerox Research Centre (RXRC), 6 Chemin de Maupertuis, 38240 Meylan, France. e-mail: Lauri.Karttunen@grenoble.rxrc.xerox.com, Jean-Pierre.Chanod@grenoble.rxrc.xerox.com, Gregory.Grefenstette@grenoble.rxrc.xerox.com, Anne.Schiller@grenoble.rxrc.xerox.com
J-P. CHANOD
Affiliation:
Rank Xerox Research Centre (RXRC), 6 Chemin de Maupertuis, 38240 Meylan, France. e-mail: Lauri.Karttunen@grenoble.rxrc.xerox.com, Jean-Pierre.Chanod@grenoble.rxrc.xerox.com, Gregory.Grefenstette@grenoble.rxrc.xerox.com, Anne.Schiller@grenoble.rxrc.xerox.com
G. GREFENSTETTE
Affiliation:
Rank Xerox Research Centre (RXRC), 6 Chemin de Maupertuis, 38240 Meylan, France. e-mail: Lauri.Karttunen@grenoble.rxrc.xerox.com, Jean-Pierre.Chanod@grenoble.rxrc.xerox.com, Gregory.Grefenstette@grenoble.rxrc.xerox.com, Anne.Schiller@grenoble.rxrc.xerox.com
A. SCHILLE
Affiliation:
Rank Xerox Research Centre (RXRC), 6 Chemin de Maupertuis, 38240 Meylan, France. e-mail: Lauri.Karttunen@grenoble.rxrc.xerox.com, Jean-Pierre.Chanod@grenoble.rxrc.xerox.com, Gregory.Grefenstette@grenoble.rxrc.xerox.com, Anne.Schiller@grenoble.rxrc.xerox.com

Abstract

Many of the processing steps in natural language engineering can be performed using finite state transducers. An optimal way to create such transducers is to compile them from regular expressions. This paper is an introduction to the regular expression calculus, extended with certain operators that have proved very useful in natural language applications ranging from tokenization to light parsing. The examples in the paper illustrate in concrete detail some of these applications.

Type
Research Article
Copyright
1997 Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)