2006 | OriginalPaper | Chapter
Searching Text Corpora with grep
Author : Tomasz Obrębski
Published in: Intelligent Information Processing and Web Mining
Publisher: Springer Berlin Heidelberg
Activate our intelligent search to find suitable subject content or patents.
Select sections of text to find matching patents with Artificial Intelligence. powered by
Select sections of text to find additional relevant content using AI-assisted search. powered by
The paper presents simple methods for perfoming pattern search on annotated text corpora. Elementary text processing techniques are applied, based on the use of common text scanning tools:
flex
and
grep
. The methods allow to properly handle ambiguous annotation, as well as structured tags. Processing times for some types of queries are comparable to those attained by elaborated search engines using indexing techniques with query languages of similar expressiveness.