
2006 | Original Paper | Book Chapter

Position-Restricted Substring Searching

Authors: Veli Mäkinen, Gonzalo Navarro

Published in: LATIN 2006: Theoretical Informatics

Publisher: Springer Berlin Heidelberg


A full-text index is a data structure built over a text string $T[1,n]$. The most basic functionality provided is (a) counting how many times a pattern string $P[1,m]$ appears in $T$ and (b) locating all those $\mathit{occ}$ positions. There exist several indexes that solve (a) in $O(m)$ time and (b) in $O(\mathit{occ})$ time. In this paper we propose two new queries, (c) counting how many times $P[1,m]$ appears in $T[l,r]$ and (d) locating all those $\mathit{occ}_{l,r}$ positions. These can be solved using (a) and (b) but this requires $O(\mathit{occ})$ time. We present two solutions to (c) and (d) in this paper. The first is an index that requires $O(n \log n)$ bits of space and answers (c) in $O(m + \log n)$ time and (d) in $O(\log n)$ time per occurrence (that is, $O(\mathit{occ}_{l,r} \log n)$ time overall). A variant of the first solution answers (c) in $O(m + \log\log n)$ time and (d) in constant time per occurrence, but requires $O(n \log^{1+\epsilon} n)$ bits of space for any constant $\epsilon > 0$. The second solution requires $O(n \log \sigma)$ bits of space, solving (c) in $O(m \lceil \log \sigma / \log\log n \rceil)$ time and (d) in $O(m \lceil \log \sigma / \log\log n \rceil)$ time per occurrence, where $\sigma$ is the alphabet size. This second structure takes less space when the text is compressible.
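The baseline that the abstract compares against, answering the two position-restricted queries (counting and locating the occurrences of $P$ inside $T[l,r]$) by running the unrestricted search and then discarding occurrences outside the window, can be sketched as follows. This is only an illustration of that baseline, not the paper's index. The function names, the naive suffix array construction, and the convention that an occurrence must lie entirely inside $T[l,r]$ (1-based, inclusive) are assumptions made for this sketch.

```python
def build_suffix_array(text):
    # Naive construction by sorting all suffixes; fine for a small example.
    return sorted(range(len(text)), key=lambda i: text[i:])


def suffix_range(text, sa, pattern):
    """Return (sp, ep) such that sa[sp:ep] holds every start of pattern in text."""
    m = len(pattern)

    def first_index(strict):
        # First suffix whose length-m prefix is >= pattern (or > pattern if strict).
        lo, hi = 0, len(sa)
        while lo < hi:
            mid = (lo + hi) // 2
            prefix = text[sa[mid]:sa[mid] + m]
            if prefix < pattern or (strict and prefix == pattern):
                lo = mid + 1
            else:
                hi = mid
        return lo

    return first_index(False), first_index(True)


def locate_restricted(text, sa, pattern, l, r):
    """1-based start positions of pattern occurrences lying inside T[l, r]."""
    sp, ep = suffix_range(text, sa, pattern)
    m = len(pattern)
    # Every unrestricted occurrence is inspected, so the cost grows with occ,
    # even when few (or none) of them fall inside the window.
    starts = (p + 1 for p in sa[sp:ep])
    return sorted(s for s in starts if l <= s and s + m - 1 <= r)


def count_restricted(text, sa, pattern, l, r):
    return len(locate_restricted(text, sa, pattern, l, r))
```

For example, with `text = "abracadabra"`, the pattern `"abra"` occurs at positions 1 and 8, so restricting the window to $T[2,11]$ leaves only position 8. The filtering step is what makes this approach depend on the total number of occurrences rather than on the number inside the window.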

Our solutions can be seen as a generalization of rank and select dictionaries, which allow computing how many times a given character $c$ appears in a prefix $T[1,i]$ and also locate the $i$-th occurrence of $c$. Our solution to (c) extends character rank queries to substring rank queries, and our solution to (d) extends character select to substring select queries.
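To make the analogy concrete, the sketch below spells out what character rank and select compute and one possible reading of their substring counterparts. It is a naive, linear-time illustration of the query semantics only. The function names and the convention that a substring occurrence is counted when it lies entirely inside the prefix $T[1,i]$ are assumptions, not definitions taken from the paper.

```python
def char_rank(text, c, i):
    """Number of occurrences of character c in the prefix T[1, i] (1-based)."""
    return text[:i].count(c)


def char_select(text, c, j):
    """1-based position of the j-th occurrence of c, or None if there is none."""
    seen = 0
    for pos, ch in enumerate(text, start=1):
        if ch == c:
            seen += 1
            if seen == j:
                return pos
    return None


def occurrences(text, pattern):
    """All 1-based start positions of pattern in text, in increasing order."""
    m = len(pattern)
    return [s + 1 for s in range(len(text) - m + 1) if text[s:s + m] == pattern]


def substring_rank(text, pattern, i):
    """Number of occurrences of pattern lying entirely inside T[1, i]."""
    m = len(pattern)
    return sum(1 for s in occurrences(text, pattern) if s + m - 1 <= i)


def substring_select(text, pattern, j):
    """1-based start position of the j-th occurrence of pattern, or None."""
    occ = occurrences(text, pattern)
    return occ[j - 1] if 1 <= j <= len(occ) else None
```

With a pattern of length one, `substring_rank` and `substring_select` coincide with `char_rank` and `char_select`, which is the sense in which the substring queries generalize the character ones.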

As a byproduct, we show how rank queries can be used to implement fractional cascading in little space, so as to obtain an alternative implementation of a well-known two-dimensional range search data structure by Chazelle. We also show how Grossi et al.’s wavelet trees are suitable for two-dimensional range searching, and their connection with Chazelle’s data structure.
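As a rough illustration of why a wavelet tree supports two-dimensional range counting, the sketch below counts the positions $k$ in $[i, j)$ of an integer sequence whose values fall in $[a, b]$. It is a simplified, pointer-based version written for readability: it stores plain prefix-count lists where a succinct implementation would use bit vectors with constant-time rank, so it does not reflect the space bounds discussed in the abstract. The class and method names are hypothetical.

```python
class WaveletTree:
    def __init__(self, values, lo=None, hi=None):
        if lo is None:
            lo, hi = min(values), max(values)  # root call needs a non-empty list
        self.lo, self.hi = lo, hi
        self.left = self.right = None
        if lo == hi or not values:
            return  # leaf, or empty subtree: nothing more to store
        mid = (lo + hi) // 2
        # zeros[k] = how many of the first k values are routed to the left child.
        self.zeros = [0]
        for v in values:
            self.zeros.append(self.zeros[-1] + (v <= mid))
        self.left = WaveletTree([v for v in values if v <= mid], lo, mid)
        self.right = WaveletTree([v for v in values if v > mid], mid + 1, hi)

    def range_count(self, i, j, a, b):
        """Count positions k with i <= k < j (0-based) and a <= values[k] <= b."""
        if i >= j or b < self.lo or self.hi < a:
            return 0
        if a <= self.lo and self.hi <= b:
            return j - i  # every value under this node is inside [a, b]
        li, lj = self.zeros[i], self.zeros[j]  # map the interval to the children
        return (self.left.range_count(li, lj, a, b)
                + self.right.range_count(i - li, j - lj, a, b))
```

One way to connect this to substring searching: if the sequence is a suffix array, the first coordinate range can be the suffix array interval of a pattern and the second the window of allowed text positions, so a single `range_count` call counts the occurrences that start inside the window. Each level of the tree costs one pair of rank-style lookups, so the query visits a number of nodes proportional to the tree height along its two boundary paths.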
