article

Partial fillup and search time in LC tries

Authors:
Svante Janson

Uppsala University, Uppsala, Sweden

Uppsala University, Uppsala, Sweden
View Profile

,
Wojciech Szpankowski

Purdue University, West Lafayette, IN

Purdue University, West Lafayette, IN
View Profile

Authors Info & Claims

ACM Transactions on Algorithms Volume 3 Issue 4pp 44–eshttps://doi.org/10.1145/1290672.1290681

Published:01 November 2007Publication History

ACM Transactions on Algorithms

Abstract

Andersson and Nilsson introduced in 1993 a level-compressed trie (for short, LC trie) in which a full subtree of a node is compressed to a single node of degree being the size of the subtree. Recent experimental results indicated a “dramatic improvement” when full subtrees are replaced by “partially filled subtrees.” In this article, we provide a theoretical justification of these experimental results, showing, among others, a rather moderate improvement in search time over the original LC tries. For such an analysis, we assume that n strings are generated independently by a binary memoryless source, with p denoting the probability of emitting a “1” (and q = 1 − p). We first prove that the so-called α-fillup level F_n(α) (i.e., the largest level in a trie with α fraction of nodes present at this level) is concentrated on two values with high probability: either F_n(α) = k_n or F_n(α) = k_n + 1, where k_n = log_1/√pq n − |ln (p/q)|/2 ln^3/2 (1√pq) Φ⁻¹ (α) √ ln n + O(1) is an integer and Φ(x) denotes the normal distribution function. This result directly yields the typical depth (search time) D_n(α) in the α-LC tries, namely, we show that with high probability D_n(α) ∼ C₂ log log n, where C₂ = 1/|log(1 − h/log(1/√pq))| for p ≠ q and h = −plog p−qlog q is the Shannon entropy rate. This should be compared with recently found typical depth in the original LC tries, which is C₁log log n, where C₁ = 1/|log(1−h/log(1/min{p, 1−p}))|. In conclusion, we observe that α affects only the lower term of the α-fillup level F_n(α), and the search time in α-LC tries is of the same order as in the original LC tries.

References

Andersson, A., and Nilsson, S. 1993. Improved behavior of tries by adaptive branching, Inf. Proc. Lett. 46, 295--300. Google ScholarDigital Library
Devroye, L. 1992. A note on the probabilistic analysis of Patricia tries. Random Struct. Alg. 3, 203--214.Google ScholarDigital Library
Devroye, L. 2001. An analysis of random LC tries. Random Struc. Alg. 19, 359--375. Google ScholarDigital Library
Devroye, L., and Szpankowski, W. 2005. Probabilistic behavior of asymmetric level compressed tries. Random Struct. Alg. 27, 2, 185--200. Google ScholarDigital Library
de Moivre, A. 1738. The Doctrine of Chances, 2nd ed., H. Woodfall, London.Google Scholar
Feller, W. 1971. An Introduction to Probability Theory and its Applications. Vol. II. 2nd ed., Wiley, New York.Google Scholar
Gusfield, D. 1997. Algorithms on Strings, Trees, and Sequences. Cambridge University Press, New York. Google ScholarDigital Library
Iivonen, P., Nilsson, S., and Tikkanen, M. 1999. An experimental study of compression methods for functional tries. In Workshop on Algorithmic Aspects of Advanced Programming Languages (WAAAPL).Google Scholar
Jacquet, P., and Szpankowski, W. 1991. Analysis of digital tries with Markovian dependency. IEEE Trans. Inf. Theory 37, 1470--1475.Google ScholarDigital Library
Jacquet, P., and Szpankowski, W. 1998. Analytical dePoissonization and its applications. Theor. Comput. Sci. 201, 1--62. Google ScholarDigital Library
Knessl, C., and Szpankowski, W. 2004. On the number of full levels in tries. Random Struct. Alg. 25, 247--276. Google ScholarDigital Library
Knuth, D. E. 1997. The Art of Computer Programming. Vol. 1: Fundamental Algorithms, 3rd ed. Addison-Wesley, Reading, MA. Google ScholarDigital Library
Knuth, D. E. 2000. Selected Papers on Analysis of Algorithms, CSLI, Stanford, CA. Google ScholarDigital Library
Mahmoud, H. 1992. Evolution of Random Search Trees. John Wiley and Sons, New York.Google Scholar
Nilsson, S. 1996. Radix Sorting and Searching. PhD Thesis, Lund University.Google Scholar
Nilsson, S., and Karlsson, G. 1998. Fast address look-up for Internet routers. In Proceedings of the IFIP 4th International Conference on Broadband Communications, 11--22. Google ScholarDigital Library
Nilsson, S., and Karlsson, G. 1999. IP-address lookup using LC-tries. IEEE J. Select. Areas in Commun. 17, 6, 1083--1092. Google ScholarDigital Library
Nilsson, S., and Tikkanen, M. 2002. An experimental study of compression methods for dynamic tries. Algorithmica 33, 1, 19--33.Google ScholarDigital Library
Pittel, B. 1985. Asymptotic growth of a class of random trees. Ann. Probab. 18, 414--427.Google ScholarCross Ref
Pittel, B. 1986. Paths in a random digital tree: limiting distributions, Adv. Appl. Probab. 18, 139--155.Google ScholarCross Ref
Reznik, Y. 2002. Some results on tries with adaptive branching. Theor. Comput. Sci. 289, 1009--1026. Google ScholarDigital Library
Reznik, Y. 2005. On the average density and selectivity of nodes in multi-digit tries. In Proceedings of the 7th Workshop on Algorithm Engineering and Experiments and the 2nd Workshop on Analytic Algorithmics and Combinatorics (ALENEX/ANALCO) (Vancouver, British Columbia, Canada). SIAM, 230--239.Google Scholar
Srinivasan, V., and Varghese, G. 1998. Fast address lookups using controlled prefix expansions. ACM SIGMETRICS. Google ScholarDigital Library
Szpankowski, W. 1991. On the height of digital trees and related problems. Algorithmica 6, 256--277.Google ScholarDigital Library
Szpankowski, W. 2001. Average Case Analysis of Algorithms on Sequences. John Wiley, New York. Google ScholarDigital Library

Index Terms

Partial fillup and search time in LC tries
1. Mathematics of computing
  1. Discrete mathematics
    1. Combinatorics
      1. Enumeration
      2. Generating functions
2. Theory of computation
  1. Randomness, geometry and discrete structures

Recommendations

Partial fillup and search time in LC tries
ANALCO '06: Proceedings of the Meeting on Analytic Algorithmics and Combinatorics

Andersson and Nilsson introduced in 1993 a level-compressed trie (in short: LC trie) in which a full subtree of a node is compressed to a single node of degree being the size of the subtree. Recent experimental results indicated a "dramatic improvement" ...
Read More
Profiles of PATRICIA Tries

A PATRICIA trie is a trie in which non-branching paths are compressed. The external profile $$B_{n,k}$$Bn,k, defined to be the number of leaves at level k of a PATRICIA trie on n nodes, is an important "summarizing" parameter, in terms of which several ...
Read More
Succinct indexes for strings, binary relations and multilabeled trees

We define and design succinct indexes for several abstract data types (ADTs). The concept is to design auxiliary data structures that ideally occupy asymptotically less space than the information-theoretic lower bound on the space required to encode the ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Algorithms Volume 3, Issue 4
November 2007
293 pages
ISSN:1549-6325
EISSN:1549-6333
DOI:10.1145/1290672
Issue’s Table of Contents

Copyright © 2007 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 November 2007
Published in talg Volume 3, Issue 4

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Digital trees
Poissonization
level-compressed tries
partial fillup
probabilistic analysis
strings
trees
Qualifiers
- article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 316
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Partial fillup and search time in LC tries

ACM Transactions on Algorithms

Abstract

References

Cited By

Index Terms

Recommendations

Partial fillup and search time in LC tries

Profiles of PATRICIA Tries

Succinct indexes for strings, binary relations and multilabeled trees

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Partial fillup and search time in LC tries

ACM Transactions on Algorithms

Abstract

References

Cited By

Index Terms

Recommendations

Partial fillup and search time in LC tries

Profiles of PATRICIA Tries

Succinct indexes for strings, binary relations and multilabeled trees

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media