Skip to main content
Top

2020 | OriginalPaper | Chapter

Lyndon Words versus Inverse Lyndon Words: Queries on Suffixes and Bordered Words

Authors : Paola Bonizzoni, Clelia De Felice, Rocco Zaccagnino, Rosalba Zizza

Published in: Language and Automata Theory and Applications

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The Lyndon factorization of a word has been extensively studied in different contexts and several variants of it have been proposed. In particular, the canonical inverse Lyndon factorization \(\mathrm {ICFL}\), introduced in [5], maintains the main properties of the Lyndon factorization since it can be computed in linear time and it is uniquely determined. In this paper we investigate new properties of this factorization with the purpose of exploring its use in string queries.
As a main result, we prove an upper bound on the length of the longest common extension (or longest common prefix) for two factors of a word w. This bound is at most the maximum length of two consecutive factors of \(\mathrm {ICFL}(w)\). A tool used in the proof is a property that we state for factors with nonempty borders in \(\mathrm {ICFL}(w)\): a nonempty border of a factor \(m_i\) cannot be a prefix of the next factor \(m_{i+1}\). Another interesting result relates sorting of global suffixes, i.e., suffixes of a word w, and sorting of local suffixes, i.e., suffixes of the factors in \(\mathrm {ICFL}(w)\).
Finally, given a word w and a factor x of w, we prove that their Lyndon factorizations share factors, except for the first and last term of the Lyndon factorization of x. This property suggests that, given two words sharing a common overlap, their Lyndon factorizations could be used to capture the common overlap of these two words.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Apostolico, A., Crochemore, M.: Fast parallel Lyndon factorization with applications. Math. Syst. Theory 28(2), 89–108 (1995)MathSciNetCrossRef Apostolico, A., Crochemore, M.: Fast parallel Lyndon factorization with applications. Math. Syst. Theory 28(2), 89–108 (1995)MathSciNetCrossRef
2.
go back to reference Bannai, H., Tomohiro, I., Inenaga, S., Nakashima, Y., Takeda, M., Tsuruta, K.: The “Runs” theorem. SIAM J. Comput. 46(5), 1501–1514 (2017)MathSciNetCrossRef Bannai, H., Tomohiro, I., Inenaga, S., Nakashima, Y., Takeda, M., Tsuruta, K.: The “Runs” theorem. SIAM J. Comput. 46(5), 1501–1514 (2017)MathSciNetCrossRef
3.
go back to reference Bannai, H., Tomohiro, I., Inenaga, S., Nakashima, Y., Takeda, M., Tsuruta, K.: A new characterization of maximal repetitions by Lyndon trees. In: Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2015, San Diego, CA, USA, 4–6 January 2015, pp. 562–571 (2015) Bannai, H., Tomohiro, I., Inenaga, S., Nakashima, Y., Takeda, M., Tsuruta, K.: A new characterization of maximal repetitions by Lyndon trees. In: Proceedings of the Twenty-Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2015, San Diego, CA, USA, 4–6 January 2015, pp. 562–571 (2015)
4.
go back to reference Berstel, J., Perrin, D., Reutenauer, C.: Codes and Automata. Encyclopedia of Mathematics and its Applications, vol. 129. Cambridge University Press, Cambridge (2009)CrossRef Berstel, J., Perrin, D., Reutenauer, C.: Codes and Automata. Encyclopedia of Mathematics and its Applications, vol. 129. Cambridge University Press, Cambridge (2009)CrossRef
5.
go back to reference Bonizzoni, P., De Felice, C., Zaccagnino, R., Zizza, R.: Inverse Lyndon words and inverse Lyndon factorizations of words. Adv. Appl. Math. 101, 281–319 (2018)MathSciNetCrossRef Bonizzoni, P., De Felice, C., Zaccagnino, R., Zizza, R.: Inverse Lyndon words and inverse Lyndon factorizations of words. Adv. Appl. Math. 101, 281–319 (2018)MathSciNetCrossRef
7.
go back to reference Bonizzoni, P., Della Vedova, G., Pirola, Y., Previtali, M., Rizzi, R.: An external-memory algorithm for string graph construction. Algorithmica 78(2), 394–424 (2017)MathSciNetCrossRef Bonizzoni, P., Della Vedova, G., Pirola, Y., Previtali, M., Rizzi, R.: An external-memory algorithm for string graph construction. Algorithmica 78(2), 394–424 (2017)MathSciNetCrossRef
8.
go back to reference Bonizzoni, P., Della Vedova, G., Pirola, Y., Previtali, M., Rizzi, R.: FSG: fast string graph construction for de novo assembly. J. Comput. Biol. 24(10), 953–968 (2017)MathSciNetCrossRef Bonizzoni, P., Della Vedova, G., Pirola, Y., Previtali, M., Rizzi, R.: FSG: fast string graph construction for de novo assembly. J. Comput. Biol. 24(10), 953–968 (2017)MathSciNetCrossRef
9.
go back to reference Chen, K.T., Fox, R.H., Lyndon, R.C.: Free differential calculus, IV. The quotient groups of the lower central series. Ann. Math. 68, 81–95 (1958)MathSciNetCrossRef Chen, K.T., Fox, R.H., Lyndon, R.C.: Free differential calculus, IV. The quotient groups of the lower central series. Ann. Math. 68, 81–95 (1958)MathSciNetCrossRef
11.
go back to reference Crochemore, M., Désarménien, J., Perrin, D.: A note on the Burrows-Wheeler transformation. Theoret. Comput. Sci. 332(1), 567–572 (2005)MathSciNetCrossRef Crochemore, M., Désarménien, J., Perrin, D.: A note on the Burrows-Wheeler transformation. Theoret. Comput. Sci. 332(1), 567–572 (2005)MathSciNetCrossRef
12.
go back to reference Crochemore, M., Hancart, C., Lecroq, T.: Algorithms on Strings. Cambridge University Press, Cambridge (2007)CrossRef Crochemore, M., Hancart, C., Lecroq, T.: Algorithms on Strings. Cambridge University Press, Cambridge (2007)CrossRef
13.
go back to reference Daykin, J.W., Iliopoulos, C.S., Smyth, W.F.: Parallel RAM algorithms for factorizing words. Theor. Comput. Sci. 127(1), 53–67 (1994)MathSciNetCrossRef Daykin, J.W., Iliopoulos, C.S., Smyth, W.F.: Parallel RAM algorithms for factorizing words. Theor. Comput. Sci. 127(1), 53–67 (1994)MathSciNetCrossRef
14.
16.
go back to reference Fredricksen, H., Maiorana, J.: Necklaces of beads in \(k\) colors and \(k\)-ary de Brujin sequences. Discrete Math. 23(3), 207–210 (1978)MathSciNetCrossRef Fredricksen, H., Maiorana, J.: Necklaces of beads in \(k\) colors and \(k\)-ary de Brujin sequences. Discrete Math. 23(3), 207–210 (1978)MathSciNetCrossRef
17.
go back to reference Ghuman, S.S., Giaquinta, E., Tarhio, J.: Alternative algorithms for Lyndon factorization. In: Proceedings of the Prague Stringology Conference 2014, Prague, Czech Republic, 1–3 September 2014, pp. 169–178 (2014) Ghuman, S.S., Giaquinta, E., Tarhio, J.: Alternative algorithms for Lyndon factorization. In: Proceedings of the Prague Stringology Conference 2014, Prague, Czech Republic, 1–3 September 2014, pp. 169–178 (2014)
18.
go back to reference Kärkkäinen, J., Kempa, D., Nakashima, Y., Puglisi, S.J., Shur, A.M.: On the size of Lempel-Ziv and Lyndon factorizations. In: 34th Symposium on Theoretical Aspects of Computer Science, STACS 2017, Hannover, Germany, 8–11 March 2017, pp. 45:1–45:13 (2017) Kärkkäinen, J., Kempa, D., Nakashima, Y., Puglisi, S.J., Shur, A.M.: On the size of Lempel-Ziv and Lyndon factorizations. In: 34th Symposium on Theoretical Aspects of Computer Science, STACS 2017, Hannover, Germany, 8–11 March 2017, pp. 45:1–45:13 (2017)
19.
go back to reference Kufleitner, M.: On bijective variants of the Burrows-Wheeler transform. In: Proceedings of the Prague Stringology Conference 2009, Prague, Czech Republic, 31 August–2 September 2009, pp. 65–79 (2009) Kufleitner, M.: On bijective variants of the Burrows-Wheeler transform. In: Proceedings of the Prague Stringology Conference 2009, Prague, Czech Republic, 31 August–2 September 2009, pp. 65–79 (2009)
20.
go back to reference Lothaire, M.: Algebraic Combinatorics on Words. Encyclopedia of Mathematics and its Applications, vol. 90. Cambridge University Press, Cambridge (1997)CrossRef Lothaire, M.: Algebraic Combinatorics on Words. Encyclopedia of Mathematics and its Applications, vol. 90. Cambridge University Press, Cambridge (1997)CrossRef
21.
go back to reference Lothaire, M.: Combinatorics on Words. Cambridge University Press, Cambridge (1997)CrossRef Lothaire, M.: Combinatorics on Words. Cambridge University Press, Cambridge (1997)CrossRef
22.
go back to reference Lothaire, M.: Applied Combinatorics on Words. Cambridge University Press, Cambridge (2005)CrossRef Lothaire, M.: Applied Combinatorics on Words. Cambridge University Press, Cambridge (2005)CrossRef
23.
go back to reference Mantaci, S., Restivo, A., Rosone, G., Sciortino, M.: An extension of the Burrows-Wheeler transform. Theor. Comput. Sci. 387(3), 298–312 (2007)MathSciNetCrossRef Mantaci, S., Restivo, A., Rosone, G., Sciortino, M.: An extension of the Burrows-Wheeler transform. Theor. Comput. Sci. 387(3), 298–312 (2007)MathSciNetCrossRef
24.
go back to reference Mantaci, S., Restivo, A., Rosone, G., Sciortino, M.: Sorting suffixes of a text via its Lyndon factorization. In: Proceedings of the Prague Stringology Conference 2013, Prague, Czech Republic, 2–4 September 2013, pp. 119–127 (2013) Mantaci, S., Restivo, A., Rosone, G., Sciortino, M.: Sorting suffixes of a text via its Lyndon factorization. In: Proceedings of the Prague Stringology Conference 2013, Prague, Czech Republic, 2–4 September 2013, pp. 119–127 (2013)
25.
go back to reference Mantaci, S., Restivo, A., Rosone, G., Sciortino, M.: Suffix array and Lyndon factorization of a text. J. Discrete Algorithms 28, 2–8 (2014)MathSciNetCrossRef Mantaci, S., Restivo, A., Rosone, G., Sciortino, M.: Suffix array and Lyndon factorization of a text. J. Discrete Algorithms 28, 2–8 (2014)MathSciNetCrossRef
26.
go back to reference Mucha, M.: Lyndon words and short superstrings. In: Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2013, New Orleans, Louisiana, USA, 6–8 January 2013, pp. 958–972 (2013) Mucha, M.: Lyndon words and short superstrings. In: Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2013, New Orleans, Louisiana, USA, 6–8 January 2013, pp. 958–972 (2013)
27.
go back to reference Reutenauer, C.: Free lie algebras. Oxford University Press (1993) Reutenauer, C.: Free lie algebras. Oxford University Press (1993)
28.
go back to reference Urabe, Y., Kempa, D., Nakashima, Y., Inenaga, S., Bannai, H., Takeda, M.: On the size of overlapping Lempel-Ziv and Lyndon factorizations. In: 30th Annual Symposium on Combinatorial Pattern Matching, CPM 2019, 18–20 June 2019, Pisa, Italy. LIPIcs, vol. 128, pp. 29:1–29:11. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik (2019) Urabe, Y., Kempa, D., Nakashima, Y., Inenaga, S., Bannai, H., Takeda, M.: On the size of overlapping Lempel-Ziv and Lyndon factorizations. In: 30th Annual Symposium on Combinatorial Pattern Matching, CPM 2019, 18–20 June 2019, Pisa, Italy. LIPIcs, vol. 128, pp. 29:1–29:11. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik (2019)
Metadata
Title
Lyndon Words versus Inverse Lyndon Words: Queries on Suffixes and Bordered Words
Authors
Paola Bonizzoni
Clelia De Felice
Rocco Zaccagnino
Rosalba Zizza
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-40608-0_27

Premium Partner