Skip to main content
Erschienen in: Mathematics in Computer Science 1/2022

01.03.2022

Topological Analysis of Syntactic Structures

verfasst von: Alexander Port, Taelin Karidi, Matilde Marcolli

Erschienen in: Mathematics in Computer Science | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We use the persistent homology method of topological data analysis and dimensional analysis techniques to study data of syntactic structures of world languages. We analyze relations between syntactic parameters in terms of dimensionality, of hierarchical clustering structures, and of non-trivial loops. We show there are relations that hold across language families and additional relations that are family-specific. We then analyze the trees describing the merging structure of persistent connected components for languages in different language families and we show that they partly correlate to historical phylogenetic trees but with significant differences. We also show the existence of interesting non-trivial persistent first homology groups in various language families. We give examples where explicit generators for the persistent first homology can be identified, some of which appear to correspond to homoplasy phenomena, while others may have an explanation in terms of historical linguistics, corresponding to known cases of syntactic borrowing across different language subfamilies.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Baker, M.: The Atoms of Language. Basic Books, New York (2001) Baker, M.: The Atoms of Language. Basic Books, New York (2001)
2.
Zurück zum Zitat Barannikov, S.A.: The Framed Morse complex and its invariants. Adv. Soviet Math. 21, 93–115 (1994)MathSciNetMATH Barannikov, S.A.: The Framed Morse complex and its invariants. Adv. Soviet Math. 21, 93–115 (1994)MathSciNetMATH
3.
Zurück zum Zitat Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15(6), 1373–1396 (2003)CrossRefMATH Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Comput. 15(6), 1373–1396 (2003)CrossRefMATH
4.
Zurück zum Zitat Boissonnat, J.D., Chazal, F., Yvinec, M.: Geometric and Topological Inference. Cambridge University Press, Cambridge (2018)CrossRefMATH Boissonnat, J.D., Chazal, F., Yvinec, M.: Geometric and Topological Inference. Cambridge University Press, Cambridge (2018)CrossRefMATH
5.
Zurück zum Zitat Bouckaert, R., Lemey, P., Dunn, M., Greenhill, S.J., Alekseyenko, A.V., Drummond, A.J., Gray, R.D., Suchard, M.A., Atkinson, Q.D.: Mapping the origins and expansion of the Indo-European language family. Science 337, 957–960 (2012)CrossRef Bouckaert, R., Lemey, P., Dunn, M., Greenhill, S.J., Alekseyenko, A.V., Drummond, A.J., Gray, R.D., Suchard, M.A., Atkinson, Q.D.: Mapping the origins and expansion of the Indo-European language family. Science 337, 957–960 (2012)CrossRef
8.
Zurück zum Zitat Ceolin, A., Guardiano, C., Irimia, M.A., Longobardi, G.: Formal syntax and deep history. Front. Psychol. 11, 2384 (2020)CrossRef Ceolin, A., Guardiano, C., Irimia, M.A., Longobardi, G.: Formal syntax and deep history. Front. Psychol. 11, 2384 (2020)CrossRef
9.
Zurück zum Zitat Chomsky, N.: Lectures on Government and Binding. Foris Publications, Dordrecht (1982) Chomsky, N.: Lectures on Government and Binding. Foris Publications, Dordrecht (1982)
10.
Zurück zum Zitat Chomsky, N., Lasnik, H.: The theory of principles and parameters. In: Syntax: An International Handbook of Contemporary Research, pp. 506–569. de Gruyter (1993) Chomsky, N., Lasnik, H.: The theory of principles and parameters. In: Syntax: An International Handbook of Contemporary Research, pp. 506–569. de Gruyter (1993)
11.
Zurück zum Zitat Edelsbrunner, H., Harer, J.L.: Computational Topology. American Mathematical Society, Providence (2010)MATH Edelsbrunner, H., Harer, J.L.: Computational Topology. American Mathematical Society, Providence (2010)MATH
13.
Zurück zum Zitat Genis, R.: Comparing verbal aspect in Slavic and Gothic. In: Language for its own sake: essays on language and literature offered to Harry Perridon. Amsterdam contributions to Scandinavian studies. No. 8, 59–80 (2012) Genis, R.: Comparing verbal aspect in Slavic and Gothic. In: Language for its own sake: essays on language and literature offered to Harry Perridon. Amsterdam contributions to Scandinavian studies. No. 8, 59–80 (2012)
15.
Zurück zum Zitat Guardiano, C., Michelioudakis, D., Ceolin, A., Irimia, M., Longobardi, G., Radkevich, N., Sitaridou, I., Silvestri, G.: South by Southeast. A syntactic approach to greek and romance microvariation. L’Italia Dialettale 77, 95–166 (2016) Guardiano, C., Michelioudakis, D., Ceolin, A., Irimia, M., Longobardi, G., Radkevich, N., Sitaridou, I., Silvestri, G.: South by Southeast. A syntactic approach to greek and romance microvariation. L’Italia Dialettale 77, 95–166 (2016)
16.
Zurück zum Zitat Ghrist, R.: Elementary Applied Topology. CreateSpace (2014) Ghrist, R.: Elementary Applied Topology. CreateSpace (2014)
17.
Zurück zum Zitat Holzer, G.: Germanische Lehnwörter im Urslavischen: Methodologisches zu ihrer Identifizierung. Croatica, Slavica, Indoeuropea. Wien: Österreichischen Akademie der Wissenschaften. Series: Wiener Slawistisches Jahrbuch, Ergänzungsband; VIII, pp. 59–67 (1990) Holzer, G.: Germanische Lehnwörter im Urslavischen: Methodologisches zu ihrer Identifizierung. Croatica, Slavica, Indoeuropea. Wien: Österreichischen Akademie der Wissenschaften. Series: Wiener Slawistisches Jahrbuch, Ergänzungsband; VIII, pp. 59–67 (1990)
18.
Zurück zum Zitat Karimi, S., Piattelli-Palmarini, M. (Eds.): Special Issue on Parameters, Linguistic Analysis, Vol. 41, No. 3–4 (2017) Karimi, S., Piattelli-Palmarini, M. (Eds.): Special Issue on Parameters, Linguistic Analysis, Vol. 41, No. 3–4 (2017)
20.
Zurück zum Zitat Kazakov, D., Cordoni, G., Algahtani, E., Ceolin, A., Irimia, M., Kim, S.S., Michelioudakis, D., Radkevich, N., Guardiano, C., Longobardi, G.: Learning Implicational Models of Universal Grammar Parameters. EVOLANG XII: 16–19 April 2018, Torun, Poland Kazakov, D., Cordoni, G., Algahtani, E., Ceolin, A., Irimia, M., Kim, S.S., Michelioudakis, D., Radkevich, N., Guardiano, C., Longobardi, G.: Learning Implicational Models of Universal Grammar Parameters. EVOLANG XII: 16–19 April 2018, Torun, Poland
21.
Zurück zum Zitat Longobardi, G.: Methods in parametric linguistics and cognitive history. Linguist. Var. Yearb. 3, 101–138 (2003)CrossRef Longobardi, G.: Methods in parametric linguistics and cognitive history. Linguist. Var. Yearb. 3, 101–138 (2003)CrossRef
22.
Zurück zum Zitat Longobardi, G.: Principles, parameters, and schemata. A constructivist UG. Linguist. Anal. 41(3–4), 517–557 (2017) Longobardi, G.: Principles, parameters, and schemata. A constructivist UG. Linguist. Anal. 41(3–4), 517–557 (2017)
23.
Zurück zum Zitat Longobardi, G., Ceolin, A.: The mathematics of parametric comparison, talk at workshop “Phylogenetic Linguistics and Linguistic Theory.” University of York, York Centre for Linguistic History and Diversity (2018) Longobardi, G., Ceolin, A.: The mathematics of parametric comparison, talk at workshop “Phylogenetic Linguistics and Linguistic Theory.” University of York, York Centre for Linguistic History and Diversity (2018)
24.
Zurück zum Zitat Longobardi, G., Guardiano, C.: Evidence for syntax as a signal of historical relatedness. Lingua 119, 1679–1706 (2009)CrossRef Longobardi, G., Guardiano, C.: Evidence for syntax as a signal of historical relatedness. Lingua 119, 1679–1706 (2009)CrossRef
25.
Zurück zum Zitat Longobardi, G., Guardiano, C., Silvestri, G., Boattini, A., Ceolin, A.: Towards a syntactic phylogeny of modern Indo-European languages. J. Hist. Linguist. 3(1), 122–152 (2013)CrossRef Longobardi, G., Guardiano, C., Silvestri, G., Boattini, A., Ceolin, A.: Towards a syntactic phylogeny of modern Indo-European languages. J. Hist. Linguist. 3(1), 122–152 (2013)CrossRef
26.
Zurück zum Zitat Longobardi, G., Buch, A., Ceolin, A., Ecay, A., Guardiano, C., Irimia, M., Michelioudakis, D., Radkevich, N., Jaeger, G.: Correlated evolution or not? Phylogenetic linguistics with syntactic, cognacy, and phonetic data. In: Roberts, S.G. et al. (eds.) The Evolution of Language: Proceedings of the 11th International Conference (EVOLANGX11). http://evolang.org/neworleans/papers/162.html (2011) Longobardi, G., Buch, A., Ceolin, A., Ecay, A., Guardiano, C., Irimia, M., Michelioudakis, D., Radkevich, N., Jaeger, G.: Correlated evolution or not? Phylogenetic linguistics with syntactic, cognacy, and phonetic data. In: Roberts, S.G. et al. (eds.) The Evolution of Language: Proceedings of the 11th International Conference (EVOLANGX11). http://​evolang.​org/​neworleans/​papers/​162.​html (2011)
27.
Zurück zum Zitat Manin, Yu, I., Marcolli, M.: Nori diagrams and persistent homology. Math. Comput. Sci 14(1), 77–102 (2020) Manin, Yu, I., Marcolli, M.: Nori diagrams and persistent homology. Math. Comput. Sci 14(1), 77–102 (2020)
28.
Zurück zum Zitat Marcantonio, A.: The Uralic Language Family: Facts, Myths and Statistics, Publications of the Philological Society, vol. 35. Blackwell, London (2002) Marcantonio, A.: The Uralic Language Family: Facts, Myths and Statistics, Publications of the Philological Society, vol. 35. Blackwell, London (2002)
29.
Zurück zum Zitat Marcolli, M.: Syntactic parameters and a coding theory perspective on entropy and complexity of language families. Entropy 18, 110 (2016)MathSciNetCrossRef Marcolli, M.: Syntactic parameters and a coding theory perspective on entropy and complexity of language families. Entropy 18, 110 (2016)MathSciNetCrossRef
30.
Zurück zum Zitat Martynov, V.V.: Iazyk v prostranstve i vremeni. Nauka (1983) Martynov, V.V.: Iazyk v prostranstve i vremeni. Nauka (1983)
31.
Zurück zum Zitat Militarev, A.: Genealogical classification of Afro-Asiatic languages according to the latest data, talk at the conference on the 70th anniversary of V.M. Illich-Svitych, Moscow (2004) Militarev, A.: Genealogical classification of Afro-Asiatic languages according to the latest data, talk at the conference on the 70th anniversary of V.M. Illich-Svitych, Moscow (2004)
32.
Zurück zum Zitat Mišeska-Tomić, O.: Balkan Sprachbund. Morpho-syntactic Features. Springer, Dordrecht (2006) Mišeska-Tomić, O.: Balkan Sprachbund. Morpho-syntactic Features. Springer, Dordrecht (2006)
33.
Zurück zum Zitat Ortegaray, A., Berwick, R.C., Marcolli, M.: Heat Kernel analysis of syntactic structures. Math. Comput. Sci 15(4), 643–660 (2021)MathSciNetCrossRefMATH Ortegaray, A., Berwick, R.C., Marcolli, M.: Heat Kernel analysis of syntactic structures. Math. Comput. Sci 15(4), 643–660 (2021)MathSciNetCrossRefMATH
34.
Zurück zum Zitat Pachter, L., Sturmfels, B.: Algebraic Statistics for Computational Biology. Cambridge University Press, Cambridge (2005)CrossRefMATH Pachter, L., Sturmfels, B.: Algebraic Statistics for Computational Biology. Cambridge University Press, Cambridge (2005)CrossRefMATH
35.
Zurück zum Zitat Park, J.J., Boettcher, R., Zhao, A., Mun, A., Yuh, K., Kumar, V., Marcolli, M.: Prevalence and recoverability of syntactic parameters in sparse distributed memories. In: Geometric Science of Information. Third International Conference GSI 2017, pp. 265–272, Lecture Notes in Computer Science, vol. 10589. Springer (2017) Park, J.J., Boettcher, R., Zhao, A., Mun, A., Yuh, K., Kumar, V., Marcolli, M.: Prevalence and recoverability of syntactic parameters in sparse distributed memories. In: Geometric Science of Information. Third International Conference GSI 2017, pp. 265–272, Lecture Notes in Computer Science, vol. 10589. Springer (2017)
36.
Zurück zum Zitat Perelysvaig, A., Lewis, M.W.: The Indo-European Controversy: Facts and Fallacies in Historical Linguistics. Cambridge University Press, Cambridge (2015)CrossRef Perelysvaig, A., Lewis, M.W.: The Indo-European Controversy: Facts and Fallacies in Historical Linguistics. Cambridge University Press, Cambridge (2015)CrossRef
37.
Zurück zum Zitat Port, A., Gheorghita, I., Guth, D., Clark, J.M., Liang, C., Dasu, S., Marcolli, M.: Persistent topology of syntax. Math. Comput. Sci. 12(1), 33–50 (2018)MathSciNetCrossRefMATH Port, A., Gheorghita, I., Guth, D., Clark, J.M., Liang, C., Dasu, S., Marcolli, M.: Persistent topology of syntax. Math. Comput. Sci. 12(1), 33–50 (2018)MathSciNetCrossRefMATH
38.
Zurück zum Zitat Ringe, D., Warnow, T., Taylor, A.: Indo-European and computational cladistics. Trans. Philol. Soc. 100, 59–129 (2002)CrossRef Ringe, D., Warnow, T., Taylor, A.: Indo-European and computational cladistics. Trans. Philol. Soc. 100, 59–129 (2002)CrossRef
39.
Zurück zum Zitat Shu, K., Ortegaray, A., Berwick, R.C., Marcolli, M.: Phylogenetics of Indo-European language families via an Phylogenetics of Indo-European language families via an algebro-geometric analysis of their syntactic structures. Math. Comput. Sci 15(4), 803–857 (2021)MathSciNetCrossRefMATH Shu, K., Ortegaray, A., Berwick, R.C., Marcolli, M.: Phylogenetics of Indo-European language families via an Phylogenetics of Indo-European language families via an algebro-geometric analysis of their syntactic structures. Math. Comput. Sci 15(4), 803–857 (2021)MathSciNetCrossRefMATH
41.
Zurück zum Zitat Shu, K., Aziz, S., Huynh, V.L., Warrick, D., Marcolli, M.: Syntactic phylogenetic trees. In: Foundations of Mathematics and Physics One Century After Hilbert, pp. 417–441. Springer, Cham (2018) Shu, K., Aziz, S., Huynh, V.L., Warrick, D., Marcolli, M.: Syntactic phylogenetic trees. In: Foundations of Mathematics and Physics One Century After Hilbert, pp. 417–441. Springer, Cham (2018)
42.
Zurück zum Zitat Sinor, D.: The problem of the Ural–Altaic relationship. In: The Uralic Languages: Description, History and Modern Influences. Brill, pp. 706–741 (1988) Sinor, D.: The problem of the Ural–Altaic relationship. In: The Uralic Languages: Description, History and Modern Influences. Brill, pp. 706–741 (1988)
43.
Zurück zum Zitat Siva, K., Tao, J., Marcolli, M.: Spin glass models of syntax and language evolution. Linguist. Anal. 41(3–4), 559–608 (2017) Siva, K., Tao, J., Marcolli, M.: Spin glass models of syntax and language evolution. Linguist. Anal. 41(3–4), 559–608 (2017)
44.
Zurück zum Zitat Zomorodian, A.J.: Topology for computing. Cambridge University Press, Cambridge (2005)CrossRefMATH Zomorodian, A.J.: Topology for computing. Cambridge University Press, Cambridge (2005)CrossRefMATH
Metadaten
Titel
Topological Analysis of Syntactic Structures
verfasst von
Alexander Port
Taelin Karidi
Matilde Marcolli
Publikationsdatum
01.03.2022
Verlag
Springer International Publishing
Erschienen in
Mathematics in Computer Science / Ausgabe 1/2022
Print ISSN: 1661-8270
Elektronische ISSN: 1661-8289
DOI
https://doi.org/10.1007/s11786-021-00520-5

Weitere Artikel der Ausgabe 1/2022

Mathematics in Computer Science 1/2022 Zur Ausgabe

Premium Partner