Skip to main content
Top

2018 | OriginalPaper | Chapter

9. Functional Data Analysis and Knowledge-Based Systems

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In the present study, the challenge is whether a distant reading of the history of a discipline can be achieved by analysing the temporal evolution of keywords retrieved from papers in the discipline’s mainstream journals. This calls for the so-called knowledge-based system (KBS), i.e. a computer-based system that supports human learning not only by acquiring and manipulating large volumes of data and information, but also by integrating knowledge from different sources. In this chapter, we introduce a KBS that, starting from a large database of texts retrieved from scientific articles published over a lengthy period by a selection of the discipline’s premier journals, leads to the construction of a well-founded corpus of scientific literature and from this to a possible outline of the discipline’s history. Our work is based on the idea that the temporal course of a word occurrence is a proxy of the word’s life cycle. We then adopt a functional data analysis (FDA) approach under which we first reconstruct words’ life cycles. Second, by clustering words with similar life cycles, we detect any prototypical or exemplary temporal patterns representing the latent dynamics of word micro-histories. The major dynamics uncovered at this stage are then submitted to subject matter experts for interpretation and guidance in decision-making, thus making it possible to trace a history of the discipline. Moreover, we propose several kinds of data normalisation which involve different concepts of life cycle similarity and hence a different reading of the history of the discipline under examination.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Chiou, J. M., & Li, P. L. (2007). Functional clustering and identifying substructures of longitudinal data. Journal of the Royal Statistical Society: Series B, 69, 679–699.MathSciNetCrossRef Chiou, J. M., & Li, P. L. (2007). Functional clustering and identifying substructures of longitudinal data. Journal of the Royal Statistical Society: Series B, 69, 679–699.MathSciNetCrossRef
go back to reference Cobo, M., López-Herrera, A., Herrera-Viedma, E., & Herrera, F. (2011). An approach for detecting, quantifying, and visualizing the evolution of a research field: A practical application to the Fuzzy Sets Theory Field. Journal of Informetrics, 5(1), 146–166.CrossRef Cobo, M., López-Herrera, A., Herrera-Viedma, E., & Herrera, F. (2011). An approach for detecting, quantifying, and visualizing the evolution of a research field: A practical application to the Fuzzy Sets Theory Field. Journal of Informetrics, 5(1), 146–166.CrossRef
go back to reference Cobo, M., López-Herrera, A., Herrera-Viedma, E., & Herrera, F. (2012). SciMAT: A new science mapping analysis software tool. Journal of the American Society for Information Science and Technology, 63(8), 1609–1630.CrossRef Cobo, M., López-Herrera, A., Herrera-Viedma, E., & Herrera, F. (2012). SciMAT: A new science mapping analysis software tool. Journal of the American Society for Information Science and Technology, 63(8), 1609–1630.CrossRef
go back to reference Desgraupes, B. (2016). clusterCrit: Clustering indices, R package version 1.2.7. Desgraupes, B. (2016). clusterCrit: Clustering indices, R package version 1.2.7.
go back to reference Ding, W., & Chen, C. (2014). Dynamic topic detection and tracking: A comparison of HDP, C-word, and cocitation methods. Journal of the Association for Information Science and Technology, 65(10), 2084–2097.CrossRef Ding, W., & Chen, C. (2014). Dynamic topic detection and tracking: A comparison of HDP, C-word, and cocitation methods. Journal of the Association for Information Science and Technology, 65(10), 2084–2097.CrossRef
go back to reference Genolini, C., Alacoque, X., Sentenac, M., & Arnaud, C. (2015). kml and kml3d: R packages to cluster longitudinal data. Journal of Statistical Software, 65(4), 1–34.CrossRef Genolini, C., Alacoque, X., Sentenac, M., & Arnaud, C. (2015). kml and kml3d: R packages to cluster longitudinal data. Journal of Statistical Software, 65(4), 1–34.CrossRef
go back to reference Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences, 101(Suppl 1), 5228–5235.CrossRef Griffiths, T. L., & Steyvers, M. (2004). Finding scientific topics. Proceedings of the National Academy of Sciences, 101(Suppl 1), 5228–5235.CrossRef
go back to reference Jacques, J., & Preda, C. (2014). Functional data clustering: A survey. Advances in Data Analysis and Classification, 8(3), 231–255.MathSciNetCrossRef Jacques, J., & Preda, C. (2014). Functional data clustering: A survey. Advances in Data Analysis and Classification, 8(3), 231–255.MathSciNetCrossRef
go back to reference Lukas, M. A., de Hoog, F. R., & Anderssen, R. S. (2016). Practical use of robust GCV and modified GCV for spline smoothing. Computational Statistics, 31(1), 269–289.MathSciNetCrossRef Lukas, M. A., de Hoog, F. R., & Anderssen, R. S. (2016). Practical use of robust GCV and modified GCV for spline smoothing. Computational Statistics, 31(1), 269–289.MathSciNetCrossRef
go back to reference Montero, P., & Vilar, J. (2014). Tsclust: An R package for time series clustering. Journal of Statistical Software, 62(1), 1–43.CrossRef Montero, P., & Vilar, J. (2014). Tsclust: An R package for time series clustering. Journal of Statistical Software, 62(1), 1–43.CrossRef
go back to reference Peng, J., & Muller, H. G. (2008). Distance-based clustering of sparsely observed stochastic processes, with applications to online auctions. Annals of Applied Statistics, 2, 1056–1077.MathSciNetCrossRef Peng, J., & Muller, H. G. (2008). Distance-based clustering of sparsely observed stochastic processes, with applications to online auctions. Annals of Applied Statistics, 2, 1056–1077.MathSciNetCrossRef
go back to reference R Core Team. (2017). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing. R Core Team. (2017). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.
go back to reference Ramsay, J., & Silverman, B. W. (2005). Functional data analysis (Springer series in statistics). New York: Springer. Ramsay, J., & Silverman, B. W. (2005). Functional data analysis (Springer series in statistics). New York: Springer.
go back to reference Rand, W. M. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66(336), 846–850.CrossRef Rand, W. M. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66(336), 846–850.CrossRef
go back to reference Shabunina, E., & Pasi, G. (2018). A graph-based approach to ememes identification and tracking in social media streams. Knowledge-Based Systems, 139(Suppl C), 108–118.CrossRef Shabunina, E., & Pasi, G. (2018). A graph-based approach to ememes identification and tracking in social media streams. Knowledge-Based Systems, 139(Suppl C), 108–118.CrossRef
go back to reference Trevisani, M., & Tuzzi, A. (2012). Chronological analysis of textual data and curve clustering: preliminary results based on wavelets. In Società Italiana di Statistica (Ed.), Proceedings of the XLVI Scientific Meeting (pp. 1–4). Padova: Cleup. Trevisani, M., & Tuzzi, A. (2012). Chronological analysis of textual data and curve clustering: preliminary results based on wavelets. In Società Italiana di Statistica (Ed.), Proceedings of the XLVI Scientific Meeting (pp. 1–4). Padova: Cleup.
go back to reference Trevisani, M., & Tuzzi, A. (2013a). Shaping the history of words. In I. Obradovic, E. Kelih, & R. Köhler (Eds.), Methods and applications of quantitative linguistics: Selected papers of the VIIIth international conference on quantitative linguistics (pp. 84–95). Belgrad: Academic Mind. Trevisani, M., & Tuzzi, A. (2013a). Shaping the history of words. In I. Obradovic, E. Kelih, & R. Köhler (Eds.), Methods and applications of quantitative linguistics: Selected papers of the VIIIth international conference on quantitative linguistics (pp. 84–95). Belgrad: Academic Mind.
go back to reference Trevisani, M., & Tuzzi, A. (2013b). Through the JASA’s looking-glass, and what we found there. In Proceedings of the 28th International Workshop on Statistical Modelling (vol. 1, pp. 417–422). Istituto Palermo: Poligrafico Europeo. Trevisani, M., & Tuzzi, A. (2013b). Through the JASA’s looking-glass, and what we found there. In Proceedings of the 28th International Workshop on Statistical Modelling (vol. 1, pp. 417–422). Istituto Palermo: Poligrafico Europeo.
go back to reference Trevisani, M., & Tuzzi, A. (2015). A portrait of JASA: The history of statistics through analysis of keyword counts in an early scientific journal. Quality and Quantity, 49(3), 1287–1304.CrossRef Trevisani, M., & Tuzzi, A. (2015). A portrait of JASA: The history of statistics through analysis of keyword counts in an early scientific journal. Quality and Quantity, 49(3), 1287–1304.CrossRef
go back to reference Trevisani, M., & Tuzzi, A. (2018). Learning the evolution of disciplines from scientific literature: A functional clustering approach to normalized keyword count trajectories. Knowledge-Based Systems, 146, 129–141.CrossRef Trevisani, M., & Tuzzi, A. (2018). Learning the evolution of disciplines from scientific literature: A functional clustering approach to normalized keyword count trajectories. Knowledge-Based Systems, 146, 129–141.CrossRef
go back to reference Tuzzi, A., & Köhler, R. (2015). Tracing the history of words. In A. Tuzzi, M. Benesová, & J. Macutek (Eds.), Recent contributions to quantitative linguistics (pp. 203–214). New York: DeGruyter. Tuzzi, A., & Köhler, R. (2015). Tracing the history of words. In A. Tuzzi, M. Benesová, & J. Macutek (Eds.), Recent contributions to quantitative linguistics (pp. 203–214). New York: DeGruyter.
go back to reference Wang, J. L., Chiou, J. M., & Mueller, H. G. (2016). Functional data analysis. Annual Review of Statistics and Its Application, 3(1), 257–295.CrossRef Wang, J. L., Chiou, J. M., & Mueller, H. G. (2016). Functional data analysis. Annual Review of Statistics and Its Application, 3(1), 257–295.CrossRef
go back to reference Yamamoto, M. (2012). Clustering of functional data in a low-dimensional subspace. Advances in Data Analysis and Classification, 6, 219–247.MathSciNetCrossRef Yamamoto, M. (2012). Clustering of functional data in a low-dimensional subspace. Advances in Data Analysis and Classification, 6, 219–247.MathSciNetCrossRef
go back to reference Zhang, Y., Zhang, G., Chen, H., Porter, A. L., Zhu, D., & Lu, J. (2016). Topic analysis and forecasting for science, technology and innovation: Methodology with a case study focusing on big data research. Technological Forecasting and Social Change, 105, 179–191.CrossRef Zhang, Y., Zhang, G., Chen, H., Porter, A. L., Zhu, D., & Lu, J. (2016). Topic analysis and forecasting for science, technology and innovation: Methodology with a case study focusing on big data research. Technological Forecasting and Social Change, 105, 179–191.CrossRef
go back to reference Zhang, Y., Chen, H., Lu, J., & Zhang, G. (2017). Detecting and predicting the topic change of knowledge-based systems: A topic-based bibliometric analysis from 1991 to 2016. Knowledge-Based Systems, 133(Suppl C), 255–268.CrossRef Zhang, Y., Chen, H., Lu, J., & Zhang, G. (2017). Detecting and predicting the topic change of knowledge-based systems: A topic-based bibliometric analysis from 1991 to 2016. Knowledge-Based Systems, 133(Suppl C), 255–268.CrossRef
go back to reference Zhou, H., Yu, H., Hu, R., & Hu, J. (2017). A survey on trends of cross-media topic evolution map. Knowledge-Based Systems, 124(Suppl C), 164–175.CrossRef Zhou, H., Yu, H., Hu, R., & Hu, J. (2017). A survey on trends of cross-media topic evolution map. Knowledge-Based Systems, 124(Suppl C), 164–175.CrossRef
Metadata
Title
Functional Data Analysis and Knowledge-Based Systems
Author
Matilde Trevisani
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-97064-6_9

Premium Partner