Skip to main content
Erschienen in: Advances in Data Analysis and Classification 2/2023

19.08.2022 | Regular Article

Localization processes for functional data analysis

verfasst von: Antonio Elías, Raúl Jiménez, J. E. Yukich

Erschienen in: Advances in Data Analysis and Classification | Ausgabe 2/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We propose an alternative to k-nearest neighbors for functional data whereby the approximating neighboring curves are piecewise functions built from a functional sample. Using a locally defined distance function that satisfies stabilization criteria, we establish pointwise and global approximation results in function spaces when the number of data curves is large. We exploit this feature to develop the asymptotic theory when a finite number of curves is observed at time-points given by an i.i.d. sample whose cardinality increases up to infinity. We use these results to investigate the problem of estimating unobserved segments of a partially observed functional data sample as well as to study the problem of functional classification and outlier detection. For such problems our methods are competitive with and sometimes superior to benchmark predictions in the field. The R package localFDA provides routines for computing the localization processes and the estimators proposed in this article.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Arribas-Gil A, Romo J (2014) Shape outlier detection and visualization for functional data: the outliergram. Biostatistics 15(4):603–619CrossRef Arribas-Gil A, Romo J (2014) Shape outlier detection and visualization for functional data: the outliergram. Biostatistics 15(4):603–619CrossRef
Zurück zum Zitat Biau B, Cérou F, Guyader A (2010) Rates of convergence of the functional \(k\)-nearest neighbor estimate. IEEE Trans Inf Theory 56:2034–2040MathSciNetCrossRefMATH Biau B, Cérou F, Guyader A (2010) Rates of convergence of the functional \(k\)-nearest neighbor estimate. IEEE Trans Inf Theory 56:2034–2040MathSciNetCrossRefMATH
Zurück zum Zitat Brito MR, Chávez EL, Quiroz AJ et al (1997) Connectivity of the mutual k-nearest-neighbor graph in clustering and outlier detection. Stat Probab Lett 35:33–42MathSciNetCrossRefMATH Brito MR, Chávez EL, Quiroz AJ et al (1997) Connectivity of the mutual k-nearest-neighbor graph in clustering and outlier detection. Stat Probab Lett 35:33–42MathSciNetCrossRefMATH
Zurück zum Zitat Cuesta-Albertos JA, Febrero-Bande M, Oviedo de la Fuente M (2017) The DD\(^g\)-classifier in the functional setting. TEST 26(1):119–142MathSciNetCrossRefMATH Cuesta-Albertos JA, Febrero-Bande M, Oviedo de la Fuente M (2017) The DD\(^g\)-classifier in the functional setting. TEST 26(1):119–142MathSciNetCrossRefMATH
Zurück zum Zitat Febrero-Bande M, Oviedo M (2012) Statistical computing in functional data analysis: the R package fda.usc. J Stat Softw 51(4):1–28CrossRef Febrero-Bande M, Oviedo M (2012) Statistical computing in functional data analysis: the R package fda.usc. J Stat Softw 51(4):1–28CrossRef
Zurück zum Zitat Febrero-Bande M, Galeano P, González-Manteiga W (2019) Estimation, imputation and prediction for the functional linear model with scalar response with responses missing at random. Comput Stat Data Anal 131:91–103MathSciNetCrossRefMATH Febrero-Bande M, Galeano P, González-Manteiga W (2019) Estimation, imputation and prediction for the functional linear model with scalar response with responses missing at random. Comput Stat Data Anal 131:91–103MathSciNetCrossRefMATH
Zurück zum Zitat Ferraty F, Vieu P (2006) Nonparametric functional data analysis. Springer, New YorkMATH Ferraty F, Vieu P (2006) Nonparametric functional data analysis. Springer, New YorkMATH
Zurück zum Zitat Gao Y, Shang HL, Yang Y (2019) High-dimensional functional time series forecasting: an application to age-specific mortality rates. J Multivar Anal 170:232–243MathSciNetCrossRefMATH Gao Y, Shang HL, Yang Y (2019) High-dimensional functional time series forecasting: an application to age-specific mortality rates. J Multivar Anal 170:232–243MathSciNetCrossRefMATH
Zurück zum Zitat Györfi L, Kohler M, Krzyzak A et al (2002) A distribution-free theory of nonparametric regression. Springer, New YorkCrossRefMATH Györfi L, Kohler M, Krzyzak A et al (2002) A distribution-free theory of nonparametric regression. Springer, New YorkCrossRefMATH
Zurück zum Zitat Hubert M, Rousseeuw P, Segaert P (2017) Multivariate and functional classification using depth and distance. Adv Data Anal Classif 11:445–466MathSciNetCrossRefMATH Hubert M, Rousseeuw P, Segaert P (2017) Multivariate and functional classification using depth and distance. Adv Data Anal Classif 11:445–466MathSciNetCrossRefMATH
Zurück zum Zitat Hyndman RJ, Booth H (2008) Stochastic population forecasts using functional data models for mortality, fertility and migration. Int J Forecast 24:323–342CrossRef Hyndman RJ, Booth H (2008) Stochastic population forecasts using functional data models for mortality, fertility and migration. Int J Forecast 24:323–342CrossRef
Zurück zum Zitat Hyndman RJ, Shang HL (2010) Rainbow plots, bagplots and boxplots for functional data. J Comput Graph Stat 19(1):29–45MathSciNetCrossRef Hyndman RJ, Shang HL (2010) Rainbow plots, bagplots and boxplots for functional data. J Comput Graph Stat 19(1):29–45MathSciNetCrossRef
Zurück zum Zitat Hyndman RJ, Ullah S (2007) Robust forecasting of mortality and fertility rates: a functional data approach. Comput Stat Data Anal 51:4942–4956MathSciNetCrossRefMATH Hyndman RJ, Ullah S (2007) Robust forecasting of mortality and fertility rates: a functional data approach. Comput Stat Data Anal 51:4942–4956MathSciNetCrossRefMATH
Zurück zum Zitat Kara LZ, Laksaci A, Rachdi M et al (2017) Data-driven \(k\)nn estimation in nonparametric functional data analysis. J Multivar Anal 153:176–188CrossRefMATH Kara LZ, Laksaci A, Rachdi M et al (2017) Data-driven \(k\)nn estimation in nonparametric functional data analysis. J Multivar Anal 153:176–188CrossRefMATH
Zurück zum Zitat Kraus D (2015) Components and completion of partially observed functional data. J R Stat Soc Ser B-Stat Methodol 77:777–801MathSciNetCrossRefMATH Kraus D (2015) Components and completion of partially observed functional data. J R Stat Soc Ser B-Stat Methodol 77:777–801MathSciNetCrossRefMATH
Zurück zum Zitat Kudraszow N, Vieu P (2013) Uniform consistency of \(k\)nn regressors for functional variables. Stat Probab Lett 83:1863–1870CrossRefMATH Kudraszow N, Vieu P (2013) Uniform consistency of \(k\)nn regressors for functional variables. Stat Probab Lett 83:1863–1870CrossRefMATH
Zurück zum Zitat Li J, Cuesta-Albertos JA, Liu RY (2012) DD-classifier: nonparametric classification procedure based on DD-plot. J Am Stat Assoc Theory Methods 107:737–753MathSciNetCrossRefMATH Li J, Cuesta-Albertos JA, Liu RY (2012) DD-classifier: nonparametric classification procedure based on DD-plot. J Am Stat Assoc Theory Methods 107:737–753MathSciNetCrossRefMATH
Zurück zum Zitat Lian H (2011) Convergence of functional k-nearest neighbor regression estimate with functional responses. Electron J Stat 5:31–40MathSciNetCrossRefMATH Lian H (2011) Convergence of functional k-nearest neighbor regression estimate with functional responses. Electron J Stat 5:31–40MathSciNetCrossRefMATH
Zurück zum Zitat Liebl D (2019) Nonparametric testing for differences in electricity prices: the case of the fukushima nuclear accident. Ann Appl Stat 13:1128–1146MathSciNetCrossRefMATH Liebl D (2019) Nonparametric testing for differences in electricity prices: the case of the fukushima nuclear accident. Ann Appl Stat 13:1128–1146MathSciNetCrossRefMATH
Zurück zum Zitat López-Pintado S, Romo J (2009) On the concept of depth for functional data. J Am Stat Assoc Theory Methods 104(486):718–734MathSciNetCrossRefMATH López-Pintado S, Romo J (2009) On the concept of depth for functional data. J Am Stat Assoc Theory Methods 104(486):718–734MathSciNetCrossRefMATH
Zurück zum Zitat Martínez F, Frías MP, Pérez MD et al (2017) A methodology for applying \(k\)-nearest neighbor to time series forecasting. Artif Intell Rev 52:2019–2037CrossRef Martínez F, Frías MP, Pérez MD et al (2017) A methodology for applying \(k\)-nearest neighbor to time series forecasting. Artif Intell Rev 52:2019–2037CrossRef
Zurück zum Zitat Ramaswamy S, Rastogi R, Shim K (2000) Efficient algorithms for mining outliers from large data sets. In: Proceedings of the ACM SIGMOD conference on management of data, pp 427–438 Ramaswamy S, Rastogi R, Shim K (2000) Efficient algorithms for mining outliers from large data sets. In: Proceedings of the ACM SIGMOD conference on management of data, pp 427–438
Zurück zum Zitat Schreiber T (2010) New perspectives in stochastic geometry, Oxford University Press, Oxford, chap Limit theorems in stochastic geometry, pp 111–144 Schreiber T (2010) New perspectives in stochastic geometry, Oxford University Press, Oxford, chap Limit theorems in stochastic geometry, pp 111–144
Zurück zum Zitat Shang HL, Hyndman RJ (2017) Grouped functional time series forecasting: an application to age-specific mortality rates. J Comput Graph Stat 26(2):330–343MathSciNetCrossRef Shang HL, Hyndman RJ (2017) Grouped functional time series forecasting: an application to age-specific mortality rates. J Comput Graph Stat 26(2):330–343MathSciNetCrossRef
Zurück zum Zitat Wang JL, Chiou JM, Müller HG (2016) Functional data analysis. Annu Rev Stat Appl 3:257–295CrossRef Wang JL, Chiou JM, Müller HG (2016) Functional data analysis. Annu Rev Stat Appl 3:257–295CrossRef
Zurück zum Zitat Wu X, Kumar V, Ross Quinlan J et al (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14:1–37CrossRef Wu X, Kumar V, Ross Quinlan J et al (2008) Top 10 algorithms in data mining. Knowl Inf Syst 14:1–37CrossRef
Zurück zum Zitat Yao F, Müller HG, Wang JL (2005) Functional data analysis for sparse longitudinal data. J Am Stat Assoc Theory Methods 100:577–590MathSciNetCrossRefMATH Yao F, Müller HG, Wang JL (2005) Functional data analysis for sparse longitudinal data. J Am Stat Assoc Theory Methods 100:577–590MathSciNetCrossRefMATH
Zurück zum Zitat Zhang S, Jank W, Shmueli G (2010) Real-time forecasting of online auctions via functional k-nearest neighbors. Int J Forecast 26:666–683CrossRef Zhang S, Jank W, Shmueli G (2010) Real-time forecasting of online auctions via functional k-nearest neighbors. Int J Forecast 26:666–683CrossRef
Metadaten
Titel
Localization processes for functional data analysis
verfasst von
Antonio Elías
Raúl Jiménez
J. E. Yukich
Publikationsdatum
19.08.2022
Verlag
Springer Berlin Heidelberg
Erschienen in
Advances in Data Analysis and Classification / Ausgabe 2/2023
Print ISSN: 1862-5347
Elektronische ISSN: 1862-5355
DOI
https://doi.org/10.1007/s11634-022-00512-8

Weitere Artikel der Ausgabe 2/2023

Advances in Data Analysis and Classification 2/2023 Zur Ausgabe

Premium Partner