Published in: Pattern Analysis and Applications 1/2023

18.07.2022 | Theoretical Advances

Interval regression model adequacy checking and its application to estimate school dropout in Brazilian municipality educational scenario

Written by: Rafaella L. S. do Nascimento, Roberta A. de A. Fagundes, Renata M. C. R. de Souza, Francisco José A. Cysneiros

Abstract

Interval-valued data are commonly encountered in practice, and Symbolic Data Analysis provides tools for their statistical treatment. Regression analysis for interval-valued symbolic data has been widely investigated in the symbolic data analysis literature, and several models from different paradigms have been proposed. These models rest on basic regression assumptions, and it is essential to validate them. This paper introduces an approach to checking interval regression model adequacy based on residual analysis. Concepts of ordinary and standardized interval residuals are presented, and a graphical analysis of these residuals is also proposed. To show the usefulness of the proposed approach, an application for estimating school dropout in Brazilian municipalities is performed. The interval residual analysis revealed some outliers, and interval robust regression models proved more suitable for estimating school dropout.
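
The abstract does not define the interval residuals, so the following is only an illustrative sketch of what residual-based adequacy checking for an interval regression could look like, not the authors' method. It assumes a center-and-range style fit (one least-squares line on interval midpoints, one on half-ranges), takes the ordinary interval residual to be the pair of lower-bound and upper-bound differences, and standardizes each bound by its sample standard deviation rather than by a leverage-adjusted scale. The function names `fit_line`, `interval_residuals`, and `standardize` and the toy data are hypothetical.

```python
from statistics import mean, stdev

def fit_line(x, y):
    """Ordinary least squares with one predictor: returns (intercept, slope)."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope

def interval_residuals(x_int, y_int):
    """Assumed definition: residual = (observed lower - fitted lower,
    observed upper - fitted upper). Inputs are lists of (lower, upper)."""
    xc = [(a + b) / 2 for a, b in x_int]  # midpoints of the predictor
    xr = [(b - a) / 2 for a, b in x_int]  # half-ranges of the predictor
    yc = [(a + b) / 2 for a, b in y_int]
    yr = [(b - a) / 2 for a, b in y_int]
    b0c, b1c = fit_line(xc, yc)           # model for the centers
    b0r, b1r = fit_line(xr, yr)           # model for the half-ranges
    residuals = []
    for (lo, hi), c, r in zip(y_int, xc, xr):
        c_hat = b0c + b1c * c
        r_hat = max(0.0, b0r + b1r * r)   # keep fitted half-range non-negative
        residuals.append((lo - (c_hat - r_hat), hi - (c_hat + r_hat)))
    return residuals

def standardize(residuals):
    """Scale each bound's residuals by its sample standard deviation
    (a simplification; no leverage correction is applied)."""
    s_lo = stdev(r[0] for r in residuals)
    s_up = stdev(r[1] for r in residuals)
    return [(lo / s_lo, up / s_up) for lo, up in residuals]

# Toy data (invented for illustration); the fourth observation is an outlier.
x_int = [(1, 3), (2, 5), (4, 6), (5, 9), (7, 10), (8, 12)]
y_int = [(3, 5.2), (5.4, 8.6), (9, 11.1), (20, 26), (15.4, 18.7), (18, 22.1)]

res = interval_residuals(x_int, y_int)
std_res = standardize(res)
# Index of the observation with the largest standardized upper-bound residual.
suspect = max(range(len(std_res)), key=lambda i: abs(std_res[i][1]))
print(suspect, std_res[suspect])
```

Plotting the standardized lower and upper residuals against the fitted bounds would be one way to carry out a graphical check; observations that stand far from the rest, like the fourth one here, are candidates for the outliers that motivate a robust fit.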

Metadata
Title
Interval regression model adequacy checking and its application to estimate school dropout in Brazilian municipality educational scenario
Written by
Rafaella L. S. do Nascimento
Roberta A. de A. Fagundes
Renata M. C. R. de Souza
Francisco José A. Cysneiros
Publication date
18.07.2022
Publisher
Springer London
Published in
Pattern Analysis and Applications / Issue 1/2023
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-022-01093-0
