Skip to main content
Top
Published in: Pattern Analysis and Applications 1/2023

18-07-2022 | Theoretical Advances

Interval regression model adequacy checking and its application to estimate school dropout in Brazilian municipality educational scenario

Authors: Rafaella L. S. do Nascimento, Roberta A. de A. Fagundes, Renata M. C. R. de Souza, Francisco José A. Cysneiros

Published in: Pattern Analysis and Applications | Issue 1/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Interval-valued data have been commonly encountered in practice, and Symbolic Data Analysis provides a solution to the statistical treatment of these data. Regression analysis for interval-valued symbolic data is a topic that has been widely investigated in the literature of symbolic data analysis, and several models from different paradigms have been proposed. There are basic regression assumptions, and it is essential to validate them. This paper introduces an approach to check interval regression model adequacy based on residual analysis. Concepts of ordinary and standardized interval residual are presented, and graphical analysis of these residuals is also proposed. To show the usefulness of the proposed approach, an application for estimating school dropout in the scenario of Brazilian municipalities is performed. We observed some outliers from the interval residuals analysis, and interval robust regression models are more suitable for estimating school dropout.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Bertrand P, Goupil F (2000) Descriptive statistics for symbolic data. In: Analysis of symbolic data, pp. 106–124. Springer: Berlin Bertrand P, Goupil F (2000) Descriptive statistics for symbolic data. In: Analysis of symbolic data, pp. 106–124. Springer: Berlin
2.
go back to reference Billard L, Diday E (2000) Regression analysis for interval-valued data. In: Data analysis, classification, and related methods, pp. 369–374. Springer: Berlin Billard L, Diday E (2000) Regression analysis for interval-valued data. In: Data analysis, classification, and related methods, pp. 369–374. Springer: Berlin
3.
go back to reference Billard L, Diday E (2002) Symbolic regression analysis. In: Classification, clustering, and data analysis, pp. 281–288. Springer: Berlin Billard L, Diday E (2002) Symbolic regression analysis. In: Classification, clustering, and data analysis, pp. 281–288. Springer: Berlin
5.
go back to reference Billard L, Diday E (2006) Symbolic data analysis: conceptual statistics and data mining. Wiley, ChichesterCrossRefMATH Billard L, Diday E (2006) Symbolic data analysis: conceptual statistics and data mining. Wiley, ChichesterCrossRefMATH
9.
go back to reference Diday E (2016) Thinking by classes in data science: the symbolic data analysis paradigm. WIREs Comput Stat 8(5):172–205MathSciNetCrossRef Diday E (2016) Thinking by classes in data science: the symbolic data analysis paradigm. WIREs Comput Stat 8(5):172–205MathSciNetCrossRef
10.
go back to reference Diday E, Noirhomme-Fraiture M (2008) Symbolic data analysis and the SODAS software. Wiley, ChichesterMATH Diday E, Noirhomme-Fraiture M (2008) Symbolic data analysis and the SODAS software. Wiley, ChichesterMATH
12.
go back to reference Fagundes RAA, de Souza RMCR, Soares YMG (2016) Quantile regression of interval-valued data. In: 2016 23rd international conference on pattern recognition (ICPR), pp. 2586–2591 Fagundes RAA, de Souza RMCR, Soares YMG (2016) Quantile regression of interval-valued data. In: 2016 23rd international conference on pattern recognition (ICPR), pp. 2586–2591
15.
go back to reference Koedinger K, Cunningham K, Skogsholm A, Leber B (2008) An open repository and analysis tools for fine-grained, longitudinal learner data. In: proceedings of the 1st international conference educational data mining, pp. 157–166. Montreal, Canada Koedinger K, Cunningham K, Skogsholm A, Leber B (2008) An open repository and analysis tools for fine-grained, longitudinal learner data. In: proceedings of the 1st international conference educational data mining, pp. 157–166. Montreal, Canada
17.
go back to reference Lima Neto EA, Cordeiro GM, De Carvalho FAT (2011) Bivariate symbolic regression models for interval-valued variables. J Stat Comput Simul 81(11):1727–1744MathSciNetCrossRefMATH Lima Neto EA, Cordeiro GM, De Carvalho FAT (2011) Bivariate symbolic regression models for interval-valued variables. J Stat Comput Simul 81(11):1727–1744MathSciNetCrossRefMATH
22.
go back to reference Nascimento RLS, Fagundes RAA, Maciel AMA (2019) Prediction of school efficiency rates through ensemble regression application. In: proceedings of the 19th international conference on advanced learning technologies (ICALT), pp. 194–198. IEEE, Maceió, Brazil. https://doi.org/10.1109/ICALT.2019.00050 Nascimento RLS, Fagundes RAA, Maciel AMA (2019) Prediction of school efficiency rates through ensemble regression application. In: proceedings of the 19th international conference on advanced learning technologies (ICALT), pp. 194–198. IEEE, Maceió, Brazil. https://​doi.​org/​10.​1109/​ICALT.​2019.​00050
32.
go back to reference Xu W (2010) Symbolic data analysis: interval-valued data regression. Ph.D. thesis, University of Georgia Athens, GA Xu W (2010) Symbolic data analysis: interval-valued data regression. Ph.D. thesis, University of Georgia Athens, GA
Metadata
Title
Interval regression model adequacy checking and its application to estimate school dropout in Brazilian municipality educational scenario
Authors
Rafaella L. S. do Nascimento
Roberta A. de A. Fagundes
Renata M. C. R. de Souza
Francisco José A. Cysneiros
Publication date
18-07-2022
Publisher
Springer London
Published in
Pattern Analysis and Applications / Issue 1/2023
Print ISSN: 1433-7541
Electronic ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-022-01093-0

Other articles of this Issue 1/2023

Pattern Analysis and Applications 1/2023 Go to the issue

Premium Partner