Skip to main content
Erschienen in: Earth Science Informatics 1/2022

07.01.2022 | Research Article

Efficient weighted naive bayes classifiers to predict air quality index

verfasst von: Jasleen Kaur Sethi, Mamta Mittal

Erschienen in: Earth Science Informatics | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In the past few decades, there have been many environmental changes that have lead to deteriorating air quality influenced by number of criteria pollutants and the prevailing climatic conditions. These pollutants lead to respiratory problems and other environmental effects such as acid rain, greenhouse effect etc. Therefore, quality prediction of air quality has long served as prospective and practical study area that has received massive attention. In literature, Naive Bayes classifier based on independence and equal importance assumption has been used by many researchers for air quality prediction. However, these assumptions never hold practically. To fulfill this purpose, two classifiers based on Weighted Naive Bayes named as Covariance based Weighted Naive Bayes and Convergent Cross Mapping based Weighted Naive Bayes have been proposed in this study to predict the air quality index of Faridabad, Delhi and Gurugram in India. The findings of the experimental work conducted in this study showed that both the proposed weighted classifiers perform better than the traditional Naive Bayes, Support Vector Machines and Neural Network classifiers with respect to various performance metrics- accuracy, average precision, average recall, error rate and F1 score. Further, this study depicts that Covariance based Weighted Naive Bayes and Convergent Cross Mapping based Weighted Naive Bayes have an average accuracy of 83.6% and 82.12% respectively.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Asuero AG, Sayago A, Gonzalez AG (2006) The correlation coefficient: An overview. Crit Rev Anal Chem 36(1):41–59CrossRef Asuero AG, Sayago A, Gonzalez AG (2006) The correlation coefficient: An overview. Crit Rev Anal Chem 36(1):41–59CrossRef
Zurück zum Zitat Athanasiadis IN, Karatzas KD, Mitkas PA (2006) Classification techniques for air quality forecasting. In Fifth ECAI Workshop on Binding Environmental Sciences and Artificial Intelligence, 17th European Conference on Artificial Intelligence, Riva del Garda, Italy Athanasiadis IN, Karatzas KD, Mitkas PA (2006) Classification techniques for air quality forecasting. In Fifth ECAI Workshop on Binding Environmental Sciences and Artificial Intelligence, 17th European Conference on Artificial Intelligence, Riva del Garda, Italy
Zurück zum Zitat Battineni G, Sagaro GG, Nalini C, Amenta F, Tayebati SK (2019) Comparative machine-learning approach: A follow-up study on type 2 diabetes predictions by cross-validation methods. Machines 7(4):74CrossRef Battineni G, Sagaro GG, Nalini C, Amenta F, Tayebati SK (2019) Comparative machine-learning approach: A follow-up study on type 2 diabetes predictions by cross-validation methods. Machines 7(4):74CrossRef
Zurück zum Zitat Bougoudis I, Demertzis K, Iliadis L (2016) Fast and low cost prediction of extreme air pollution values with hybrid unsupervised learning. Integrated Computer-Aided Engineering 23(2):115–127CrossRef Bougoudis I, Demertzis K, Iliadis L (2016) Fast and low cost prediction of extreme air pollution values with hybrid unsupervised learning. Integrated Computer-Aided Engineering 23(2):115–127CrossRef
Zurück zum Zitat Chang JH, Tseng CY (2017) Analysis of correlation between secondary PM2. 5 and factory pollution sources by using ANN and the correlation coefficient. IEEE Access 5:22812–22822CrossRef Chang JH, Tseng CY (2017) Analysis of correlation between secondary PM2. 5 and factory pollution sources by using ANN and the correlation coefficient. IEEE Access 5:22812–22822CrossRef
Zurück zum Zitat Corani G, Scanagatta M (2016) Air pollution prediction via multi-label classification. Environ Model Softw 80:259–264CrossRef Corani G, Scanagatta M (2016) Air pollution prediction via multi-label classification. Environ Model Softw 80:259–264CrossRef
Zurück zum Zitat Farid DM, Zhang L, Rahman CM, Hossain MA, Strachan R (2014) Hybrid decision tree and naïve Bayes classifiers for multi-class classification tasks. Expert Syst Appl 41(4):1937–1946CrossRef Farid DM, Zhang L, Rahman CM, Hossain MA, Strachan R (2014) Hybrid decision tree and naïve Bayes classifiers for multi-class classification tasks. Expert Syst Appl 41(4):1937–1946CrossRef
Zurück zum Zitat Frank E, Hall M, Pfahringer B (2012) Locally weighted naive bayes. arXiv preprint arXiv:1212.2487 Frank E, Hall M, Pfahringer B (2012) Locally weighted naive bayes. arXiv preprint arXiv:1212.2487
Zurück zum Zitat Gore RW, Deshpande DS (2017) An approach for classification of health risks based on air quality levels. In 2017 1st International Conference on Intelligent Systems and Information Management (ICISIM) (pp. 58–61). IEEE Gore RW, Deshpande DS (2017) An approach for classification of health risks based on air quality levels. In 2017 1st International Conference on Intelligent Systems and Information Management (ICISIM) (pp. 58–61). IEEE
Zurück zum Zitat Goyal LM, Mittal M, Sethi JK (2016) Fuzzy model generation using Subtractive and Fuzzy C-Means clustering. CSI Transactions on ICT 4(2–4):129–133CrossRef Goyal LM, Mittal M, Sethi JK (2016) Fuzzy model generation using Subtractive and Fuzzy C-Means clustering. CSI Transactions on ICT 4(2–4):129–133CrossRef
Zurück zum Zitat Gu K, Qiao J, Li X (2018) Highly efficient picture-based prediction of PM2. 5 concentration. IEEE Transactions on Industrial Electronics 66(4):3176–3184CrossRef Gu K, Qiao J, Li X (2018) Highly efficient picture-based prediction of PM2. 5 concentration. IEEE Transactions on Industrial Electronics 66(4):3176–3184CrossRef
Zurück zum Zitat Herath HMKKMB, Ariyathunge SVASH, Karunasena GMKB (2021) Development of a Mathematical Model to Forecast Solar Radiation and Validating Results Using Machine Learning Technique Herath HMKKMB, Ariyathunge SVASH, Karunasena GMKB (2021) Development of a Mathematical Model to Forecast Solar Radiation and Validating Results Using Machine Learning Technique
Zurück zum Zitat Ji H, Huang S, Wu Y, Hui Z, Zheng C (2019) A new weighted naive Bayes method based on information diffusion for software defect prediction. Software Qual J 27(3):923–968CrossRef Ji H, Huang S, Wu Y, Hui Z, Zheng C (2019) A new weighted naive Bayes method based on information diffusion for software defect prediction. Software Qual J 27(3):923–968CrossRef
Zurück zum Zitat Jiang L, Zhang L, Li C, Wu J (2018) A correlation-based feature weighting filter for naive bayes. IEEE Trans Knowl Data Eng 31(2):201–213CrossRef Jiang L, Zhang L, Li C, Wu J (2018) A correlation-based feature weighting filter for naive bayes. IEEE Trans Knowl Data Eng 31(2):201–213CrossRef
Zurück zum Zitat Kotsiantis SB, Zaharakis I, Pintelas P (2007) Supervised machine learning: A review of classification techniques. Emerging Artificial Intelligence Applications in Computer Engineering 160(1):3–24 Kotsiantis SB, Zaharakis I, Pintelas P (2007) Supervised machine learning: A review of classification techniques. Emerging Artificial Intelligence Applications in Computer Engineering 160(1):3–24
Zurück zum Zitat Kumar D (2018) Evolving Differential evolution method with random forest for prediction of Air Pollution. Procedia Computer Science 132:824–833CrossRef Kumar D (2018) Evolving Differential evolution method with random forest for prediction of Air Pollution. Procedia Computer Science 132:824–833CrossRef
Zurück zum Zitat Kumar HM, Harish BS (2019) Automatic Irony Detection using Feature Fusion and Ensemble Classifier. Int J Interact Multimedia Artific Intell 5(7) Kumar HM, Harish BS (2019) Automatic Irony Detection using Feature Fusion and Ensemble Classifier. Int J Interact Multimedia Artific Intell 5(7)
Zurück zum Zitat Kumar HM, Harish BS, Darshan HK (2019) Sentiment Analysis on IMDb Movie Reviews Using Hybrid Feature Extraction Method. International Journal of Interactive Multimedia & Artificial Intelligence 5(5) Kumar HM, Harish BS, Darshan HK (2019) Sentiment Analysis on IMDb Movie Reviews Using Hybrid Feature Extraction Method. International Journal of Interactive Multimedia & Artificial Intelligence 5(5)
Zurück zum Zitat Manzoor A, Judge MA, Almogren A, Akhunzada A, Fattah S, Gani A, Zareei M (2020) A priori multiobjective self-adaptive multi-population based jaya algorithm to optimize ders operations and electrical tasks. Ieee Access 8:181163–181175CrossRef Manzoor A, Judge MA, Almogren A, Akhunzada A, Fattah S, Gani A, Zareei M (2020) A priori multiobjective self-adaptive multi-population based jaya algorithm to optimize ders operations and electrical tasks. Ieee Access 8:181163–181175CrossRef
Zurück zum Zitat Mittal M, Sharma RK, Singh VP, Kumar R (2019) Adaptive threshold based clustering: a deterministic partitioning approach. International Journal of Information System Modeling and Design (IJISMD) 10(1):42–59CrossRef Mittal M, Sharma RK, Singh VP, Kumar R (2019) Adaptive threshold based clustering: a deterministic partitioning approach. International Journal of Information System Modeling and Design (IJISMD) 10(1):42–59CrossRef
Zurück zum Zitat Sethi JK, Mittal M (2021) An efficient correlation based adaptive LASSO regression method for air quality index prediction. Earth Science Informatics 1–10 Sethi JK, Mittal M (2021) An efficient correlation based adaptive LASSO regression method for air quality index prediction. Earth Science Informatics 1–10
Zurück zum Zitat Sethi JK, Mittal M (2021) Prediction of Air Quality Index Using Hybrid Machine Learning Algorithm. In Advances in Information Communication Technology and Computing (pp. 439–449). Springer, Singapore Sethi JK, Mittal M (2021) Prediction of Air Quality Index Using Hybrid Machine Learning Algorithm. In Advances in Information Communication Technology and Computing (pp. 439–449). Springer, Singapore
Zurück zum Zitat Sethi J, Mittal M (2019) Ambient air quality estimation using supervised learning techniques. EAI Endorsed Transactions on Scalable Information Systems 6(22) Sethi J, Mittal M (2019) Ambient air quality estimation using supervised learning techniques. EAI Endorsed Transactions on Scalable Information Systems 6(22)
Zurück zum Zitat Sugihara G, May R, Ye H, Hsieh CH, Deyle E, Fogarty M, Munch S (2012) Detecting Causality in Complex Ecosystems Science 338(6106):496–500 Sugihara G, May R, Ye H, Hsieh CH, Deyle E, Fogarty M, Munch S (2012) Detecting Causality in Complex Ecosystems Science 338(6106):496–500
Zurück zum Zitat Usharani B, Sreedevi M (2021) Deep Learning Techniques for Air Pollution Prediction Using Remote Sensing Data. In Smart Technologies in Data Science and Communication (pp. 107–123). Springer, Singapore Usharani B, Sreedevi M (2021) Deep Learning Techniques for Air Pollution Prediction Using Remote Sensing Data. In Smart Technologies in Data Science and Communication (pp. 107–123). Springer, Singapore
Zurück zum Zitat Wibowo M, Sulaiman S, Shamsuddin SM (2018) Comparison of Prediction Methods for Air Pollution Data in Malaysia and Singapore. International Journal of Innovative Computing, 8(3) Wibowo M, Sulaiman S, Shamsuddin SM (2018) Comparison of Prediction Methods for Air Pollution Data in Malaysia and Singapore. International Journal of Innovative Computing, 8(3)
Zurück zum Zitat Ye H, Deyle ER, Gilarranz LJ, Sugihara G (2015) Distinguishing time-delayed causal interactions using convergent cross mapping. Sci Rep 5(1):1–9 Ye H, Deyle ER, Gilarranz LJ, Sugihara G (2015) Distinguishing time-delayed causal interactions using convergent cross mapping. Sci Rep 5(1):1–9
Zurück zum Zitat Yu R, Yang Y, Yang L, Han G, Move OA (2016) RAQ–A random forest approach for predicting air quality in urban sensing systems. Sensors 16(1):86CrossRef Yu R, Yang Y, Yang L, Han G, Move OA (2016) RAQ–A random forest approach for predicting air quality in urban sensing systems. Sensors 16(1):86CrossRef
Zurück zum Zitat Zaidi NA, Cerquides J, Carman MJ, Webb GI (2013) Alleviating naive Bayes attribute independence assumption by attribute weighting Zaidi NA, Cerquides J, Carman MJ, Webb GI (2013) Alleviating naive Bayes attribute independence assumption by attribute weighting
Zurück zum Zitat Zhai B, Chen J (2018) Development of a stacked ensemble model for forecasting and analyzing daily average PM2. 5 concentrations in Beijing. China Science of the Total Environment 635:644–658CrossRef Zhai B, Chen J (2018) Development of a stacked ensemble model for forecasting and analyzing daily average PM2. 5 concentrations in Beijing. China Science of the Total Environment 635:644–658CrossRef
Zurück zum Zitat Zhang H, Sheng S (2004) Learning weighted naive Bayes with accurate ranking. In Fourth IEEE International Conference on Data Mining (ICDM'04) (pp. 567–570). IEEE Zhang H, Sheng S (2004) Learning weighted naive Bayes with accurate ranking. In Fourth IEEE International Conference on Data Mining (ICDM'04) (pp. 567–570). IEEE
Zurück zum Zitat Zhu JY, Zhang C, Zhang H, Zhi S, Li VO, Han J, Zheng Y (2017) pg-causality: Identifying spatiotemporal causal pathways for air pollutants with urban big data. IEEE Transactions on Big Data 4(4):571–585CrossRef Zhu JY, Zhang C, Zhang H, Zhi S, Li VO, Han J, Zheng Y (2017) pg-causality: Identifying spatiotemporal causal pathways for air pollutants with urban big data. IEEE Transactions on Big Data 4(4):571–585CrossRef
Metadaten
Titel
Efficient weighted naive bayes classifiers to predict air quality index
verfasst von
Jasleen Kaur Sethi
Mamta Mittal
Publikationsdatum
07.01.2022
Verlag
Springer Berlin Heidelberg
Erschienen in
Earth Science Informatics / Ausgabe 1/2022
Print ISSN: 1865-0473
Elektronische ISSN: 1865-0481
DOI
https://doi.org/10.1007/s12145-021-00755-7

Weitere Artikel der Ausgabe 1/2022

Earth Science Informatics 1/2022 Zur Ausgabe

Premium Partner