Skip to main content
Top

2022 | OriginalPaper | Chapter

10. Analysis of Quality of Living Data of Households of Indian Districts Using Machine Learning Approach of Fuzzy C-Means Clustering

Author : Supratik Sekhar Bhattacharya

Published in: Persistent and Emerging Challenges to Development

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Machine learning is used to analyse the 2011 census data of physical amenities and educational levels of Indian households in order to cluster and categorize Indian districts based on living standards and educational attainment. Household-level data on amenities such as electric lighting, television, mobile, car/scooter ownership; kitchen, toilet, bathing facilities and home ownership of families; as well as educational levels are used for 640 Indian districts, each consisting of 26 parameters (i.e. attributes). This makes the data set fairly large and complex for a clustering problem, and in order to preserve data granularity, fuzzy C-means (FCM) clustering algorithm has been chosen for analysis. The features of the algorithm are briefly presented. The analysis considers 4–10 clusters for the data set. The quality of clustering with larger clusters is discussed with appropriate indices. The results of computation yield the correlation between the variables which allow us to look at the relationships between them. The results also yield the classification of districts in a scale ‘well-off’ to ‘disadvantaged’ for various levels of clustering and show how the number of districts for each variable changes with the number of clusters. It is argued that these results can be used to orient investment plans for various sectors such as education and housing and establish ease-of-living ranking indices for districts which, in turn, can establish a rational basis for coordinated development of Indian regional economies.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Ahlborn, M., & Wortmann, M. (2018). The core-periphery pattern of European business cycles: A fuzzy clustering approach. Journal of Macroeconomics, 55, 12–27.CrossRef Ahlborn, M., & Wortmann, M. (2018). The core-periphery pattern of European business cycles: A fuzzy clustering approach. Journal of Macroeconomics, 55, 12–27.CrossRef
go back to reference Bezdek, J. C. (1981). Pattern recognition with fuzzy objective function algorithms. Plenum Press.CrossRef Bezdek, J. C. (1981). Pattern recognition with fuzzy objective function algorithms. Plenum Press.CrossRef
go back to reference Bezdek, J. C., & Pal, S. K. (1992). Fuzzy models for pattern recognition: Methods that search for structure in data. IEEE Press. Bezdek, J. C., & Pal, S. K. (1992). Fuzzy models for pattern recognition: Methods that search for structure in data. IEEE Press.
go back to reference Chen, Y. (2009). Research on the income of rural residents of Sichuan province based on fuzzy C-mean clustering. In Sixth international conference on fuzzy systems and knowledge discovery (pp. 151–155). Chen, Y. (2009). Research on the income of rural residents of Sichuan province based on fuzzy C-mean clustering. In Sixth international conference on fuzzy systems and knowledge discovery (pp. 151–155).
go back to reference Gokten, P. O, Baser, F. & Gokten, S. (2017). Using fuzzy c-means clustering algorithm in financial health scoring, Audit Financiar, XV, 3(147), 385–394. Gokten, P. O, Baser, F. & Gokten, S. (2017). Using fuzzy c-means clustering algorithm in financial health scoring, Audit Financiar, XV, 3(147), 385–394.
go back to reference Gupta, A. K., Ladusingh, L., & Borkotoky, K. (2016). Spatial clustering and risk factors of infant mortality: District-level assessment of high-focus states in India. Genus, 72(1), 2.CrossRef Gupta, A. K., Ladusingh, L., & Borkotoky, K. (2016). Spatial clustering and risk factors of infant mortality: District-level assessment of high-focus states in India. Genus, 72(1), 2.CrossRef
go back to reference Rawashdeh, M., & Ralescu, A. (2012). Crisp and fuzzy cluster validity: Generalized intra-inter silhouette index. In Annual meeting of the North American fuzzy information processing society (NAFIPS) Rawashdeh, M., & Ralescu, A. (2012). Crisp and fuzzy cluster validity: Generalized intra-inter silhouette index. In Annual meeting of the North American fuzzy information processing society (NAFIPS)
go back to reference Shuo, Y., & JiQing Y. (2011). The economic status of household consumption expenditure in 31 provinces and regions in China. In 3rd international conference on computer research and development (pp. 464–467). Shuo, Y., & JiQing Y. (2011). The economic status of household consumption expenditure in 31 provinces and regions in China. In 3rd international conference on computer research and development (pp. 464–467).
go back to reference Sun, J., Zhao, H., Xia, T., & Hu, F. (2011). Study on Chinese corporate social responsibility evaluation based on fuzzy C-means clustering. In International conference on computer and management (CAMAN) (pp. 1–4). Sun, J., Zhao, H., Xia, T., & Hu, F. (2011). Study on Chinese corporate social responsibility evaluation based on fuzzy C-means clustering. In International conference on computer and management (CAMAN) (pp. 1–4).
go back to reference Tripathi, R., Nayak, A. K., Shahid, M., Lal, B., Gautam, P., Raja, R., Mohanty, S., Kumar, A., Panda, B. B., & Sahoo, R. N. (2015). Delineation of soil management zones for a rice cultivated area in eastern India using fuzzy clustering. CATENA, 133, 128–136.CrossRef Tripathi, R., Nayak, A. K., Shahid, M., Lal, B., Gautam, P., Raja, R., Mohanty, S., Kumar, A., Panda, B. B., & Sahoo, R. N. (2015). Delineation of soil management zones for a rice cultivated area in eastern India using fuzzy clustering. CATENA, 133, 128–136.CrossRef
go back to reference Zhao, Y., & Karypis, G. (2001) Criterion functions for document clustering: Experiments and analysis. Technical Report TR#01–40, Department of Computer Science, University of Minnesota. Zhao, Y., & Karypis, G. (2001) Criterion functions for document clustering: Experiments and analysis. Technical Report TR#01–40, Department of Computer Science, University of Minnesota.
Metadata
Title
Analysis of Quality of Living Data of Households of Indian Districts Using Machine Learning Approach of Fuzzy C-Means Clustering
Author
Supratik Sekhar Bhattacharya
Copyright Year
2022
Publisher
Springer Nature Singapore
DOI
https://doi.org/10.1007/978-981-16-4181-7_10