Top

Published in:

2022 | OriginalPaper | Chapter

10. Analysis of Quality of Living Data of Households of Indian Districts Using Machine Learning Approach of Fuzzy C-Means Clustering

Author : Supratik Sekhar Bhattacharya

Published in: Persistent and Emerging Challenges to Development

Publisher: Springer Nature Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

Machine learning is used to analyse the 2011 census data of physical amenities and educational levels of Indian households in order to cluster and categorize Indian districts based on living standards and educational attainment. Household-level data on amenities such as electric lighting, television, mobile, car/scooter ownership; kitchen, toilet, bathing facilities and home ownership of families; as well as educational levels are used for 640 Indian districts, each consisting of 26 parameters (i.e. attributes). This makes the data set fairly large and complex for a clustering problem, and in order to preserve data granularity, fuzzy C-means (FCM) clustering algorithm has been chosen for analysis. The features of the algorithm are briefly presented. The analysis considers 4–10 clusters for the data set. The quality of clustering with larger clusters is discussed with appropriate indices. The results of computation yield the correlation between the variables which allow us to look at the relationships between them. The results also yield the classification of districts in a scale ‘well-off’ to ‘disadvantaged’ for various levels of clustering and show how the number of districts for each variable changes with the number of clusters. It is argued that these results can be used to orient investment plans for various sectors such as education and housing and establish ease-of-living ranking indices for districts which, in turn, can establish a rational basis for coordinated development of Indian regional economies.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

previous chapter Incidence of Wasted Pregnancy and Health Facilities: An Empirical Study of the Indian Women

next chapter Multidimensional Poverty in Rural India: An Exploratory Study of Purulia District

Ahlborn, M., & Wortmann, M. (2018). The core-periphery pattern of European business cycles: A fuzzy clustering approach. Journal of Macroeconomics, 55, 12–27.CrossRef

Bezdek, J. C. (1981). Pattern recognition with fuzzy objective function algorithms. Plenum Press.CrossRef

Bezdek, J. C., & Pal, S. K. (1992). Fuzzy models for pattern recognition: Methods that search for structure in data. IEEE Press.

Chen, Y. (2009). Research on the income of rural residents of Sichuan province based on fuzzy C-mean clustering. In Sixth international conference on fuzzy systems and knowledge discovery (pp. 151–155).

Gokten, P. O, Baser, F. & Gokten, S. (2017). Using fuzzy c-means clustering algorithm in financial health scoring, Audit Financiar, XV, 3(147), 385–394.

Gupta, A. K., Ladusingh, L., & Borkotoky, K. (2016). Spatial clustering and risk factors of infant mortality: District-level assessment of high-focus states in India. Genus, 72(1), 2.CrossRef

India-districts-census (2011) http://censusindia.gov.in/2011-Common/CensusData2011.html, https://github.com/nishusharma1608/India-Census-2011-Analysis/blob/master/india-districts-census-2011.csv.

Rawashdeh, M., & Ralescu, A. (2012). Crisp and fuzzy cluster validity: Generalized intra-inter silhouette index. In Annual meeting of the North American fuzzy information processing society (NAFIPS)

Shuo, Y., & JiQing Y. (2011). The economic status of household consumption expenditure in 31 provinces and regions in China. In 3rd international conference on computer research and development (pp. 464–467).

Sun, J., Zhao, H., Xia, T., & Hu, F. (2011). Study on Chinese corporate social responsibility evaluation based on fuzzy C-means clustering. In International conference on computer and management (CAMAN) (pp. 1–4).

Tripathi, R., Nayak, A. K., Shahid, M., Lal, B., Gautam, P., Raja, R., Mohanty, S., Kumar, A., Panda, B. B., & Sahoo, R. N. (2015). Delineation of soil management zones for a rice cultivated area in eastern India using fuzzy clustering. CATENA, 133, 128–136.CrossRef

Zhao, Y., & Karypis, G. (2001) Criterion functions for document clustering: Experiments and analysis. Technical Report TR#01–40, Department of Computer Science, University of Minnesota.

Title: Analysis of Quality of Living Data of Households of Indian Districts Using Machine Learning Approach of Fuzzy C-Means Clustering
Author: Supratik Sekhar Bhattacharya
Publisher: Springer Nature Singapore
Book: Persistent and Emerging Challenges to Development
Print ISBN: 978-981-16-4180-0

Electronic ISBN: 978-981-16-4181-7

Copyright Year: 2022
DOI: https://doi.org/10.1007/978-981-16-4181-7_10