Skip to main content
Top

2020 | OriginalPaper | Chapter

Healthcare Decision-Making Over a Geographic, Socioeconomic, and Image Data Warehouse

Authors : Guilherme M. Rocha, Piero L. Capelo, Cristina D. A. Ciferri

Published in: ADBIS, TPDL and EDA 2020 Common Workshops and Doctoral Consortium

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Geographic, socioeconomic, and image data enrich the range of analysis that can be achieved in the healthcare decision-making. In this paper, we focus on these complex data with the support of a data warehouse. We propose three designs of star schema to store them: jointed, split, and normalized. We consider healthcare applications that require data sharing and manage huge volumes of data, where the use of frameworks like Spark is needed. To this end, we propose SimSparkOLAP, a Spark strategy to efficiently process analytical queries extended with geographic, socioeconomic, and image similarity predicates. Performance tests showed that the normalized schema provided the best performance results, followed closely by the jointed schema, which in turn outperformed the split schema. We also carried out examples of semantic queries and discuss their importance to the healthcare decision-making.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Brito, J.J., Mosqueiro, T., Ciferri, R.R., Ciferri, C.D.A.: Faster cloud star joins with reduced disk spill and network communication. In: Proceedings of the International Conference on Computational Science (2016). Proc. Comput. Sci. 80, 74–85 Brito, J.J., Mosqueiro, T., Ciferri, R.R., Ciferri, C.D.A.: Faster cloud star joins with reduced disk spill and network communication. In: Proceedings of the International Conference on Computational Science (2016). Proc. Comput. Sci. 80, 74–85
2.
go back to reference Burdakov, A., et al.: Bloom filter cascade application to SQL query implementation on Spark. In: Proceedings of the 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, pp. 187–192 (2019) Burdakov, A., et al.: Bloom filter cascade application to SQL query implementation on Spark. In: Proceedings of the 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, pp. 187–192 (2019)
3.
go back to reference Cuzzocrea, A.: Warehousing and protecting big data: state-of-the-art-analysis, methodologies, future challenges. In: Proceedings of the International Conference on Internet of Things and Cloud Computing. Article No.: 14, pp. 1–7 (2016) Cuzzocrea, A.: Warehousing and protecting big data: state-of-the-art-analysis, methodologies, future challenges. In: Proceedings of the International Conference on Internet of Things and Cloud Computing. Article No.: 14, pp. 1–7 (2016)
4.
go back to reference Ferrahi, I., Bimonte, S., Boukhalfa, K.: Logical and physical design of spatial non-strict hierarchies in relational spatial data warehouse. IJDWM 15(1), 1–18 (2019) Ferrahi, I., Bimonte, S., Boukhalfa, K.: Logical and physical design of spatial non-strict hierarchies in relational spatial data warehouse. IJDWM 15(1), 1–18 (2019)
5.
go back to reference Gonzalez, R., Woods, R.: Digital Image Processing, 3rd edn. Prentice-Hall, Upper Saddle River (2006) Gonzalez, R., Woods, R.: Digital Image Processing, 3rd edn. Prentice-Hall, Upper Saddle River (2006)
6.
go back to reference Haralick, R.: Statistical and structural approaches to texture. Proc. IEEE 67(5), 786–804 (1979)CrossRef Haralick, R.: Statistical and structural approaches to texture. Proc. IEEE 67(5), 786–804 (1979)CrossRef
7.
go back to reference Jin, X., Han, J., Cao, L., Luo, J., Ding, B., Lin, C.X.: Visual cube and on-line analytical processing of images. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 849–858 (2010) Jin, X., Han, J., Cao, L., Luo, J., Ding, B., Lin, C.X.: Visual cube and on-line analytical processing of images. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 849–858 (2010)
8.
go back to reference Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling, 2nd edn. Wiley, Hoboken (2002) Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling, 2nd edn. Wiley, Hoboken (2002)
9.
go back to reference Kuo, M.H., Sahama, T., Kushniruk, A., Borycki, E., Grunwell, D.: Health big data analytics: current perspectives, challenges and potential solutions. Int. J. Big Data Intell. 1, 114–126 (2014) CrossRef Kuo, M.H., Sahama, T., Kushniruk, A., Borycki, E., Grunwell, D.: Health big data analytics: current perspectives, challenges and potential solutions. Int. J. Big Data Intell. 1, 114–126 (2014) CrossRef
10.
go back to reference Li, D., Zhang, W., Shen, S., Zhang, Y.: SES-LSH: shuffle-efficient locality sensitive hashing for distributed similarity search. In: Proceedings of the IEEE International Conference on Web Services, pp. 822–827 (2017) Li, D., Zhang, W., Shen, S., Zhang, Y.: SES-LSH: shuffle-efficient locality sensitive hashing for distributed similarity search. In: Proceedings of the IEEE International Conference on Web Services, pp. 822–827 (2017)
11.
go back to reference Mahase, E.: Covid-19: death rate is 0.66% and increases with age, study estimates. BMJ 369 (2020) Mahase, E.: Covid-19: death rate is 0.66% and increases with age, study estimates. BMJ 369 (2020)
12.
go back to reference Nguyen, T.D.T., Huh, E.N.: An efficient similar image search framework for large-scale data on cloud. In: Proceedings of the ACM International Conference on Ubiquitous Information Management and Communication, pp. 65:1–65:8 (2017) Nguyen, T.D.T., Huh, E.N.: An efficient similar image search framework for large-scale data on cloud. In: Proceedings of the ACM International Conference on Ubiquitous Information Management and Communication, pp. 65:1–65:8 (2017)
13.
go back to reference Richardson, S., et al.: Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City area. JAMA 323, 2052–2059 (2020)CrossRef Richardson, S., et al.: Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City area. JAMA 323, 2052–2059 (2020)CrossRef
14.
go back to reference Rocha, G.M., Ciferri, C.D.A.: ImgDW generator: a tool for generating data for medical image data warehouses. In: SBBD 2018 Proceedings Companion, pp. 23–28 (2018) Rocha, G.M., Ciferri, C.D.A.: ImgDW generator: a tool for generating data for medical image data warehouses. In: SBBD 2018 Proceedings Companion, pp. 23–28 (2018)
15.
go back to reference Rocha, G.M., Ciferri, C.D.A.: Processamento eficiente de consultas analíticas estendidas com predicado de similaridade em Spark. In: Proceedings of the 34th Brazilian Symposium on Databases, pp. 1–6 (2019, in Portuguese) Rocha, G.M., Ciferri, C.D.A.: Processamento eficiente de consultas analíticas estendidas com predicado de similaridade em Spark. In: Proceedings of the 34th Brazilian Symposium on Databases, pp. 1–6 (2019, in Portuguese)
17.
go back to reference Teixeira, J.W., Annibal, L.P., Felipe, J.C., Ciferri, R.R., Ciferri, C.D.A.: A similarity-based data warehousing environment for medical images. Comput. Biol. Med. 66, 190–208 (2015)CrossRef Teixeira, J.W., Annibal, L.P., Felipe, J.C., Ciferri, R.R., Ciferri, C.D.A.: A similarity-based data warehousing environment for medical images. Comput. Biol. Med. 66, 190–208 (2015)CrossRef
18.
go back to reference Traina, C., Filho, R.F.S., Traina, A.J.M., Vieira, M.R., Faloutsos, C.: The omni-family of all-purpose access methods: a simple and effective way to make similarity search more efficient. VLDB J. 16(4), 483–505 (2007)CrossRef Traina, C., Filho, R.F.S., Traina, A.J.M., Vieira, M.R., Faloutsos, C.: The omni-family of all-purpose access methods: a simple and effective way to make similarity search more efficient. VLDB J. 16(4), 483–505 (2007)CrossRef
19.
go back to reference Traina, C., Moriyama, A., Rocha, G.M., Cordeiro, R., Ciferri, C.D.A., Traina, A.J.M.: The SimilarQL framework: similarity queries in plain SQL. In: Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, pp. 1–4 (2019) Traina, C., Moriyama, A., Rocha, G.M., Cordeiro, R., Ciferri, C.D.A., Traina, A.J.M.: The SimilarQL framework: similarity queries in plain SQL. In: Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, pp. 1–4 (2019)
21.
go back to reference Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, pp. 10–10 (2010) Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, pp. 10–10 (2010)
22.
go back to reference Zhao, J., et al.: Relationship between the ABO blood group and the COVID-19 susceptibility. medRxiv (2020) Zhao, J., et al.: Relationship between the ABO blood group and the COVID-19 susceptibility. medRxiv (2020)
Metadata
Title
Healthcare Decision-Making Over a Geographic, Socioeconomic, and Image Data Warehouse
Authors
Guilherme M. Rocha
Piero L. Capelo
Cristina D. A. Ciferri
Copyright Year
2020
DOI
https://doi.org/10.1007/978-3-030-55814-7_7

Premium Partner