Skip to main content

2020 | OriginalPaper | Buchkapitel

Healthcare Decision-Making Over a Geographic, Socioeconomic, and Image Data Warehouse

verfasst von : Guilherme M. Rocha, Piero L. Capelo, Cristina D. A. Ciferri

Erschienen in: ADBIS, TPDL and EDA 2020 Common Workshops and Doctoral Consortium

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Geographic, socioeconomic, and image data enrich the range of analysis that can be achieved in the healthcare decision-making. In this paper, we focus on these complex data with the support of a data warehouse. We propose three designs of star schema to store them: jointed, split, and normalized. We consider healthcare applications that require data sharing and manage huge volumes of data, where the use of frameworks like Spark is needed. To this end, we propose SimSparkOLAP, a Spark strategy to efficiently process analytical queries extended with geographic, socioeconomic, and image similarity predicates. Performance tests showed that the normalized schema provided the best performance results, followed closely by the jointed schema, which in turn outperformed the split schema. We also carried out examples of semantic queries and discuss their importance to the healthcare decision-making.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Brito, J.J., Mosqueiro, T., Ciferri, R.R., Ciferri, C.D.A.: Faster cloud star joins with reduced disk spill and network communication. In: Proceedings of the International Conference on Computational Science (2016). Proc. Comput. Sci. 80, 74–85 Brito, J.J., Mosqueiro, T., Ciferri, R.R., Ciferri, C.D.A.: Faster cloud star joins with reduced disk spill and network communication. In: Proceedings of the International Conference on Computational Science (2016). Proc. Comput. Sci. 80, 74–85
2.
Zurück zum Zitat Burdakov, A., et al.: Bloom filter cascade application to SQL query implementation on Spark. In: Proceedings of the 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, pp. 187–192 (2019) Burdakov, A., et al.: Bloom filter cascade application to SQL query implementation on Spark. In: Proceedings of the 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, pp. 187–192 (2019)
3.
Zurück zum Zitat Cuzzocrea, A.: Warehousing and protecting big data: state-of-the-art-analysis, methodologies, future challenges. In: Proceedings of the International Conference on Internet of Things and Cloud Computing. Article No.: 14, pp. 1–7 (2016) Cuzzocrea, A.: Warehousing and protecting big data: state-of-the-art-analysis, methodologies, future challenges. In: Proceedings of the International Conference on Internet of Things and Cloud Computing. Article No.: 14, pp. 1–7 (2016)
4.
Zurück zum Zitat Ferrahi, I., Bimonte, S., Boukhalfa, K.: Logical and physical design of spatial non-strict hierarchies in relational spatial data warehouse. IJDWM 15(1), 1–18 (2019) Ferrahi, I., Bimonte, S., Boukhalfa, K.: Logical and physical design of spatial non-strict hierarchies in relational spatial data warehouse. IJDWM 15(1), 1–18 (2019)
5.
Zurück zum Zitat Gonzalez, R., Woods, R.: Digital Image Processing, 3rd edn. Prentice-Hall, Upper Saddle River (2006) Gonzalez, R., Woods, R.: Digital Image Processing, 3rd edn. Prentice-Hall, Upper Saddle River (2006)
6.
Zurück zum Zitat Haralick, R.: Statistical and structural approaches to texture. Proc. IEEE 67(5), 786–804 (1979)CrossRef Haralick, R.: Statistical and structural approaches to texture. Proc. IEEE 67(5), 786–804 (1979)CrossRef
7.
Zurück zum Zitat Jin, X., Han, J., Cao, L., Luo, J., Ding, B., Lin, C.X.: Visual cube and on-line analytical processing of images. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 849–858 (2010) Jin, X., Han, J., Cao, L., Luo, J., Ding, B., Lin, C.X.: Visual cube and on-line analytical processing of images. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 849–858 (2010)
8.
Zurück zum Zitat Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling, 2nd edn. Wiley, Hoboken (2002) Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling, 2nd edn. Wiley, Hoboken (2002)
9.
Zurück zum Zitat Kuo, M.H., Sahama, T., Kushniruk, A., Borycki, E., Grunwell, D.: Health big data analytics: current perspectives, challenges and potential solutions. Int. J. Big Data Intell. 1, 114–126 (2014) CrossRef Kuo, M.H., Sahama, T., Kushniruk, A., Borycki, E., Grunwell, D.: Health big data analytics: current perspectives, challenges and potential solutions. Int. J. Big Data Intell. 1, 114–126 (2014) CrossRef
10.
Zurück zum Zitat Li, D., Zhang, W., Shen, S., Zhang, Y.: SES-LSH: shuffle-efficient locality sensitive hashing for distributed similarity search. In: Proceedings of the IEEE International Conference on Web Services, pp. 822–827 (2017) Li, D., Zhang, W., Shen, S., Zhang, Y.: SES-LSH: shuffle-efficient locality sensitive hashing for distributed similarity search. In: Proceedings of the IEEE International Conference on Web Services, pp. 822–827 (2017)
11.
Zurück zum Zitat Mahase, E.: Covid-19: death rate is 0.66% and increases with age, study estimates. BMJ 369 (2020) Mahase, E.: Covid-19: death rate is 0.66% and increases with age, study estimates. BMJ 369 (2020)
12.
Zurück zum Zitat Nguyen, T.D.T., Huh, E.N.: An efficient similar image search framework for large-scale data on cloud. In: Proceedings of the ACM International Conference on Ubiquitous Information Management and Communication, pp. 65:1–65:8 (2017) Nguyen, T.D.T., Huh, E.N.: An efficient similar image search framework for large-scale data on cloud. In: Proceedings of the ACM International Conference on Ubiquitous Information Management and Communication, pp. 65:1–65:8 (2017)
13.
Zurück zum Zitat Richardson, S., et al.: Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City area. JAMA 323, 2052–2059 (2020)CrossRef Richardson, S., et al.: Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City area. JAMA 323, 2052–2059 (2020)CrossRef
14.
Zurück zum Zitat Rocha, G.M., Ciferri, C.D.A.: ImgDW generator: a tool for generating data for medical image data warehouses. In: SBBD 2018 Proceedings Companion, pp. 23–28 (2018) Rocha, G.M., Ciferri, C.D.A.: ImgDW generator: a tool for generating data for medical image data warehouses. In: SBBD 2018 Proceedings Companion, pp. 23–28 (2018)
15.
Zurück zum Zitat Rocha, G.M., Ciferri, C.D.A.: Processamento eficiente de consultas analíticas estendidas com predicado de similaridade em Spark. In: Proceedings of the 34th Brazilian Symposium on Databases, pp. 1–6 (2019, in Portuguese) Rocha, G.M., Ciferri, C.D.A.: Processamento eficiente de consultas analíticas estendidas com predicado de similaridade em Spark. In: Proceedings of the 34th Brazilian Symposium on Databases, pp. 1–6 (2019, in Portuguese)
17.
Zurück zum Zitat Teixeira, J.W., Annibal, L.P., Felipe, J.C., Ciferri, R.R., Ciferri, C.D.A.: A similarity-based data warehousing environment for medical images. Comput. Biol. Med. 66, 190–208 (2015)CrossRef Teixeira, J.W., Annibal, L.P., Felipe, J.C., Ciferri, R.R., Ciferri, C.D.A.: A similarity-based data warehousing environment for medical images. Comput. Biol. Med. 66, 190–208 (2015)CrossRef
18.
Zurück zum Zitat Traina, C., Filho, R.F.S., Traina, A.J.M., Vieira, M.R., Faloutsos, C.: The omni-family of all-purpose access methods: a simple and effective way to make similarity search more efficient. VLDB J. 16(4), 483–505 (2007)CrossRef Traina, C., Filho, R.F.S., Traina, A.J.M., Vieira, M.R., Faloutsos, C.: The omni-family of all-purpose access methods: a simple and effective way to make similarity search more efficient. VLDB J. 16(4), 483–505 (2007)CrossRef
19.
Zurück zum Zitat Traina, C., Moriyama, A., Rocha, G.M., Cordeiro, R., Ciferri, C.D.A., Traina, A.J.M.: The SimilarQL framework: similarity queries in plain SQL. In: Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, pp. 1–4 (2019) Traina, C., Moriyama, A., Rocha, G.M., Cordeiro, R., Ciferri, C.D.A., Traina, A.J.M.: The SimilarQL framework: similarity queries in plain SQL. In: Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing, pp. 1–4 (2019)
21.
Zurück zum Zitat Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, pp. 10–10 (2010) Zaharia, M., Chowdhury, M., Franklin, M.J., Shenker, S., Stoica, I.: Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, pp. 10–10 (2010)
22.
Zurück zum Zitat Zhao, J., et al.: Relationship between the ABO blood group and the COVID-19 susceptibility. medRxiv (2020) Zhao, J., et al.: Relationship between the ABO blood group and the COVID-19 susceptibility. medRxiv (2020)
Metadaten
Titel
Healthcare Decision-Making Over a Geographic, Socioeconomic, and Image Data Warehouse
verfasst von
Guilherme M. Rocha
Piero L. Capelo
Cristina D. A. Ciferri
Copyright-Jahr
2020
DOI
https://doi.org/10.1007/978-3-030-55814-7_7