Skip to main content
Erschienen in: The Journal of Supercomputing 2/2018

24.10.2017

TDRM: tensor-based data representation and mining for healthcare data in cloud computing environments

verfasst von: Rajinder Sandhu, Navroop Kaur, Sandeep K. Sood, Rajkumar Buyya

Erschienen in: The Journal of Supercomputing | Ausgabe 2/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Big data analytics proved to be one of the most influential forces in today’s competitive business environments due to its ability to generate new insights by processing a large volume and variety of data. Storing as well as mining these datasets is one of the primary challenges of the big data era. If data are stored in a well-defined pattern, then its updation mining and deletion processes become easy. In this paper, granular computing concept is used to store heterogeneous data in the format of tensor. A multi-dimensional matrix, also known as tensor, stores data in the raw format, and then, raw tensor is replicated to multiple tensors of different abstraction levels based on concept hierarchy of each attribute. Mathematical foundation of tensor formation and query processing are developed. The proposed method is successful in creating tensors of a diabetes dataset proving its applicability. The proposed system provides faster computation, low response time, better privacy and high relevancy as compared to baseline PARAFAC2 and CANDELINC tensor analysis method when run on Microsoft Azure cloud infrastructure. Different levels of information granules in the form of tensors make data storage and its query processing effective.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Hashem IAT, Yaqoob I, Badrul Anuar N, Mokhtar S, Gani A, Ullah Khan S (2014) The rise of ‘big data’ on cloud computing: review and open research issues. Inf Syst 47:98–115CrossRef Hashem IAT, Yaqoob I, Badrul Anuar N, Mokhtar S, Gani A, Ullah Khan S (2014) The rise of ‘big data’ on cloud computing: review and open research issues. Inf Syst 47:98–115CrossRef
2.
Zurück zum Zitat Clifton L, Clifton DA, Pimentel MAF, Watkinson PJ, Tarassenko L (2014) Predictive monitoring of mobile patients by combining clinical observations with data from wearable sensors. IEEE J Biomed Health Inform 18(3):722–30CrossRef Clifton L, Clifton DA, Pimentel MAF, Watkinson PJ, Tarassenko L (2014) Predictive monitoring of mobile patients by combining clinical observations with data from wearable sensors. IEEE J Biomed Health Inform 18(3):722–30CrossRef
3.
Zurück zum Zitat Dean J (2014) Big data, data mining, and machine learning. Wiley, LondonCrossRef Dean J (2014) Big data, data mining, and machine learning. Wiley, LondonCrossRef
4.
Zurück zum Zitat Qian Y, Liang J, Wu WZZ, Dang C (2011) Information granularity in fuzzy binary GrC model. IEEE Trans Fuzzy Syst 19(2):253–264CrossRef Qian Y, Liang J, Wu WZZ, Dang C (2011) Information granularity in fuzzy binary GrC model. IEEE Trans Fuzzy Syst 19(2):253–264CrossRef
5.
Zurück zum Zitat Lin TY (2003) Granular computing. In: Proceedings of 4th Chinese National Conference Rough Sets Soft Computing, vol 2639, pp 16–24 Lin TY (2003) Granular computing. In: Proceedings of 4th Chinese National Conference Rough Sets Soft Computing, vol 2639, pp 16–24
6.
Zurück zum Zitat Yao Y (2005) Perspectives of granular computing. In: 2005 IEEE International Conference on Granular Computing, pp 85–90 Yao Y (2005) Perspectives of granular computing. In: 2005 IEEE International Conference on Granular Computing, pp 85–90
7.
Zurück zum Zitat Kuang L, Hao F, Yang LT, Lin M, Luo C, Min G (2014) A tensor-based approach for big data representation and dimensionality reduction. IEEE Trans Emerg Top Comput 2(3):280–291CrossRef Kuang L, Hao F, Yang LT, Lin M, Luo C, Min G (2014) A tensor-based approach for big data representation and dimensionality reduction. IEEE Trans Emerg Top Comput 2(3):280–291CrossRef
8.
Zurück zum Zitat Yuan L, Yu Z, Luo W, Hu Y, Feng L, Zhu AX (2015) A hierarchical tensor-based approach to compressing, updating and querying geospatial data. IEEE Trans Knowl Data Eng 27(2):312–325CrossRef Yuan L, Yu Z, Luo W, Hu Y, Feng L, Zhu AX (2015) A hierarchical tensor-based approach to compressing, updating and querying geospatial data. IEEE Trans Knowl Data Eng 27(2):312–325CrossRef
9.
Zurück zum Zitat Erdman AG, Keefe DF, Schiestl R (2013) Grand challenge: applying regulatory science and big data to improve medical device innovation. IEEE Trans Biomed Eng 60(3):700–706CrossRef Erdman AG, Keefe DF, Schiestl R (2013) Grand challenge: applying regulatory science and big data to improve medical device innovation. IEEE Trans Biomed Eng 60(3):700–706CrossRef
10.
Zurück zum Zitat Lin W, Dou W, Zhou Z, Liu C (2015) A cloud-based framework for Home-diagnosis service over big medical data. J Syst Softw 102:192–206CrossRef Lin W, Dou W, Zhou Z, Liu C (2015) A cloud-based framework for Home-diagnosis service over big medical data. J Syst Softw 102:192–206CrossRef
11.
Zurück zum Zitat Zhang F, Cao J, Khan SU, Li K, Hwang K (2014) A task-level adaptive MapReduce framework for real-time streaming data in healthcare applications. Future Gener Comput Syst 43–44:149–160 Zhang F, Cao J, Khan SU, Li K, Hwang K (2014) A task-level adaptive MapReduce framework for real-time streaming data in healthcare applications. Future Gener Comput Syst 43–44:149–160
12.
Zurück zum Zitat Castiglione A, Pizzolante R, De Santis A, Carpentieri B, Castiglione A, Palmieri F (2015) Cloud-based adaptive compression and secure management services for 3D healthcare data. Future Gener Comput Syst 43–44:120–134CrossRef Castiglione A, Pizzolante R, De Santis A, Carpentieri B, Castiglione A, Palmieri F (2015) Cloud-based adaptive compression and secure management services for 3D healthcare data. Future Gener Comput Syst 43–44:120–134CrossRef
13.
Zurück zum Zitat Saleem M, Kamdar MR, Iqbal A, Sampath S, Deus HF, Ngonga Ngomo AC (2014) Big linked cancer data: integrating linked TCGA and PubMed. J Web Semant 27:34–41CrossRef Saleem M, Kamdar MR, Iqbal A, Sampath S, Deus HF, Ngonga Ngomo AC (2014) Big linked cancer data: integrating linked TCGA and PubMed. J Web Semant 27:34–41CrossRef
14.
Zurück zum Zitat Jiang P, Winkley J, Zhao C, Munnoch R, Min G, Yang LT (2014) An intelligent information forwarder for healthcare big data systems with distributed wearable sensors. IEEE Syst J 10(3):1–13 Jiang P, Winkley J, Zhao C, Munnoch R, Min G, Yang LT (2014) An intelligent information forwarder for healthcare big data systems with distributed wearable sensors. IEEE Syst J 10(3):1–13
15.
Zurück zum Zitat Belaud JP, Negny S, Dupros F, Michéa D, Vautrin B (2014) Collaborative simulation and scientific big data analysis: illustration for sustainability in natural hazards management and chemical process engineering. Comput Ind 65(3):521–535CrossRef Belaud JP, Negny S, Dupros F, Michéa D, Vautrin B (2014) Collaborative simulation and scientific big data analysis: illustration for sustainability in natural hazards management and chemical process engineering. Comput Ind 65(3):521–535CrossRef
16.
Zurück zum Zitat Philip Chen CL, Zhang CY (2014) Data-intensive applications, challenges, techniques and technologies: a survey on big data. Inf Sci (Ny) 275:314–347CrossRef Philip Chen CL, Zhang CY (2014) Data-intensive applications, challenges, techniques and technologies: a survey on big data. Inf Sci (Ny) 275:314–347CrossRef
17.
Zurück zum Zitat Yao Y (2007) The art of granular computing. Rough Sets Intell Syst Paradig 4585:101–112CrossRef Yao Y (2007) The art of granular computing. Rough Sets Intell Syst Paradig 4585:101–112CrossRef
18.
Zurück zum Zitat Skowron A, Stepaniuk J (2001) Information granules: towards foundations of granular computing. Int J Intell Syst 16(1):57–85CrossRefMATH Skowron A, Stepaniuk J (2001) Information granules: towards foundations of granular computing. Int J Intell Syst 16(1):57–85CrossRefMATH
19.
Zurück zum Zitat Qiu GF, Ma JM, Yang HZ, Zhang WX (2010) A mathematical model for concept granular computing systems. Sci China Ser F Inf Sci 53(7):1397–1408MathSciNetCrossRef Qiu GF, Ma JM, Yang HZ, Zhang WX (2010) A mathematical model for concept granular computing systems. Sci China Ser F Inf Sci 53(7):1397–1408MathSciNetCrossRef
20.
Zurück zum Zitat Lin TY (2008) Granular computing: common practices and mathematical models.’ In: IEEE International Conference on Fuzzy Systems, pp 2405–2411 Lin TY (2008) Granular computing: common practices and mathematical models.’ In: IEEE International Conference on Fuzzy Systems, pp 2405–2411
21.
Zurück zum Zitat Qian Y, Zhang H, Li F, Hu Q, Liang J (2013) International journal of approximate reasoning set-based granular computing: a lattice model. Int J Approx Reason 1:1–19 Qian Y, Zhang H, Li F, Hu Q, Liang J (2013) International journal of approximate reasoning set-based granular computing: a lattice model. Int J Approx Reason 1:1–19
22.
Zurück zum Zitat Yao YY (2002) A generalized decision logic language for granular computing. In: 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE’02. Proceedings (Cat. No. 02CH37291), vol 1 Yao YY (2002) A generalized decision logic language for granular computing. In: 2002 IEEE World Congress on Computational Intelligence. 2002 IEEE International Conference on Fuzzy Systems. FUZZ-IEEE’02. Proceedings (Cat. No. 02CH37291), vol 1
23.
Zurück zum Zitat Song M, Wang Y (2015) Human centricity and information granularity in the agenda of theories and applications of soft computing. Appl Soft Comput J 27:610–613CrossRef Song M, Wang Y (2015) Human centricity and information granularity in the agenda of theories and applications of soft computing. Appl Soft Comput J 27:610–613CrossRef
24.
25.
Zurück zum Zitat Xu W, Li W (2014) Granular computing approach to two-way learning based on formal concept analysis in fuzzy datasets. IEEE Trans Cybern 46(2168–2275 (Electronic)):366–379 Xu W, Li W (2014) Granular computing approach to two-way learning based on formal concept analysis in fuzzy datasets. IEEE Trans Cybern 46(2168–2275 (Electronic)):366–379
26.
Zurück zum Zitat Yao Y (2009) Interpreting concept learning in cognitive informatics and granular computing. IEEE Trans Syst Man Cybern Part B Cybern 39(4):855–866CrossRef Yao Y (2009) Interpreting concept learning in cognitive informatics and granular computing. IEEE Trans Syst Man Cybern Part B Cybern 39(4):855–866CrossRef
27.
Zurück zum Zitat Gacek A (2015) Signal processing and time series description: a perspective of computational intelligence and granular computing. Appl Soft Comput 27:590–601CrossRef Gacek A (2015) Signal processing and time series description: a perspective of computational intelligence and granular computing. Appl Soft Comput 27:590–601CrossRef
28.
Zurück zum Zitat Sanchez M a, Castillo O, Castro JR, Melin P (2014) Fuzzy granular gravitational clustering algorithm for multivariate data. Inf Sci (Ny) 279:498–511MathSciNetCrossRefMATH Sanchez M a, Castillo O, Castro JR, Melin P (2014) Fuzzy granular gravitational clustering algorithm for multivariate data. Inf Sci (Ny) 279:498–511MathSciNetCrossRefMATH
29.
Zurück zum Zitat Wang X, Liu X, Zhang L (2014) A rapid fuzzy rule clustering method based on granular computing. Appl Soft Comput 24:534–542CrossRef Wang X, Liu X, Zhang L (2014) A rapid fuzzy rule clustering method based on granular computing. Appl Soft Comput 24:534–542CrossRef
30.
Zurück zum Zitat Peters G (2011) Granular box regression. IEEE Trans Fuzzy Syst 19(6):1141–1152CrossRef Peters G (2011) Granular box regression. IEEE Trans Fuzzy Syst 19(6):1141–1152CrossRef
31.
Zurück zum Zitat a Cimino MGC, Lazzerini B, Marcelloni F, Pedrycz W (2014) Genetic interval neural networks for granular data regression. Inf Sci (Ny) 257:313–330CrossRef a Cimino MGC, Lazzerini B, Marcelloni F, Pedrycz W (2014) Genetic interval neural networks for granular data regression. Inf Sci (Ny) 257:313–330CrossRef
32.
Zurück zum Zitat Cruz-Vega I, Escalante HJ, Reyes CA, Gonzalez JA, Rosales A (2015) Surrogate modeling based on an adaptive network and granular computing. Soft Comput 20(4):1549–1563CrossRef Cruz-Vega I, Escalante HJ, Reyes CA, Gonzalez JA, Rosales A (2015) Surrogate modeling based on an adaptive network and granular computing. Soft Comput 20(4):1549–1563CrossRef
33.
Zurück zum Zitat Liu Y, Jiang Y, Huang L (2010) Modeling complex architectures based on granular computing on ontology. IEEE Trans Fuzzy Syst 18(3):585–598CrossRef Liu Y, Jiang Y, Huang L (2010) Modeling complex architectures based on granular computing on ontology. IEEE Trans Fuzzy Syst 18(3):585–598CrossRef
34.
Zurück zum Zitat Zhou D, Dai X (2015) A method for discovering typical process sequence using granular computing and similarity algorithm based on part features. Int J Adv Manuf Technol 78(9–12):1781–1793CrossRef Zhou D, Dai X (2015) A method for discovering typical process sequence using granular computing and similarity algorithm based on part features. Int J Adv Manuf Technol 78(9–12):1781–1793CrossRef
36.
Zurück zum Zitat Wu WZ, Leung Y, Mi JS (2009) Granular computing and knowledge reduction in formal contexts. IEEE Trans Knowl Data Eng 21(10):1461–1474CrossRef Wu WZ, Leung Y, Mi JS (2009) Granular computing and knowledge reduction in formal contexts. IEEE Trans Knowl Data Eng 21(10):1461–1474CrossRef
37.
Zurück zum Zitat Bargiela A, Pedrycz W (2006) Toward a theory of granular computing for human-centred information processing. IEEE Trans Fuzzy Syst 16(2):320–330CrossRef Bargiela A, Pedrycz W (2006) Toward a theory of granular computing for human-centred information processing. IEEE Trans Fuzzy Syst 16(2):320–330CrossRef
38.
Zurück zum Zitat Yager RR (2008) Intelligent social network analysis using granular computing. Int J Intell Syst 23(11):1197–1219CrossRefMATH Yager RR (2008) Intelligent social network analysis using granular computing. Int J Intell Syst 23(11):1197–1219CrossRefMATH
39.
Zurück zum Zitat Qian J, Lv P, Yue X, Liu C, Jing Z (2015) Hierarchical attribute reduction algorithms for big data using MapReduce. Knowl Based Syst 73:18–31CrossRef Qian J, Lv P, Yue X, Liu C, Jing Z (2015) Hierarchical attribute reduction algorithms for big data using MapReduce. Knowl Based Syst 73:18–31CrossRef
40.
Zurück zum Zitat Wang D-W, Liau C-J, Hsu T-S (2004) Medical privacy protection based on granular computing. Artif Intell Med 32(2):137–149CrossRef Wang D-W, Liau C-J, Hsu T-S (2004) Medical privacy protection based on granular computing. Artif Intell Med 32(2):137–149CrossRef
41.
Zurück zum Zitat Minaev YN, Filimonova OY, Minaeva JI (2014) Kronecker (tensor) models of fuzzy-set granules. Cybern Syst Anal 50(4):519–528MathSciNetCrossRefMATH Minaev YN, Filimonova OY, Minaeva JI (2014) Kronecker (tensor) models of fuzzy-set granules. Cybern Syst Anal 50(4):519–528MathSciNetCrossRefMATH
43.
Zurück zum Zitat Strack B et al (2014) Impact of HbA1c measurement on hospital readmission rates: analysis of 70,000 clinical database patient records. Biomed Res Int 2014:1–11CrossRef Strack B et al (2014) Impact of HbA1c measurement on hospital readmission rates: analysis of 70,000 clinical database patient records. Biomed Res Int 2014:1–11CrossRef
45.
Zurück zum Zitat Meng S, Dou W, Zhang X, Chen J (2014) KASR: a keyword-aware service recommendation method on mapreduce for big data applications. IEEE Trans Parallel Distrib Syst 25(12):3221–3231CrossRef Meng S, Dou W, Zhang X, Chen J (2014) KASR: a keyword-aware service recommendation method on mapreduce for big data applications. IEEE Trans Parallel Distrib Syst 25(12):3221–3231CrossRef
46.
Zurück zum Zitat Wang D, Yeo CK (2012) Exploring locality of reference in P2P VoD systems. IEEE Trans Multimed 14(4):1309–1323CrossRef Wang D, Yeo CK (2012) Exploring locality of reference in P2P VoD systems. IEEE Trans Multimed 14(4):1309–1323CrossRef
47.
Zurück zum Zitat Pouryazdian S, Beheshti S, Krishnan S (2016) CANDECOMP/PARAFAC model order selection based on reconstruction error in the presence of Kronecker structured colored noise. Digit Signal Process 48:12–26MathSciNetCrossRef Pouryazdian S, Beheshti S, Krishnan S (2016) CANDECOMP/PARAFAC model order selection based on reconstruction error in the presence of Kronecker structured colored noise. Digit Signal Process 48:12–26MathSciNetCrossRef
48.
49.
Zurück zum Zitat Carroll JD, Chang J-J (1970) Analysis of individual differences in multidimensional scaling via an n-way generalization of ‘Eckart-Young’ decomposition. Psychometrika 35(3):283–319CrossRefMATH Carroll JD, Chang J-J (1970) Analysis of individual differences in multidimensional scaling via an n-way generalization of ‘Eckart-Young’ decomposition. Psychometrika 35(3):283–319CrossRefMATH
50.
Zurück zum Zitat Kiers HAL, ten Berge JMF, Bro R (1999) PARAFAC2—Part I. A direct fitting algorithm for the PARAFAC2 model. J Chemom 13(3–4):275–294CrossRef Kiers HAL, ten Berge JMF, Bro R (1999) PARAFAC2—Part I. A direct fitting algorithm for the PARAFAC2 model. J Chemom 13(3–4):275–294CrossRef
51.
Zurück zum Zitat Douglas Carroll J, Pruzansky S, Kruskal JB (1980) Candelinc: a general approach to multidimensional analysis of many-way arrays with linear constraints on parameters. Psychometrika 45(1):3–24MathSciNetCrossRefMATH Douglas Carroll J, Pruzansky S, Kruskal JB (1980) Candelinc: a general approach to multidimensional analysis of many-way arrays with linear constraints on parameters. Psychometrika 45(1):3–24MathSciNetCrossRefMATH
53.
Zurück zum Zitat Cortez P, Silva A (2008) Using data mining to predict secondary school student performance. In: 5th Annual Future Business Technology Conference, vol 2003, no 2000, pp 5–12 Cortez P, Silva A (2008) Using data mining to predict secondary school student performance. In: 5th Annual Future Business Technology Conference, vol 2003, no 2000, pp 5–12
Metadaten
Titel
TDRM: tensor-based data representation and mining for healthcare data in cloud computing environments
verfasst von
Rajinder Sandhu
Navroop Kaur
Sandeep K. Sood
Rajkumar Buyya
Publikationsdatum
24.10.2017
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 2/2018
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-017-2163-y

Weitere Artikel der Ausgabe 2/2018

The Journal of Supercomputing 2/2018 Zur Ausgabe