Skip to main content
Top
Published in: Cluster Computing 1/2017

27-08-2016

High-dimensionality priority selection scheme of bioinformatics information using Bernoulli distribution

Authors: Yoon-Su Jeong, Seung-Soo Shin, Kun-Hee Han

Published in: Cluster Computing | Issue 1/2017

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Recently, as the amount of genetic information has been increasing following the completion of the human genome project, bioinformatics information management has been coming to the fore. However, since bioinformatics information is composed of diverse kinds of genetic information, users cannot easily approach and use it. In the present paper, a high-dimensionality information management scheme is proposes that enables users to select those pieces of bioinformatics information that are highly frequently used using the Bernoulli distribution so that users can easily approach those pieces of bioinformatics information that are preferred by them. The proposed scheme is an approach to high-dimensionality priority selection that requires the presentation of two or more pieces of bioinformatics information. In addition, in the case of the proposed scheme, since the order of priority of information is determined based on the kinds, functions, and characteristics of bioinformatics information, users can easily approach bioinformatics information according to their purpose of use of the information. According to the results of experiments, the proposed scheme showed a success rate 11.6 % higher than that of existing schemes in terms of bioinformatics information searches and the delay time of bioinformatics information services used by independent users was shown to be 17.3 % shorter than that of existing schemes .

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Wang, M.D.: In the spotlight: bioinformatics. IEEE Rev. Biomed. Eng. 6, 3–8 (2013)CrossRef Wang, M.D.: In the spotlight: bioinformatics. IEEE Rev. Biomed. Eng. 6, 3–8 (2013)CrossRef
2.
go back to reference Irsoy, O., Yildiz, O.T., Alpaydin, E.: Design and analysis of classifier learning experiments in bioinformatics: survey and case studies. IEEE/ACM Trans. Comput. Biol. Bioinform. 9(6), 1663–1675 (2012)CrossRef Irsoy, O., Yildiz, O.T., Alpaydin, E.: Design and analysis of classifier learning experiments in bioinformatics: survey and case studies. IEEE/ACM Trans. Comput. Biol. Bioinform. 9(6), 1663–1675 (2012)CrossRef
3.
go back to reference Chen, Y.-P.P.: Guest editorial: advanced algorithms of bioinformatics. IEEE Trans. Comput. Biol. Bioinform. 10(2), 273 (2013)CrossRef Chen, Y.-P.P.: Guest editorial: advanced algorithms of bioinformatics. IEEE Trans. Comput. Biol. Bioinform. 10(2), 273 (2013)CrossRef
4.
go back to reference Kriegel, H.P., Kröger, P., Zimek, A.: Clustering high-dimensional data: a survey on subspace clustering, pattern-based clustering, and correlation clustering. ACM Trans. Knowl. Discov. Data 3(1), 1–58 (2009)CrossRef Kriegel, H.P., Kröger, P., Zimek, A.: Clustering high-dimensional data: a survey on subspace clustering, pattern-based clustering, and correlation clustering. ACM Trans. Knowl. Discov. Data 3(1), 1–58 (2009)CrossRef
5.
go back to reference Houle, M.E., Kriegel, H.P., Kröger, P., Schubert, E., Zimek, A.: Can shared-neighbor distances defeat the curse of dimensionality? Lecture notes in computer science. Sci. Stat. Database Manag. 6187, 482–500 (2010)CrossRef Houle, M.E., Kriegel, H.P., Kröger, P., Schubert, E., Zimek, A.: Can shared-neighbor distances defeat the curse of dimensionality? Lecture notes in computer science. Sci. Stat. Database Manag. 6187, 482–500 (2010)CrossRef
6.
go back to reference Agrawal, R., Gehrke, J., Gunopulos, P., Raghavan, P.: Automatic subspace clustering of high dimensional data. Data Min. Knowl. Discov. 11, 5–33 (2005)MathSciNetCrossRef Agrawal, R., Gehrke, J., Gunopulos, P., Raghavan, P.: Automatic subspace clustering of high dimensional data. Data Min. Knowl. Discov. 11, 5–33 (2005)MathSciNetCrossRef
7.
go back to reference K. Kailing, H. P. Kriegel, P. Kröger, “Density-Connected Subspace Clustering for High-Dimensional Data,” In Proc. of the 2004 SIAM International Conference on Data Mining, pp. 246, 2004 K. Kailing, H. P. Kriegel, P. Kröger, “Density-Connected Subspace Clustering for High-Dimensional Data,” In Proc. of the 2004 SIAM International Conference on Data Mining, pp. 246, 2004
8.
go back to reference Cordeiro De Amorim, R., Mirkin, B.: Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering. Pattern Recognition 45(3), 1061 (2012)CrossRef Cordeiro De Amorim, R., Mirkin, B.: Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering. Pattern Recognition 45(3), 1061 (2012)CrossRef
9.
go back to reference Böhm, C., Kailing, K., Kriegel, H.-P., Kröger, P.: Density connected clustering with local subspace preferences. In: Proceeeding of Fourth IEEE International Conference on Data Mining (ICDM’04), p. 27 (2004) Böhm, C., Kailing, K., Kriegel, H.-P., Kröger, P.: Density connected clustering with local subspace preferences. In: Proceeeding of Fourth IEEE International Conference on Data Mining (ICDM’04), p. 27 (2004)
10.
go back to reference Aggarwal, C.C., Wolf, J.L., Yu, P.S., Procopiuc, C., Park, J.S.: Fast algorithms for projected clustering. ACM SIGMOD Record, p. 61. ACM, New York (1999) Aggarwal, C.C., Wolf, J.L., Yu, P.S., Procopiuc, C., Park, J.S.: Fast algorithms for projected clustering. ACM SIGMOD Record, p. 61. ACM, New York (1999)
11.
go back to reference Kriegel, H., Kröger, P., Renz, M., Wurst S.: A generic framework for efficient subspace clustering of high-dimensional data. In: Proceeding of Fifth IEEE International Conference on Data Mining (ICDM’05), pp. 250–257 (2005) Kriegel, H., Kröger, P., Renz, M., Wurst S.: A generic framework for efficient subspace clustering of high-dimensional data. In: Proceeding of Fifth IEEE International Conference on Data Mining (ICDM’05), pp. 250–257 (2005)
12.
go back to reference Andersson, T., Handel, P.: Multiple-tone estimation by IEEE standard 1057 and the expectation-maximization algorithm. In: Proceeding of the 20th IEEE Instrumentation and Measurement Technology Conference, vol. 1, pp. 739–742 (2003) Andersson, T., Handel, P.: Multiple-tone estimation by IEEE standard 1057 and the expectation-maximization algorithm. In: Proceeding of the 20th IEEE Instrumentation and Measurement Technology Conference, vol. 1, pp. 739–742 (2003)
13.
go back to reference Wang, W.: Big data, big challenges. In: Proceeding of 2014 IEEE International Conference on Semantic Computing (ICSC), p. 6 (2014) Wang, W.: Big data, big challenges. In: Proceeding of 2014 IEEE International Conference on Semantic Computing (ICSC), p. 6 (2014)
14.
go back to reference Sowe, S.K., Kimata, T., Dong, M., Zettsu, K.: Managing heterogeneous sensor data on a big data platform: IoT services for data-intensive science. In: Proceeding of 2014 IEEE 38th International Computer Software and Applications Conference Workshops (COMPSACW), pp. 295–300 (2014) Sowe, S.K., Kimata, T., Dong, M., Zettsu, K.: Managing heterogeneous sensor data on a big data platform: IoT services for data-intensive science. In: Proceeding of 2014 IEEE 38th International Computer Software and Applications Conference Workshops (COMPSACW), pp. 295–300 (2014)
15.
go back to reference Kashlev, A., Lu, S.: A system architecture for running big data workflows in the cloud. In: Proceeding of 2014 IEEE International Conference on Services Computing (SCC), pp. 51–58 (2014) Kashlev, A., Lu, S.: A system architecture for running big data workflows in the cloud. In: Proceeding of 2014 IEEE International Conference on Services Computing (SCC), pp. 51–58 (2014)
16.
go back to reference Fang, C., Yang, F., Zeng, X., Li, X.: BMF-BD: Bayesian model fusion on Bernoulli distribution for efficient yield estimation of integrated circuits. In: Proceeding of 2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC), pp. 1–6 (2014) Fang, C., Yang, F., Zeng, X., Li, X.: BMF-BD: Bayesian model fusion on Bernoulli distribution for efficient yield estimation of integrated circuits. In: Proceeding of 2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC), pp. 1–6 (2014)
17.
go back to reference Sagiroglu S., Sinanc, D.: Big datga: a review. In: Proceeding of 2013 International Conference on Collaboration Technologies and Systems (CTS), pp. 42–47 (2013) Sagiroglu S., Sinanc, D.: Big datga: a review. In: Proceeding of 2013 International Conference on Collaboration Technologies and Systems (CTS), pp. 42–47 (2013)
18.
go back to reference Katal, A., Wazid, M., Goudar, R.H.: Big data: issues, challenges, tools and good practices. In: Proceeding of 2013 Sixth International Conference on Contemporary Computing (IC3), pp. 404–409 (2013) Katal, A., Wazid, M., Goudar, R.H.: Big data: issues, challenges, tools and good practices. In: Proceeding of 2013 Sixth International Conference on Contemporary Computing (IC3), pp. 404–409 (2013)
19.
go back to reference Hansmann, T., Niemeyer, P.: Big data—characterizing an emerging research field using topic models. In: Proceeding of 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence(WI) aqnd Intelligent Agent Technologies (IAT), pp. 43–51 (2014) Hansmann, T., Niemeyer, P.: Big data—characterizing an emerging research field using topic models. In: Proceeding of 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence(WI) aqnd Intelligent Agent Technologies (IAT), pp. 43–51 (2014)
Metadata
Title
High-dimensionality priority selection scheme of bioinformatics information using Bernoulli distribution
Authors
Yoon-Su Jeong
Seung-Soo Shin
Kun-Hee Han
Publication date
27-08-2016
Publisher
Springer US
Published in
Cluster Computing / Issue 1/2017
Print ISSN: 1386-7857
Electronic ISSN: 1573-7543
DOI
https://doi.org/10.1007/s10586-016-0622-5

Other articles of this Issue 1/2017

Cluster Computing 1/2017 Go to the issue

Premium Partner