2015 | OriginalPaper | Chapter

State-of-the-Art and Future Challenges in the Integration of Biobank Catalogues

Authors: Heimo Müller, Robert Reihs, Kurt Zatloukal, Fleur Jeanquartier, Roxana Merino-Martinez, David van Enckevort, Morris A. Swertz, Andreas Holzinger

Published in: Smart Health

Publisher: Springer International Publishing

Abstract

Biobanks are essential for the realization of P4-medicine and hence indispensable for smart health. One of the grand challenges in biobank research is to close the research cycle in such a way that all the data generated by one research study can be consistently associated with the original samples, so that data and knowledge can be reused in other studies. A catalogue must provide the information hub connecting all relevant information sources. The key knowledge embedded in a biobank catalogue is the availability and quality of samples suitable for performing a research project. Depending on the study type, the samples can reflect a healthy reference population, a cross-sectional representation of a certain group of people (healthy or with various diseases), or a certain disease type or stage. To overview and compare collections from different catalogues, we introduce visual analytics techniques, especially glyph-based visualization techniques, which were successfully applied for knowledge discovery in single biobank catalogues. In this paper, we describe the state of the art in the integration of biobank catalogues, addressing the challenge of combining heterogeneous data sources in a unified and meaningful way and consequently enabling the discovery and visualization of data from different sources. Finally, we present open questions in both data integration and visualization of unified catalogues, and we propose future research in data integration with a linked data approach and in the fusion of multi-level glyph and network visualization.

Metadata
Copyright Year: 2015
DOI: https://doi.org/10.1007/978-3-319-16226-3_11
