
2019 | OriginalPaper | Book chapter

Icing: Large-Scale Inference of Immunoglobulin Clonotypes

Written by: Federico Tomasi, Margherita Squillario, Alessandro Verri, Davide Bagnara, Annalisa Barla

Published in: Computational Intelligence Methods for Bioinformatics and Biostatistics

Publisher: Springer International Publishing


Abstract

Immunoglobulin (IG) clonotype identification is a fundamental open question in modern immunology. An accurate description of the IG repertoire is crucial for understanding the variety within the immune system of an individual, and may shed light on the pathogenetic process. Intrinsic IG heterogeneity makes clonotype inference an extremely challenging task, from both a computational and a biological point of view. Here we present icing, a framework that reconstructs clonal families even when sequences are highly mutated. icing has a modular structure and is designed for large datasets produced by next-generation sequencing (NGS), a technology that enables the characterisation of large-scale IG repertoires. We extensively validated the framework by computing clustering performance metrics on its results in a simulated case. icing is implemented in Python and is publicly available under the FreeBSD licence at https://github.com/slipguru/icing.
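The abstract does not say which clustering performance metrics were used for the validation. As an illustration only, the sketch below computes one common pair-counting metric, the Fowlkes-Mallows index, between a known (simulated) clonal assignment and an inferred one. The function name and the toy labels are hypothetical, and this is not taken from the icing code base.

```python
from itertools import combinations
from math import sqrt


def fowlkes_mallows(true_labels, predicted_labels):
    """Pair-counting Fowlkes-Mallows index between two flat clusterings.

    Illustrative helper (an assumption, not part of icing): each label list
    assigns one cluster id per sequence, in the same order.
    """
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")

    tp = fp = fn = 0
    # Compare every unordered pair of sequences once.
    for i, j in combinations(range(len(true_labels)), 2):
        same_true = true_labels[i] == true_labels[j]
        same_pred = predicted_labels[i] == predicted_labels[j]
        if same_true and same_pred:
            tp += 1  # pair grouped together in both clusterings
        elif same_pred:
            fp += 1  # grouped together only in the prediction
        elif same_true:
            fn += 1  # grouped together only in the ground truth

    if tp == 0:
        return 0.0
    # Geometric mean of pairwise precision and pairwise recall.
    return tp / sqrt((tp + fp) * (tp + fn))


# Toy example: six sequences, three simulated clones, one sequence misassigned.
simulated = [0, 0, 0, 1, 1, 2]
inferred = [0, 0, 1, 1, 1, 2]
score = fowlkes_mallows(simulated, inferred)
```

A score of 1 means the two clusterings agree on every pair, regardless of how the cluster ids are named. The loop is quadratic in the number of sequences, so for large repertoires a contingency-table implementation would be the more practical choice.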


Footnotes
1
This is not representative of the amount of computational resources required by the method.
 
Metadata
Title
Icing: Large-Scale Inference of Immunoglobulin Clonotypes
Written by
Federico Tomasi
Margherita Squillario
Alessandro Verri
Davide Bagnara
Annalisa Barla
Copyright year
2019
DOI
https://doi.org/10.1007/978-3-030-14160-8_5