Elsevier

Biotechnology Advances

Volume 34, Issue 8, December 2016, Pages 1413-1426
Biotechnology Advances

Research review paper
Metagenomics: Probing pollutant fate in natural and engineered ecosystems

https://doi.org/10.1016/j.biotechadv.2016.10.006Get rights and content

Abstract

Polluted environments are a reservoir of microbial species able to degrade or to convert pollutants to harmless compounds. The proper management of microbial resources requires a comprehensive characterization of their genetic pool to assess the fate of contaminants and increase the efficiency of bioremediation processes. Metagenomics offers appropriate tools to describe microbial communities in their whole complexity without lab-based cultivation of individual strains. After a decade of use of metagenomics to study microbiomes, the scientific community has made significant progress in this field. In this review, we survey the main steps of metagenomics applied to environments contaminated with organic compounds or heavy metals. We emphasize technical solutions proposed to overcome encountered obstacles. We then compare two metagenomic approaches, i.e. library-based targeted metagenomics and direct sequencing of metagenomes. In the former, environmental DNA is cloned inside a host, and then clones of interest are selected based on (i) their expression of biodegradative functions or (ii) sequence homology with probes and primers designed from relevant, already known sequences. The highest score for the discovery of novel genes and degradation pathways has been achieved so far by functional screening of large clone libraries. On the other hand, direct sequencing of metagenomes without a cloning step has been more often applied to polluted environments for characterization of the taxonomic and functional composition of microbial communities and their dynamics. In this case, the analysis has focused on 16S rRNA genes and marker genes of biodegradation. Advances in next generation sequencing and in bioinformatic analysis of sequencing data have opened up new opportunities for assessing the potential of biodegradation by microbes, but annotation of collected genes is still hampered by a limited number of available reference sequences in databases. Although metagenomics is still facing technical and computational challenges, our review of the recent literature highlights its value as an aid to efficiently monitor the clean-up of contaminated environments and develop successful strategies to mitigate the impact of pollutants on ecosystems.

Introduction

Pollutants such as polychlorinated biphenyls (PCBs), aromatic hydrocarbons, pesticides and heavy metals are continuously released in the environment through industrial activities, their associated disposal operations and accidental spills. Many of them are recalcitrant to natural degradation and are subject to accumulation especially in soils and sediments. Their hazardous and toxic effects on different living organisms have been demonstrated in many studies (e.g. (Armstrong et al., 2004, Schwarzenbach et al., 2006, Siegrist and Joss, 2012).

Bioremediation is defined as the use of living organisms (mostly plants or microbes) or enzymes to decontaminate sites polluted by hazardous substances. When a polluted site is hardly accessible, like aquifers, this ecofriendly and sustainable approach might be the only option available. To be successful, physicochemical and microbial characteristics should be taken into consideration before applying bioaugmentation (release of specific strains in the polluted environment), biostimulation (stimulation of indigenous microflora by amendement of the environment with limiting factors) (Fonti et al., 2015) or before relying on natural attenuation (recovery of the environment without any intervention) (Adetutu et al., 2015, Bento et al., 2005, Meckenstock et al., 2015). The indigenous microbial communities in polluted environments have high tolerance towards pollutants. They use pollutants as carbon or nitrogen source and/or energy source and degrade them into simpler intermediates up to complete mineralization, i.e. release of carbon dioxide, methane, ammonia and similar final products. In other cases indigenous bacteria cannot use pollutants as growth substrate but they can degrade them (but not mineralize them) by cometabolism. In this case, bacteria use another substrate to sustain their growth (for review see Nzila (2013)). Cometabolism of pollutants was mainly reported for PAHs with more than five aromatic rings, chlorinated biphenyls, chlorinated monoaromatics and chlorinated aliphatics (Nzila, 2013).

Bacterial degradation pathways of many pollutants which belong to the families of PCBs and aromatic hydrocarbons have been described in studies based on pure strains degrading single pollutants (Borja et al., 2005, Fischer et al., 2010, Haritash and Kaushik, 2009, Olsen et al., 1994). However, in contaminated environments such as old sites of petroleum refineries, gas stations and rivers near industrial manufacturing plants, pollution consists usually of a mixture of several compounds. Complete remediation might therefore require synergistic interactions between different degrading bacteria (Chang et al., 2006, Dastgheib et al., 2011, Ren et al., 2014, Tzintzun-Camacho et al., 2012), bacteria and fungi (Jacques et al., 2008) or bacteria and plants (Saiyood et al., 2010). In many reports, authors describe the isolation of biodegradative consortia rather than single species especially when the pollutant(s) belong to hardly degradable compounds such as heavy PAHs, highly chlorinated biphenyls and pesticides (Boonchan et al., 2000, Dastgheib et al., 2011, Ellegaard-Jensen et al., 2014, Pino and Peñuela, 2011, Rowe et al., 2008). Moreover, the majority of environmental bacteria are recalcitrant to cultivation in the laboratory and the mystery of their biology is far from solved (Stewart, 2012). Hence, metagenomics is currently one of the most advanced methods to discover and describe inaccessible environmental microbes in their whole complexity, and to provide a comprehensive overview of the biodegradation potential of microbial communities in polluted environments (Bell et al., 2014, George et al., 2011, Gillan et al., 2015).

Metagenomics is defined as the study of environmental microbial communities using a suite of genomic tools to directly access their genetic content (Thomas et al., 2013), i.e. without prior cultivation of microbes in the laboratory. In metagenomics, the first step consists in extracting total DNA from the environment, which is often challenging in polluted soils and sediments. Indeed, high concentration of pollutants (e.g., metals, aromatic hydrocarbons) and low cell density are the main factors that hamper successful DNA recovery. Metagenomic analysis of recovered DNA is based on (i) genetic and/or functional screening of cloned DNA (referred to as a metagenomic library) or on (ii) large-scale sequencing of environmental metagenomes without pre-cloning (referred to as “direct sequencing” in this paper, or “shotgun metagenomics” by other authors). The term “metagenomics” is also liberally applied in the literature to a third approach: the thorough analysis of the diversity of specific genes – primarily marker genes such as the 16S rRNA gene (Wang and Qian, 2009) or conserved single-copy genes (Lang et al., 2013) – through PCR amplification and high throughput sequencing of the amplicons. In all cases, metagenomic DNA is then sequenced by one of the currently available high throughput sequencing platforms (e.g., 454/Roche, Illumina, SOLiD). Lastly, the huge amount of data (scale of Gbp) generated by sequencing projects is analyzed using a panoply of bioinformatic tools to predict the microbial diversity and/or the functional potential of the studied environment.

This review paper discusses the latest methodological advances in metagenomics applied to contaminated environments, and highlights how this approach has allowed the recent discovery of new biodegradation genes/pathways and microbial adaptation and synergistic interactions for pollutant degradation.

Section snippets

Conducting metagenomic studies in polluted environments: an updated overview of existing solutions to overcome inherent difficulties

Fig. 1 illustrates the main steps of a metagenomic project design for the comprehensive study of pollutant degradation in the environment.

Metagenomics reveals new biodegradation genes and pathways

Metagenomic investigations of polluted environments have led to the discovery of an increasing number of new catabolic enzyme-coding genes from the genetic pool of uncultured bacteria. In particular, functional screening of metagenomes from polluted ecosystems has expanded gene families (see Section 3.1) and highlighted the existence of unsuspected organizations of degradation pathways in microbial communities (see Section 3.2). Moreover, heterologous expression of desired traits and direct

Metagenomics reveals microbial response to disturbance

The comprehensive characterization of taxonomic microbial community composition is now a routine analysis in many labs thanks to the application of NGS technology and user-friendly bioinformatic analysis pipelines (MOTHUR, QIIME). Therefore, microbial diversity assessment is often the first step in any microbial ecology investigation, including in polluted environments. However, 16S profiling of microbial communities does not provide direct evidence of their functional properties. Therefore,

Metagenomics reveals the role of microorganisms in pollution abatement: case studies

The number of metagenomic projects based on massive sequencing of DNA from polluted environments is on the rise (see for example on-going projects at the Joint Genome Institute (http://img.jgi.doe.gov/)). Some of them address specifically the question of metabolic interactions within consortia used in bioremediation. Table 1 gives a quick overview of recent metagenomic projects that have brought to light the involvement of microbes in pollutant removal.

Conclusions and perspectives

Over the last 10 years, the emergence of metagenomics has increased considerably the accuracy with which we understand microbial degradation of pollutants in a wide range of environments. It gives direct access to the total degradative potential of microbial communities and provides insights about the structure of such communities (taxonomic composition), their function (catabolic properties), and their interactions independently of their cultivability. Metagenomic analyses are becoming more and

Acknowledgments

This work was supported by METAEXPLORE, funded by the European Union (grant agreement no. 222625, FP7 Programme). We thank anonymous reviewers for numerous suggestions to improve this manuscript.

References (177)

  • V. Fonti et al.

    Bioremediation of contaminated marine sediments can enhance metal mobility due to changes of bacterial diversity

    Water Res.

    (2015)
  • I.F. George et al.

    Metagenomics for bioremediation

  • D.T. Gibson et al.

    Aromatic hydrocarbon dioxygenases in environmental biotechnology

    Curr. Opin. Biotechnol.

    (2000)
  • L.M. Gieg et al.

    Syntrophic biodegradation of hydrocarbon contaminants

    Curr. Opin. Biotechnol.

    (2014)
  • A.K. Haritash et al.

    Biodegradation aspects of polycyclic aromatic hydrocarbons (PAHs): a review

    J. Hazard. Mater.

    (2009)
  • R.J.S. Jacques et al.

    Microbial consortium bioaugmentation of a polycyclic aromatic hydrocarbons contaminated soil

    Bioresour. Technol.

    (2008)
  • S. Jacquiod et al.

    Characterization of new bacterial catabolic genes and mobile genetic elements by high throughput genetic screening of a soil metagenomic library

    J. Biotechnol.

    (2014)
  • K.S. Kakirde et al.

    Size does matter: application-driven approaches for soil metagenomics

    Soil Biol. Biochem.

    (2010)
  • A. Kouzuma et al.

    Molecular approaches for the analysis of natural attenuation and bioremediation

  • M. Kube et al.

    A catabolic gene cluster for anaerobic benzoate degradation in methanotrophic microbial Black Sea mats

    Syst. Appl. Microbiol.

    (2005)
  • K. Longnecker et al.

    Environmental metabolomics: databases and tools for data analysis

    Mar. Chem.

    (2015)
  • C.B. Abulencia et al.

    Environmental whole-genome amplification to access microbial populations in contaminated sediments

    Appl. Environ. Microbiol.

    (2006)
  • Afiahayati et al.

    MetaVelvet-SL: an extension of the Velvet assembler to a de novo metagenomic assembler utilizing supervised learning

    DNA Res.

    (2015)
  • M. Albertsen et al.

    Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes

    Nat. Biotechnol.

    (2013)
  • J. Alneberg et al.

    Binning metagenomic contigs by coverage and composition

    Nat. Methods

    (2014)
  • B. Armstrong et al.

    Lung cancer risk after exposure to polycyclic aromatic hydrocarbons: a review and meta-analysis

    Environ. Health Perspect.

    (2004)
  • R.K. Aziz et al.

    SEED servers: high-performance access to the SEED genomes, annotations, and metabolic models

    PLoS One

    (2012)
  • S. Bag et al.

    An improved method for high quality metagenomics DNA extraction from human and environmental samples

    Sci. Rep.

    (2016)
  • R. Bargiela et al.

    Bacterial population and biodegradation potential in chronically crude oil-contaminated marine sediments are strongly linked to temperature

    Sci. Rep.

    (2015)
  • M. Batıoğlu-Pazarbaşı et al.

    Centimetre-scale vertical variability of phenoxy acid herbicide mineralization potential in aquifer sediment relates to the abundance of tfdA genes

    FEMS Microbiol. Ecol.

    (2012)
  • J.M. Becker et al.

    Bacterial activity, community structure, and centimeter-scale spatial heterogeneity in contaminated soil

    Microb. Ecol.

    (2006)
  • T. Bell et al.

    Metagenomics potential for bioremediation

  • E. Boon et al.

    Interactions in the microbiome: communities of organisms and communities of genes

    FEMS Microbiol. Rev.

    (2014)
  • S. Boonchan et al.

    Degradation and mineralization of high-molecular-weight polycyclic aromatic hydrocarbons by defined fungal-bacterial cocultures

    Appl. Environ. Microbiol.

    (2000)
  • N. Bougon et al.

    Influence of depth and time on diversity of free-living microbial community in the variably saturated zone of a granitic aquifer

    FEMS Microbiol. Ecol.

    (2012)
  • A. Brady et al.

    Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models

    Nat. Methods

    (2009)
  • V.L. Brisson et al.

    Metagenomic analysis of a stable trichloroethene-degrading microbial community

    ISME J.

    (2012)
  • J.P. Brooks et al.

    The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies

    BMC Microbiol.

    (2015)
  • J.G. Caporaso et al.

    QIIME allows analysis of high-throughput community sequencing data

    Nat. Methods

    (2010)
  • W. Chang et al.

    Polycyclic aromatic hydrocarbon (PAH) degradation coupled to methanogenesis

    Biotechnol. Lett.

    (2006)
  • A. Chemerys et al.

    Characterization of novel polycyclic aromatic hydrocarbon dioxygenases from the bacterial metagenomic DNA of a contaminated soil

    Appl. Environ. Microbiol.

    (2014)
  • J. Chen et al.

    Heavy metal pollution decreases microbial abundance, diversity and activity within particle-size fractions of a paddy soil

    FEMS Microbiol. Ecol.

    (2014)
  • J.R. Cole et al.

    Ribosomal Database Project: data and tools for high throughput rRNA analysis

    Nucleic Acids Res.

    (2014)
  • P.S. Costa et al.

    Metagenome of a microbial community inhabiting a metal-rich tropical stream sediment

    PLoS One

    (2015)
  • J.W. Craig et al.

    Expanding small-molecule functional metagenomics through parallel screening of broad-host-range cosmid environmental DNA libraries in diverse proteobacteria

    Appl. Environ. Microbiol.

    (2010)
  • S. Dastgheib et al.

    Biodegradation of polycyclic aromatic hydrocarbons by a halophilic microbial consortium

    Appl. Microbiol. Biotechnol.

    (2011)
  • C. De Filippo et al.

    Bioinformatic approaches for functional annotation and pathway inference in metagenomics data

    Brief. Bioinform.

    (2012)
  • S.P. de Vasconcellos et al.

    Screening for hydrocarbon biodegraders in a metagenomic clone library derived from Brazilian petroleum reservoirs

    Organ Geochem.

    (2010)
  • A. Dechesne et al.

    Fine scale spatial variability of microbial pesticide degradation in soil: scales, controlling factors, and implications

    Front. Microbiol.

    (2014)
  • T.O. Delmont et al.

    Accessing the soil metagenome for studies of microbial diversity

    Appl. Environ. Microbiol.

    (2011)
  • Cited by (34)

    • Microbial metabolism of aromatic pollutants: High-throughput OMICS and metabolic engineering for efficient bioremediation

      2022, Current Developments in Biotechnology and Bioengineering: Designer Microbial Cell Factories: Metabolic Engineering and Applications
    • From exploration to remediation: A microbial perspective for innovation in mining

      2021, Earth-Science Reviews
      Citation Excerpt :

      Typically, metagenomics takes advantage of two distinct sequencing techniques: direct (shotgun) sequencing and indirect (library-based) metagenomics. These techniques have been extensively reviewed elsewhere, along with the different sequencing technologies and associated costs, which are increasingly affordable (Bouhajja et al., 2016; Kakirde et al., 2010). Briefly, for direct sequencing, DNA is essentially randomly fragmented before being sequenced and does not include a pre-cloning stage of genetic material, therefore, removes this as a basis.

    • Metagenomic analysis of soil microbial community under PFOA and PFOS stress

      2020, Environmental Research
      Citation Excerpt :

      Microbial functions and metabolism pathways are subjected to contribute to it. In general, Metagenomics approach can reveal the microbial community's response to disturbance, which means that it can provide diversification on the structure of microbial communities, their function and the interaction when occurring to external interference without cultivation of microbes, and currently microbial communities in contaminated environment would have less metabolic and phylogenetic diversity (Bouhajja et al., 2016). These methods have been applied for a number of metals and POPs pollution on the microbial community.

    • Metagenomic analysis exploring microbial assemblages and functional genes potentially involved in di (2-ethylhexyl) phthalate degradation in soil

      2020, Science of the Total Environment
      Citation Excerpt :

      Thus, to gain an overview of microbial processes involved in DEHP degradation in soil, soils of different properties and/or under different conditions should be investigated using a more comprehensive approach. Thanks to the rapid development of molecular “-omics” techniques, such as metagenomics and metatranscriptomics which target total environmental DNA and RNA, respectively, it is now possible to investigate a broad range of genes/enzymes putatively involved in pollutant degradation pathways in complex environmental matrices and link degradation potential with microbial taxa through taxonomic analysis of functional genes/enzymes (Bouhajja et al., 2016). This approach has been successfully applied to investigate in situ polycyclic aromatic hydrocarbons (PAHs)-degrading microorganisms in soil (de Menezes et al., 2012; Zafra et al., 2016).

    • Genomics in understanding bioremediation of inorganic pollutants

      2020, Handbook of Bioremediation: Physiological, Molecular and Biotechnological Interventions
    View all citing articles on Scopus
    View full text