Top

Hydrogeology Journal

Published in:

Open Access 09-02-2022 | Paper

Revealing vertical aquifer heterogeneity and hydraulic anisotropy by pumping partially penetrating wells

Authors: Ruth Maier, Carsten Leven, Emilio Sánchez-León, Daniel Strasser, Maximilian Stoll, Olaf A. Cirpka

Published in: Hydrogeology Journal | Issue 2/2022

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Patentsearch

Off

Abstract

The stratification of sedimentary aquifers introduces spatial variability in hydraulic conductivity, primarily between individual horizontal layers. On larger scales, the vertical heterogeneity enhances hydraulic anisotropy, with the horizontal conductivity typically exceeding the vertical one. In this study, the hydraulic anisotropy of a stratified aquifer is estimated from data of hydraulic tests in which water is sequentially extracted from well sections screened at different depths, and the hydraulic response is measured at various multilevel observation wells. The applicability of the method is demonstrated by field tests in a fluvial gravel aquifer in the Upper Rhine Valley, Germany. A homogeneous anisotropic model, and models with three and five anisotropic layers, are fitted to the measured drawdowns in the steady-shape regime, in which differences in hydraulic head between observation locations do not change over time even though the head values themselves change. The position of the five horizontal layers is based on the lithology of the drilling profile at the pumping-well location. The three-layer model is achieved by merging insensitive or similar layers with sensitive layers. The fits result in estimates of the radial and vertical hydraulic conductivities for all layers of the respective models, which are upscaled to effective parameters over the entire depth in the case of the multilayer models. The homogeneous model shows significantly higher errors than those of the heterogeneous models. The heterogeneous locally anisotropic models not only reveal vertical variability of hydraulic conductivity, but also lead to a three-times larger anisotropy ratio upon upscaling.

ESM 1 (PDF 2121 kb)

Supplementary Information

The online version contains supplementary material available at https://doi.org/10.1007/s10040-022-02458-9.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Introduction

Resolving the main properties of the spatially variable hydraulic conductivity tensor K is a key issue in groundwater modeling. When setting up a groundwater flow model, a model needs to be selected that considers appropriate initial and boundary conditions but also constitutes a suitable representation of the main geological features of the investigated aquifer. In fluvial gravel aquifers, this implies considering bedded subsurface features defined by the deposition of sediments of different size, geometry and sorting (Borghi et al. 2015, Bennett et al. 2019). Since the stratification of sediments affects the spatial distribution of hydraulic properties (Koltermann and Gorelick 1996; Heinz and Aigner 2003; Heinz et al. 2003), a major concern when conceptualizing the conductivity distribution in a groundwater model of a fluvial aquifer is to identify and characterize an appropriate number of layers with different properties. The differences between individual strata also cause the formation-averaged hydraulic conductivity tensor K to be anisotropic because groundwater preferably flows in the direction of the layers rather than perpendicular to it (Bear 1972; Borghi et al. 2015). This implies that the principal directions of the effective hydraulic conductivity tensor K^eff are typically the horizontal and vertical directions which are aligned with the strata. In this study, the ratio of horizontal to vertical conductivity, when averaging over the horizontal layers, is denoted the anisotropy ratio ϑ. The hydraulic anisotropy is of importance whenever the vertical flow component is significant, for instance in flow close to partially penetrating wells, horizontal collector wells, or around objects that partially penetrate aquifers, or when considering river–groundwater exchange. While regional flow is predominantly horizontal, these specific boundary conditions induce a vertical-flow component that can be crucial in the overall design of groundwater management measures.

Many hydrogeological applications such as the design of remediation systems, depend not only on precise information on subsurface heterogeneity (e.g. Cardiff and Barrash 2011; Zschornack et al. 2013), but also require information on hydraulic anisotropy (e.g. Bair and Lahm 1996; Zlotnik and Ledder 1996). A specific example in which hydraulic anisotropy is relevant is the delineation of capture zones of partially penetrating wells (Bair and Lahm 1996). Several experimental methods have been developed for resolving the spatial variability of hydraulic conductivity at different scales and degrees of resolution. Hydraulic tomography, for example, is a common method that helps to obtain larger-scale (>10 m) three-dimensional (3D) information on hydraulic-conductivity variations (Gottlieb and Dietrich 1995; Yeh and Liu 2000; Bohling 2009; Hochstetler et al. 2016; Sanchez-Leon et al. 2016). Direct-push methods such as direct-push injection logging (Bohling et al. 2002; Butler et al. 2007; Dietrich et al. 2008; Lessoff et al. 2010) or the direct-push permeameter (Butler et al. 2007; Chen et al. 2008, 2010; Klammler et al. 2011; Zschornack et al. 2013) resolve apparent horizontal hydraulic conductivity with depth and are thus well suited to investigate local hydraulic-conductivity variations in the vertical direction at high resolution.

To resolve the ratio of horizontal to vertical hydraulic conductivity, different field methods have been investigated. Klammler et al. (2017) proposed a shape factor to estimate the bulk hydraulic anisotropy from measurements obtained with the direct-push permeameter but without considering the vertical variability of hydraulic conductivity. The tomographic slug test proposed and tested by Paradis et al. (2015, 2016) in a littoral aquifer seems to be more appropriate for resolving hydraulic anisotropy induced by heterogeneities at smaller scales, also in the horizontal direction, e.g., from cross-bedding. A specific limitation of the tomographic slug test, however, is the very small range of investigation in the horizontal direction (typically <10 m), especially in highly permeable aquifers (Paradis et al. 2015). Even though different studies have dealt with the investigation of anisotropic conductivity, a suitable field method for estimating the ratio of horizontal to vertical hydraulic conductivity on larger scales in fluvial gravel aquifers has not yet been tested. The aim of this study is to examine the viability and benefits of a method for resolving hydraulic anisotropy on larger scales induced by the vertical heterogeneity on smaller scales.

The present study builds upon a method introduced by Maier et al. (2020) to estimate hydraulic anisotropy by inverting steady-shape aquifer tests using a partially penetrating pumping well. The approach follows the basic principles of hydraulic tomography. That is, a series of pumping tests is performed, in which groundwater from different intervals of a single pumping well are sequentially extracted, and the hydraulic response is observed in surrounding observation wells, placed at different distances and depths. In contrast to the steady-state pumping regime, the absolute drawdowns are still changing in the steady-shape pumping regime, but the hydraulic-head differences between observation locations remain constant (Bohling et al. 2002, Bohling et al. 2007).

In this work, the method described by Maier et al. (2020) is modified and applied to a fluvial gravel aquifer located in the Upper Rhine Valley at the Germany-France border. Specifically, the question of how a homogeneous anisotropic groundwater flow model performs in comparison to models with several anisotropic layers is addressed, as well as how these layers should be defined.

While Maier et al. (2020) described the application of steady-shape aquifer tests to jointly optimize modeling and measurement strategies with a synthetic scenario, the present study considers a field application, including the design of the experiments.

This paper starts with a brief repetition of the underlying theory, followed by a description of the field application. Then the numerical models are described, and the principles used in model calibration are outlined. After presenting the site-specific results, the paper finishes with discussing the main findings and giving general recommendations.

Field application

Field site

Hydrogeological setting

The field site is located in the Upper Rhine Valley, north of the municipality Kappel-Grafenhausen, Baden-Württemberg, southwest Germany (Fig. 1). The study site is surrounded by an artificial flow channel in the west and by River Elz in the east. The general groundwater flow direction is from the southeast to the northwest with a hydraulic gradient of 0.14%. The fluvial unconsolidated aquifer consists of Quaternary sediments of the Neuenburg-Formation (qN), and is characterized by fluviatile gravel deposits with varying amounts of sand and small amounts of silt (LGRB 2004; Wirsing and Luz 2007). The sequence of strata varies locally. The saturated aquifer of ~41 m thickness is overlain by a ~2-m-thick layer of alluvial fines and bounded by considerably less conductive Quaternary sediments from the Breisgau-Formation (qBS) at the aquifer base (LGRB 2004; Wirsing and Luz 2007). Results from a grain size analysis of an exploration drilling at the field site indicate hydraulic-conductivity values of the unconfined aquifer in the range of 6.7 × 10⁻⁵ to 2.6 × 10⁻¹ m/s.

Monitoring network

Figure 2 shows a schematic representation of the field installation at the test site. The large-diameter pumping well (denoted R01, well-screen radius r_w = 0.4 m) reaches to a depth of ~21 m within the aquifer. The well was designed with three separate screen sections (I, II and III) of 2-m length each and with a spacing of 4.5 m in between, centered at elevations of 37.32 m (upper screen), 30.82 m (middle screen) and 24.32 m (lower screen) from the aquifer base. Note that all z-coordinates are given in reference to the aquifer bottom. The well was installed with a prepacked filter along the filter-screen sections and completed with coarse gravel to a 0.4-m-thick filter pack, extending by 0.5 m above and below the individual screen sections. Each filter-pack section is connected above and below to a clay fill through a 0.5-m-thick secondary filter layer.

Three bundles of nested observation wells are placed each in the north, east, south, and west direction of the pumping well R01, at radial distances of 3.5, 6.5 and 10.5 m, respectively (Fig. 2). Two additional bundles of nested observation wells are placed at a radial distance of 21 m, one to the east and one to the west of well R01.

Each bundle of nested observation wells positioned north, south and west of R01 comprises three 1-in (2.54-cm) piezometers, each having a 0.3-m-long screen at its bottom. Two of the three piezometers are placed at depths between the elevation of the pumping-well screens I and II, and the third is at a depth between the elevation of the pumping-well screens II and III (Fig. 2). All observation wells to the east of R01 are continuous multichannel tubing wells (CMT wells) with seven individual channels with a diameter of ~10 mm. Their depths are above, at, and below the elevation of the pumping-well screens I, II and III. Starting from the top to the bottom, the screen openings are enumerated from 1 to 7 (Fig. 2). While screen openings 1 to 6 are 0.4 m long, the lowermost screen opening 7 has a length of 0.15 m. In total, the monitoring network consists of 58 observation points. Table S1 in the electronic supplementary material (ESM) summarizes details of the monitoring network.

Hydraulic tests

In three series of short-term pumping tests, water was successively extracted from the screen sections I, II, and III of pumping well R01 with a frequency-controlled submersible pump. To prevent water inflow from the adjacent screen sections, a customized straddle-packer system was introduced into the well and placed above and below the active screen section.

For each pumping depth, multiple pumping tests p_t with different pumping rates Q(p_t) were performed. All experiments belonging to the same extraction interval constitute a specific hydraulic test y. Details on the number of pumping tests belonging to the individual hydraulic tests and the applied pumping rates are listed in Table 1.

Table 1

Key data of the pumping test series

Parameter	I Upper screen	II Middle screen	III Bottom screen
Hydraulic test, y	1	2	3
Number of pumping tests, p_t	5	10	7
Reduced number of pumping tests, p_r	4	7	4
Time to reach steady-shape behavior, t_trim [s]	1,800	1,680	1,700
Range of Q(p_t) [L/s]	10.0–10.5	5.5–10.9	17.8–19.0

The transient drawdown response of each pumping test was automatically measured in 44 observation points using fiber-optic pressure transducers and different types of piezoresistive data loggers of similar resolution, whereas the drawdown at the remaining 14 observation points were manually measured. Due to technical problems with one data logger the total number of active observation points in each hydraulic test was reduced to 57.

In order to monitor the stability of the pumping rate, flowmeter readings were regularly taken and the drawdown within the pumping well was observed using a pressure transducer. The latter in-well drawdown measurements are not considered in the model calibration because these measurements are affected by pump-induced pressure variations and well-skin effects.

Data processing

The pumping tests resulted in a total of 1,334 drawdown curves, from which defective datasets caused by strong instabilities of pressure transducers or the pumping rate were eliminated. Section S2 of the ESM shows an exemplary dataset to illustrate the criterion for eliminating datasets and the state on the assessment of pumping rate stability. In addition, each remaining curve was corrected to consider barometric pressure variations. Considering that drawdown is linearly proportional to the pumping rate, the drawdown curves from pumping tests with different extraction rates were scaled to those of a harmonized rate Q_h of 0.01 m³/s. The scaling is based on the mean discharge, observed during the pumping phase up to timepoint t_trim (Table 1). These scaled curves were used to compare the multiple hydraulic responses observed at each observation point and for each hydraulic test, and to check the data reproducibility. Pumping tests in which the drawdown curves predominantly failed the reproducibility test were discarded, reducing the total number of pumping tests from p_t to p_r of each hydraulic test (Table 1).

To avoid the uncertainty associated with the transient behavior of the hydraulic responses, and the intensive computational requirements of simulating transient groundwater flow, the drawdown curves were analyzed in the steady-shape regime, in which the absolute value of drawdown still changes but the hydraulic-head differences between measurement locations remains constant (Bohling et al. 2002, 2007). In the reduced set of pumping tests p_r, the steady-shape regime was defined by identifying the time (t_trim) at which changes in drawdown differences between observation locations could be neglected (see Table 1 and section S3 in the ESM). Then the drawdown s_obs at timepoint t_trim were averaged among all pumping tests p_r available in hydraulic test y. By that, a single averaged drawdown measurement s_meas for each observation point k in each hydraulic test y was obtained:

$$ {s}_{\mathrm{meas}}\left(y,k\right)=\frac{1}{p_{\mathrm{r}}}\sum \limits_{i=1}^{p_{\mathrm{r}}}{s}_{\mathrm{obs},i}\left(y,k\right) $$

(1)

together with an associated standard deviation σ_repr as a metric of the reproducibility of each measurement:

$$ {\sigma}_{\mathrm{r}\mathrm{epr}}\left(y,k\right)=\sqrt{\frac{1}{p_{\mathrm{r}}-1}\sum \limits_{i=1}^{p_{\mathrm{r}}}{\left({s}_{\mathrm{obs},i}\left(y,k\right)-{s}_{\mathrm{meas}}\left(y,k\right)\right)}^2} $$

(2)

σ_repr denotes the reproducibility error, which contributes to the overall error of the measurements but does not include systematic errors in the data (e.g., due to the misplacement of observation points) or in the conceptual model (e.g., due to disregarding horizontal heterogeneity). To quantify the variability among the measurement points, the observation points that are arranged in different directions to the pumping well, but coincide by ±0.6 and ±0.5 m in their r- and z-coordinates, respectively, were clustered and the associated drawdown measurements s_meas compared. These cluster ranges are realistic, since intended measurement locations of observation points may be misplaced in the installation of observation wells (Maier et al. 2020). The mean value $ {\mu}_{c_{\mathrm{p}}} $ and standard deviation $ {\sigma}_{c_{\mathrm{p}}} $ of all measurements n_cp available in each cluster c_p are computed by:

$$ {\mu}_{c_{\mathrm{p}}}(y)=\frac{1}{n_{\mathrm{cp}}\left({c}_{\mathrm{p}}\right)}\sum \limits_{i=1}^{n_{\mathrm{cp}}\left({c}_{\mathrm{p}}\right)}{s}_{\mathrm{meas},i}\left(y,{c}_{\mathrm{p}}\right) $$

(3)

$$ {\sigma}_{c_{\mathrm{p}}}(y)=\sqrt{\frac{1}{n_{\mathrm{cp}}\left({c}_{\mathrm{p}}\right)-1}\sum \limits_{i=1}^{n_{\mathrm{cp}}\left({c}_{\mathrm{p}}\right)}{\left({s}_{\mathrm{meas},i}\left(y,{c}_{\mathrm{p}}\right)-{\mu}_{c_{\mathrm{p}}}(y)\right)}^2} $$

(4)

Note that the data clustering was performed only to illustrate the data in a comprehensive way and to estimate the variability of drawdown measurements among different but close-by observation points. The model fit utilizes the nonclustered data. The error model applied in the fitting procedure is discussed in the following.

Model setups

Governing equations

The starting point is the radial-symmetric groundwater-flow equation, describing the drawdown induced by water extraction with a partially penetrating well:

$$ \frac{1}{r}\frac{\partial }{\partial r}\left({K}_{\mathrm{r}}r\frac{\partial s}{\partial r}\right)+\frac{\partial }{\partial z}\left({K}_{\mathrm{z}}\frac{\partial s}{\partial z}\right)={S}_0\frac{\partial s}{\partial t} $$

(5)

in which r [L] and z [L] are the radial and vertical coordinates, K_r [LT⁻¹] and K_z [LT⁻¹] are the radial and vertical hydraulic conductivities, s [L] is the drawdown, i.e., the change in hydraulic head induced upon pumping, t [T] denotes time, and S₀ [L⁻¹] is the specific storage. In this study, K_r(z) and K_z(z) are assumed to vary only in the vertical direction. At the outer radius R [L] of the model domain (much further away from the pumping well than any observation well) zero drawdown is assumed, and there are no-flow boundaries at the top z_top [L] and bottom z_bot[L]:

$$ s\left(R,z\right)=0\forall z $$

(6)

$$ {\left.\frac{\partial s}{\partial z}\right|}_{z={z}_{\mathrm{bot}}}={\left.\frac{\partial s}{\partial z}\right|}_{z={z}_{\mathrm{top}}}=0\forall r $$

(7)

The considered extraction well has three separated well screens to extract groundwater from three different and isolated depth intervals. The pumping well has the radius r_w [L], and each well screen i_scr has the same length b [L], but is centered at a different vertical position z_c [L]. Along the well screen considered for pumping, the hydraulic head is constant, and the total extraction flux Q [L³T⁻¹] for the well screen is constant during the hydraulic test. The model assumes no-flow boundary conditions at all depths below and above the respective active screen section, so that at r = r_w the following boundary conditions hold:

$$ {\displaystyle \begin{array}{cc}\kern-0.32em \left.\begin{array}{c}s=\mathrm{constant},\mathrm{and}\\ {}-{\int}_{z_{\mathrm{c}}\left({i}_{\mathrm{scr}}\right)-\frac{b}{2}}^{z_{\mathrm{c}}\left({i}_{\mathrm{scr}}\right)+\frac{b}{2}}2\pi r{K}_{\mathrm{r}}\frac{\partial s}{\partial r} dz=\mathrm{Q}\end{array}\right\}& \mathrm{if}\kern0.5em {z}_{\mathrm{c}}\left({i}_{\mathrm{scr}}\right)-\frac{b}{2}\le z\le {z}_{\mathrm{c}}\left({i}_{\mathrm{scr}}\right)+\frac{b}{2}\\ {}\frac{\partial s}{\partial r}=0& \mathrm{otherwise}\end{array}} $$

(8)

A transient model would require initial conditions and a value of the specific storage, but the constant-shape regime of the pumping tests, in which the spatial profile of drawdown in the domain of interest changes only by a time-dependent constant in space, does not require this information. In this regime, drawdown differences between observation points can be simulated by the steady-state drawdown equation, that is, Eq. 5 with a right-hand side of zero.

The hydraulic tests were simulated with a finite-element model implemented in MATLAB (codes are available through the data portal at University of Tübingen 2022) which solves the axisymmetric steady-state groundwater flow equation on rectangular elements. The radial grid spacing increases logarithmically, whereas the vertical resolution is uniform with a grid spacing of 0.17 m. To obtain simulation results at observation points that did not fall onto nodes of the grid, bilinear interpolation consistent with the finite element formulation was applied.

Conceptual models

In total, three different groundwater-flow models were set up. The first model considers a single, homogeneous layer (1-layer model), whereas the second one contains five horizontal layers (5-layer model). In the third model, the number of layers was set to three (3-layer model). All models account for hydraulic anisotropy in each layer, that is, each layer has an individual radial and vertical hydraulic conductivity K_r and K_z, respectively. In all models the aquifer is treated as confined, which is a qualified assumption due to the short test durations and the small drawdowns in comparison to the total thickness of the aquifer.

Figure 3 shows the basic model setup. All models contain two separate isotropic units that represent the gravel pack (red units in Fig. 3, hydraulic conductivity K_gp = 10⁻²m/s) and clay fill (gray units in Fig. 3, hydraulic conductivity, K_cl = 10⁻⁷m/s) of the filter pack installed around the pumping well. Fitting the conductivity values of these units was tested, but the data were not sensitive to K_gp and K_cl, so reasonable fixed values were chosen.

The black-dashed horizontal lines in Fig. 3 indicate the layer boundaries considered in the five-layer model. The actual location of the layer boundaries in the 5-layer model is based on the existing field description of the drill core taken at the location of pumping well R01 (Figs. 3 and 4a). Sand layers were delineated when the sand fraction clearly exceeded that of the gravel over more than half a meter in thickness (Fig. 4b). The layers are numbered from the aquifer top to the bottom. In the 3-layer model, layer 2 was merged to layer 3, while layer 4 was merged to layer 5 of the 5-layer model. The reasoning for these choices is discussed in the following section.

Model calibration

All three models were independently calibrated by the Trust-Region Reflective Least-Squares method of the function lsqnonlin in the optimization toolbox of MATLAB (Coleman and Li 1996). To reduce the large data volume in model calibration, the averaged drawdown measurements s_meas from all three hydraulic tests were jointly considered in the calibration, leading to n_meas = 3 × 57 = 171 drawdown observations.

As mentioned before, a steady-shape pumping regime was considered in the simulations, in which drawdown differences between observation locations remain constant. Typically, this requires the specification of pairs of observation points by either setting one observation location as the superordinate reference point (Maier et al. 2020) or by considering all feasible pairs of observation points (Bohling et al. 2002). Each field measurement, however, is subject to measurement errors of different types, including measurement noise or the misplacement of observation wells (Maier et al. 2020). In trials not reported here, the effect of considering different observation points as the reference point had been tested, yielding different model-calibration results due to measurement error. To avoid the propagation of uncertainties in the generation of pairs of observation points, the model calibration includes a virtual reference point. That is, for each hydraulic test, the simulated drawdown difference s_sim = |s_t − s_ref| contained the simulated steady-state drawdown s_t and the drawdown at a virtual reference point s_ref, which is identical among all measurement points but needs to be estimated together with the hydraulic-conductivity values.

Then the differences between the simulated and measured drawdowns s_sim and s_meas were computed and normalized by the error σ_i of each measurement i, which is defined by an error model discussed below.

In the calibration, the objective function φ to be minimized is defined as the sum of squared normalized residuals:

$$ \varphi =\sum \limits_{i=1}^{n_{\mathrm{meas}}}{\left(\frac{s_{\mathrm{sim},i}\left(\mathbf{p}\right)-{s}_{\mathrm{meas},i}}{\sigma_i}\right)}^2 $$

(9)

in which p is the parameter vector including the logarithms of K_r and K_z of all horizontal layers considered and the reference drawdown s_ref for each of the three hydraulic tests. Thus, in total, the 1-, 3- and 5-layer models include n_par = 3, n_par = 7, and n_par = 11 calibration parameters, respectively.

The error model accounts for the combined effects of the reproducibility error, a potential measurement bias (e.g., due to misplacement of the observation points), and most importantly the model-conceptual error (e.g., due to suboptimal definition of layers or lacking 3-D heterogeneity). In essence, none of the defined models are claimed to be perfect representations of reality so that misfits that are bigger than the error of the measurements themselves are accepted for the sake of keeping the hydrogeological models comparably simple and the fitted parameters meaningful. In this framework, a heteroscedastic error model is needed that has a set of parameters that become part of the fitting procedure. As different models have different deficiencies, they have different model errors, and judging the quality of the different models is based on the fitted coefficients of the error model. After testing different error models, which for the sake of brevity are not presented here, the following parameterization appeared to represent the behavior of the residuals reasonably well:

$$ \sigma =a+\frac{b\bullet {s}_{\mathrm{meas}}^2}{c+{s}_{\mathrm{meas}}} $$

(10)

with a, b and c being the error-model parameters. This specific error model starts off with a constant error with $ \underset{s_{\mathrm{meas}}\to 0}{\lim}\sigma =a $ corresponding to the absolute error, then shows a quadratic increase with the measurement, and converges to a linear dependence on s_meas for large values with $ \underset{s_{\mathrm{meas}}\to \infty }{\lim}\frac{\sigma }{s_{\mathrm{meas}}}=b $ corresponding to the relative error. The parameter c quantifies how quickly the error model converges from the measurement-independent to the linear regime.

The error model parameters are determined by calibrating the 1-, 3-, and 5-layer models according to the expectation-maximization method (Dempster et al. 1977). The scheme involves iteratively minimizing the objective function with the Trust-Region Reflective Least-Squares method of the function lsqnonlin in the optimization toolbox of MATLAB (Coleman and Li 1996) with given coefficients of the error model and updating the error-model parameters by performing a least-squares fit of the error model to the absolute residuals |s_sim(p) − s_meas| of the model fit to the measured drawdown s_meas. With this, the error-model parameters a, b and c, as well as all model parameters, are included in the optimization process. The iterative calibration procedure is completed when the change in all model and error parameters is less than 1%. The comparison of the different models is now not based on meeting the observations within the measurement error but on the magnitude of the model error needed to accept the different models. In the following, the goodness of fit is assessed by comparing the resulting absolute and relative errors between the 1-, 3-, and 5-layer models.

After fitting the models, the associated standard deviation $ {\hat{\sigma}}_{p_i} $ of estimation of the model parameter i are first computed by linearized error propagation:

$$ {\hat{\sigma}}_{p_i}=\sqrt{{\mathbf{C}}_{\mathbf{pp}}\left(i,i\right)} $$

(11)

with the parameter covariance matrix C_pp computed by:

$$ {\mathbf{C}}_{\mathbf{pp}}=\frac{\varphi }{n_{\mathrm{meas}}-{n}_{\mathrm{par}}}{\left({\mathbf{J}}^T{\boldsymbol{\Sigma}}^{-\mathbf{1}}\mathbf{J}\right)}^{-1} $$

(12)

in which the Jacobian J contains the partial derivatives of all simulated measurements with respect to all parameters, and Σ is the diagonal matrix of the squared errors according to the error model. Because the parameters a and b of the error model are bigger if the model shows larger misfits, the resulting parameter standard deviations of estimation are also bigger.

To address nonlinearity, the uncertainty estimate of the model parameters is refined by applying a Markov-Chain Monte Carlo (MCMC) method for the hydraulic-conductivity values with Metropolis-Hastings sampling, starting with the best estimate of the preceding optimization and keeping the coefficients of the error model as well as the reference drawdown values s_ref fixed. This leads to a sample of 1,000 parameter realizations for each model, drawn from the posterior distribution. The results of the MCMC sampling are given in section S6 of the ESM.

Finally, the radial and vertical conductivities K_r and K_z are upscaled to the full aquifer thickness, resulting in the effective radial and vertical conductivities $ {K}_{\mathrm{r}}^{\mathrm{eff}} $ and $ {K}_{\mathrm{z}}^{\mathrm{eff}} $, defined as the arithmetic and harmonic means of layer-specific values, respectively:

$$ {K}_{\mathrm{r}}^{\mathrm{eff}}=\frac{1}{z_{\mathrm{top}}-{z}_{\mathrm{bot}}}{\int}_{z_{\mathrm{bot}}}^{z_{\mathrm{top}}}{K}_{\mathrm{r}}\left(\zeta \right) d\zeta $$

(13)

$$ {K}_{\mathrm{z}}^{\mathrm{eff}}=\kern0.5em \left({z}_{\mathrm{top}}-{z}_{\mathrm{bot}}\right){\left({\int}_{z_{\mathrm{bot}}}^{z_{\mathrm{top}}}\frac{1}{K_{\mathrm{z}}\left(\zeta \right)} d\zeta \right)}^{-1} $$

(14)

From this, the anisotropy ratio ϑ is calculated by:

$$ \vartheta =\frac{K_{\mathrm{r}}^{\mathrm{eff}}}{K_{\mathrm{z}}^{\mathrm{eff}}} $$

(15)

The calculation of $ {K}_{\mathrm{r}}^{\mathrm{eff}} $, $ {K}_{\mathrm{z}}^{\mathrm{eff}} $, and ϑ is also performed for each realization of the MCMC ensemble, resulting in distributions of these quantities.

Results and discussion

Field measurements

Figure 5 shows the drawdown measurements s_meas rescaled to reflect a harmonized pumping rate of 0.01 m³/s belonging to the hydraulic tests performed at the top (Fig. 5a), the middle (Fig. 5b) and the bottom (Fig. 5c) screen of the pumping well. The measurements are displayed as a function of the radial distance to the pumping well (different colors in Fig. 5a–c), while for each radial distance the observations are aligned with elevation (bar placement along the z-axis in Fig. 5). As expected, Fig. 5 shows that in each hydraulic test the observed drawdown decreases with increasing radial distance to the pumping well and are higher at elevations close to the pumped screen interval.

The drawdown observations s_meas range between millimeters and meters. The signal strength differs between the three tests even after correcting for different pumping rates. When extracting water from the lower screen (Fig. 5c), the drawdown does not reach the high values observed in the other tests, whereas the strongest responses result from the hydraulic test with water extraction from the middle screen (Fig. 5b). These differences may be caused by vertical variations of hydraulic conductivity. In particular, a lower-conductivity layer at a depth close to that of the middle screen could explain higher drawdown values when water is extracted from this screen.

Measurement reproducibility and horizontal heterogeneity

Figure 6a shows the rescaled drawdown measurements s_meas together with the errors obtained during the reproducibility test σ_repr. Again, Fig. 6a shows that the measurement signal varies between the three hydraulic tests (different colors in Fig. 6a).

A comparison of the errors between the four directions (north, east, south, and west of the pumping well, see different marker symbols in Fig. 6a) reveals no significant spatial pattern with respect to reproducibility.

Figure 6b shows the errors associated with measurements obtained at similar radial distance and depth but in different directions to the pumping well. While the reproducibility errors σ_repr do not exceed values of 3 cm, the errors associated with horizontal heterogeneity are higher and reach values of up to 10 cm.

Goodness of model calibration

Figure 7a–c shows the absolute differences between simulated and measured drawdowns |s_sim − s_meas| versus the measured drawdowns s_meas of the 1-, 3-, and 5-layer model calibrations considering the final error-model update. Comparing parts a, b and c of Fig. 7 shows that the differences between simulated and measured drawdowns are significantly higher for the 1-layer than for the two multilayer models. In all cases, the error-model fit according to Eq. 10 captures the majority of measurements and its errors, while some outliers exist.

Figure 7d–f presents a comparison between measured and simulated drawdown values, s_meas and s_sim, for the best-fitting 1-, 3-, and 5-layer models associated with the error models determined in Fig. 7a–c, respectively. Figure 7d reveals that the 1-layer model systematically underestimates the drawdown in the second hydraulic test, in which water was extracted from the middle screen section, whereas the two multilayer models can decently fit all three hydraulic tests (Fig. 7e,f). As discussed earlier, the drawdown values were higher in the second test series than in the tests where water was extracted from the bottom and top screen, respectively. The multilayer models can reproduce this pattern by fitting a lower horizontal hydraulic-conductivity value to the middle depth of the aquifer (see the following), whereas the 1-layer model can either fit the high drawdown values of the second hydraulic test or the smaller drawdown measurements of the first and third tests. Linear regressions of the fitted versus the measured drawdown values confirm that the 1-layer model has systematic difficulties (slope of 0.29, coefficient of determination R² of 0.55), whereas the 3-layer model (slope of 0.81, R² = 0.86) and the 5-layer model (slope of 0.82, R² = 0.87) show a similar, overall satisfactory performance.

In general, the two multilayer models meet the majority of measured drawdowns (Fig. 7e,f), with only a few measurements falling far off the 1:1-identity line. As indicated by the different marker styles in Fig. 7e,f, there is no clear relationship between the directions of the measurement locations (north, east, west, and south of the pumping well) and the tendency towards over- or underestimating the measured drawdown values, obviating horizontal anisotropy.

Fitted parameter values

Tables 2 and 3 list the parameters estimated for all models. As mentioned before, the set of calibrated parameters include the radial and vertical hydraulic conductivities K_r and K_z of each horizontal layer considered in the model and for each extraction depth the drawdown s_ref at a virtual reference point to avoid computing the drawdown differences between true observation points in the steady-shape regime. While Table 3 contains the fitted values of s_ref, they do not have any real physical meaning.

Table 2

Calibrated radial and vertical hydraulic conductivities and the associated standard deviations $ \hat{\sigma} $ of estimation of each horizontal layer in the 1-, 3-, and 5-layer models. Best est.: value determined by gradient-based optimization (lsqnonlin); Lin. Prop.: linearized uncertainty propagation; MCMC: geometric mean and standard deviation determined by Markov-Chain Monte-Carlo method

Model	Layer	K_r [m/s]		$ {\hat{\boldsymbol{\sigma}}}_{\mathbf{\ln}{\boldsymbol{K}}_{\mathbf{r}}} $		K_z [m/s]		$ {\hat{\boldsymbol{\sigma}}}_{\mathbf{\ln}{\boldsymbol{K}}_{\mathbf{z}}} $
	Layer	Best est.	MCMC	Lin. Prop.	MCMC	Best est.	MCMC	Lin. Prop.	MCMC
1-layer model	–	1.1 × 10⁻³	1.1 × 10⁻³	0.055	0.010	1.8 × 10⁻⁴	1.8 × 10⁻⁴	0.135	0.084
3-layer model	1	2.7 × 10⁻³	2.7 × 10⁻³	0.087	0.068	5.9 × 10⁻⁵	6.8 × 10⁻⁵	0.261	0.230
	2 (2 ∪ 3 of 5-layer model)	2.7 × 10⁻⁴	2.7 × 10⁻⁴	0.119	0.110	6.3 × 10⁻⁵	6.3 × 10⁻⁵	0.116	0.077
	3 (4 ∪ 5 of 5-layer model)	2.6 × 10⁻³	2.6 × 10⁻³	0.116	0.047	2.4 × 10⁻⁴	2.3 × 10⁻⁴	0.338	0.282
5-layer model	1	3.8 × 10⁻³	3.7 × 10⁻³	0.164	0.100	1.2 × 10⁻⁴	1.3 × 10⁻⁴	0.533	0.441
	2	7.0 × 10⁻⁴	5.7 × 10⁻⁴	1.213	0.829	5.1 × 10⁻⁵	4.5 × 10⁻⁵	0.357	0.298
	3	3.1 × 10⁻⁴	3.2 × 10⁻⁴	0.123	0.103	5.6 × 10⁻⁵	5.5 × 10⁻⁵	0.131	0.101
	4	1.0 × 10⁻⁷	5.1 × 10⁻⁶	571	2.425	8.4 × 10⁻⁵	9.2 × 10⁻⁵	1.038	0.508
	5	2.7 × 10⁻³	2.7 × 10⁻³	0.129	0.057	1.8 × 10⁻⁴	1.7 × 10⁻⁴	0.371	0.300

Table 3

Effective parameters and additional calibration results of the locally anisotropic 1-, 3-, and 5-layer models. The effective conductivities $ {K}_{\mathrm{r}}^{\mathrm{eff}} $ and $ {K}_{\mathrm{z}}^{\mathrm{eff}} $ as well as the anisotropy ratio ϑ are given as arithmetic means ± their standard deviations as obtained from the Markov-Chain Monte-Carlo simulations

Parameter	1-layer model	3-layer model	5-layer model
$ {K}_r^{\mathrm{eff}} $ [m/s]	1.1 × 10⁻³ ± 1.1 × 10⁻⁵	2.0 × 10⁻³ ± 6.6 × 10⁻⁵	2.2 × 10⁻³ ± 8.3 × 10⁻⁵
$ {K}_z^{\mathrm{eff}} $ [m/s]	1.8 × 10⁻⁴ ± 1.6 × 10⁻⁵	1.1 × 10⁻⁴ ± 1.5 × 10⁻⁵	1.0 × 10⁻⁴ ± 1.5 × 10⁻⁵
ϑ [−]	6.21 ± 0.48	18.0 ± 2.62	21.1 ± 3.25
s_ref,I/s_ref,II/s_ref,III [m]	0.071/0.064/0.062	0.032/0.028/0.022	0.028/0.023/0.019
a [m]	0.003	7.8 × 10⁻⁴	0.001
b [−]	0.93	0.24	0.24
c [m]	0.29	0	0

Figure 4c includes the radial (blue lines) and vertical (red lines) hydraulic conductivities estimated for the 1-layer model (dotted lines), the 3-layer model (dashed lines), and the 5-layer model (solid lines). In both cases, the radial hydraulic conductivities are higher than the vertical counterparts, except for layer 4 of the 5-layer model (Table 2).

In the 5-layer model the radial and vertical hydraulic log-conductivities estimated for the sand layer 4 have considerably higher associated standard deviations $ {\hat{\sigma}}_{\ln K} $ of estimation than all other conductivity estimates (Table 2), indicating that the measurements are insensitive to the conductivities estimated for that layer. Also, the fitted conductivity values of the 5-layer model reveal that the sand layer 2 is quite similar to layer 1. This hints that the sand layers, which were delineated by grain-size analysis, may not necessarily constitute distinct individual layers, but rather represent transition zones between the three main aquifer segments. This is the reason why the 3-layer model was set up. In this model, the investigated aquifer portion is subdivided into three main sections, with overall similar effective behavior as in the 5-layer model but without the need to fit insensitive parameters. The horizontal hydraulic conductivity K_r for the top and bottom of the investigated aquifer portion show significantly higher values than the middle section, whereas the fitted vertical conductivity K_z systematically increases with depth. The reduced horizontal conductivity of the middle section (layer 3 in the 5-layer model, layer 2 in the 3-layer model) can explain the larger drawdown values in the second test series, in which groundwater is extracted from the middle screen.

Figure 8 shows the ensemble results of the effective horizontal and vertical hydraulic conductivities, $ {K}_{\mathrm{r}}^{\mathrm{eff}} $ and $ {K}_{\mathrm{z}}^{\mathrm{eff}} $, respectively, and the anisotropy ratio $ \vartheta ={K}_{\mathrm{r}}^{\mathrm{eff}}/{K}_{\mathrm{z}}^{\mathrm{eff}} $ for all three models. In the 1-layer model, the fitted horizontal and vertical conductivities K_r and K_z correspond to the effective values $ {K}_{\mathrm{r}}^{\mathrm{eff}} $ and $ {K}_{\mathrm{z}}^{\mathrm{eff}} $, respectively. Upon upscaling of the 3- and 5-layer models, the estimated effective horizontal conductivities $ {K}_{\mathrm{r}}^{\mathrm{eff}} $ of the multilayer models is about twice as high as the estimated value of the 1-layer model (see Table 3), while the upscaled vertical conductivity $ {K}_{\mathrm{z}}^{\mathrm{eff}} $ of the multilayer model is about half the fitted value of the 1-layer model. As a consequence, the anisotropy ratio $ \vartheta ={K}_{\mathrm{r}}^{\mathrm{eff}}/{K}_{\mathrm{z}}^{\mathrm{eff}} $ differs between the one- and multilayer models by a factor of more than 3 (ϑ ≈6 versus ϑ ≈20). Given the uncertainty of the estimates (see distributions in Fig. 8 and standard deviations in Table 3), the effective anisotropy of the 3- and 5-layer models is about the same. Note that both the small anisotropy ratio estimated by the 1-layer model and the large one by the multilayer models are within reasonable ranges expected for fluvial deposits (Freeze and Cherry 1979; Kruseman and de Ridder 1994). Applying the upscaled anisotropic conductivity in a homogeneous model leads to similar misfits of the drawdown data as the fitted 1-layer model (data shown in section S5 of the ESM). That is, meeting the observations clearly requires a profile of hydraulic conductivity with vertical differences.

Table 3 contains the determined coefficients of the error models related to all three models. While the 1-layer model shows an absolute error of a = 3 mm and a relative error of b = 93%, the multilayer models have similar absolute errors of ≤1 mm and a considerably lower relative error of 24%, proving to be the preferred model choice. Also, the specific error model of Eq. (10), with a smooth transition from an error that does not depend on the magnitude of the measurement to a linear dependence, is only needed for the 1-layer model. The errors of the two multilayer models can be expressed by standard expressions involving an absolute and a relative error only.

Conclusions

This work has tested an approach for estimating the hydraulic anisotropy induced by vertical heterogeneity in stratified aquifers. The approach is based on calibrating groundwater flow models using data of sequential hydraulic tests with partially penetrating wells, in which water is extracted from different aquifer depths and the hydraulic response is measured at different radial and vertical distances to an extraction screen (Maier et al. 2020). Pumping-test series with three extraction depths were performed in a fluvial gravel aquifer in South-West Germany, measuring more than 1,000 transient drawdown responses with a monitoring network of 58 observation points. These data were used to fit an anisotropic homogeneous model as well as locally anisotropic 3-layer and 5-layer models. The main target parameters were the radial and vertical hydraulic conductivities of each horizontal layer. The 3- and 5-layer models could reproduce the observed drawdown measurements considerably better than the 1-layer model, particularly because one of the three test series showed larger drawdown values, which could be attributed to pumping from a less permeable layer in the multilevel models, whereas the uniform model showed a systematic bias.

Based on the presented investigations, the following general recommendations for the design and analysis of pumping tests targeting hydraulic anisotropy are proposed:

The key element of the pumping tests is to extract water from a partially penetrating well, which induces a strong vertical flow component (at least in the vicinity of the pumping well), which is required to resolve the directional dependence of hydraulic conductivity in stratified aquifers.

The development of the pumping well considered in this study follows the development of an extraction well used for dewatering measures in a large construction pit. Performing the pumping tests is not restricted to such a large-diameter well or to the screen lengths of the well considered in the present study. The well diameter should be dimensioned based on the objective to induce a sufficiently large cone of depression which at the same time is within a measurable signal range.

Stressing the aquifer by extraction in different depths is mandatory. If water had been extracted only from a single depth (e.g., using the bottom well screen), the general vertical profile of hydraulic conductivity would most likely not have been detected.

Checking the reproducibility of the performed pumping tests by repeating the tests with different pumping rates and then rescaling the results to a common rate is highly recommended. Averaging over the repetitive tests has reduced the large data volume.

To avoid the challenges related to analyzing transient data or of reaching steady-state drawdown in field applications, the steady-shape analysis (Bohling et al. 2002; Bohling et al. 2007) is advantageous. Implementing a virtual reference point to compute drawdown differences is a reasonable alternative to the computation of drawdown differences based on pairs of true observation points, for which inherent measurement errors are propagated.

If sufficient data are available, it is preferable to resolve the main vertical structure of hydraulic conductivity over fitting a uniform effective conductivity tensor. A better agreement between simulated and measured drawdowns, avoiding systematic bias, was achieved with the multilayer models than with the single-layer model. Upon upscaling, the anisotropy ratio resulting from the multilayer model was considerably larger. Also, identifying layers of preferential flow may be important both in solute-transport applications and in flow applications in which the vertical flow component occurs mainly in a specific depth, as in the dewatering scenario considered in the authors’ preceding theoretical study (Maier et al. 2020).

Selecting the right number and vertical positions of multiple layers is a challenge and may be prone to confirmation bias. As Zhao and Illman (2018) have illustrated, the use of information from prior hydrogeological investigations benefits model calibration. In this study, available lithologic information from the drilling profile of the pumping well proved to be a plausible decision guide for narrowing down potential layers, but in hindsight one layer per extraction screen turned out to be sufficient. Most likely, performing several flowmeter or direct-push injection-logging tests to see whether consistent layers of higher or lower conductivities exist across several vertical profiles would have been better for delineating hydraulically relevant layers than the grain-size data used here.

The true hydraulic conductivity in an aquifer will always be a spatially variable full 3 × 3 tensor. On the scale of pumping tests, however, horizontal variability is often smaller than the differences among the vertical layers. To justify the assumption of radial symmetry (neglecting horizontal heterogeneity and/or anisotropy), it was important to install observation wells in several directions from the pumping well.

Overall, the study has demonstrated the applicability of the proposed approach targeting the vertical variability and anisotropy of potentially stratified aquifers. Of course, the experimental effort of installing a multisection partially penetrating well and multilevel observation wells is considerably higher than the effort associated with fully-screened wells. This extra effort may only be justified in applications in which either significant vertical flow is to be expected such as in riverbank-filtration setups or in the design of horizontal collector wells, or when the identification of preferential-flow layers is crucial, like in solute-transport applications.

Acknowledgements

The raw field data, the data set implemented in model calibration and the MATLAB codes of the numerical groundwater flow model are available on a repository of the University of Tübingen (2022).

Declarations

Conflict of interest

None.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

previous article How groundwater time series and aquifer property data explain heterogeneity in the Permo-Triassic sandstone aquifers of the Eden Valley, Cumbria, UK

next article Estimating groundwater mean transit time from SF6 in stream water: field example and planning metrics for a reach mass-balance approach

Supplementary Information

ESM 1 (PDF 2121 kb)

Bair ES, Lahm TD (1996) Variations in capture-zone geometry of a partially penetrating pumping well in an unconfined aquifer. Ground Water 34(5):842–852. https://doi.org/10.1111/j.1745-6584.1996.tb02079.xCrossRef

Bear J (1972) Dynamics of fluids in porous media. Dover, New York

Bennett JP, Haslauer CP, Ross M, Cirpka OA (2019) An open, object-based framework for generating anisotropy in sedimentary subsurface models. Groundwater 57(3):420–429. https://doi.org/10.1111/gwat.12803CrossRef

Bohling GC (2009) Sensitivity and resolution of tomographic pumping tests in an alluvial aquifer. Water Resour Res 45:W02420. https://doi.org/10.1029/2008wr007249

Bohling GC, Butler JJ, Zhan YY, Knoll MD (2007) A field assessment of the value of steady shape hydraulic tomography for characterization of aquifer heterogeneities. Water Resour Res 43 (5):W05430. https://doi.org/10.1029/2006wr004932

Bohling GC, Zhan XY, Butler JJ, Zheng L (2002) Steady shape analysis of tomographic pumping tests for characterization of aquifer heterogeneities. Water Resour Res 38 (12):1324. https://doi.org/10.1029/2001wr001176

Borghi A, Renard P, Courrioux G (2015) Generation of 3D spatially variable anisotropy for groundwater flow simulations. Groundwater 53(6):955–958. https://doi.org/10.1111/gwat.12295CrossRef

Butler JJ, Dietrich P, Wittig V, Christy T (2007) Characterizing hydraulic conductivity with the direct-push permeameter. Ground Water 45(4):409–419. https://doi.org/10.1111/j.1745-6584.2007.00300.xCrossRef

Cardiff M, Barrash W (2011) 3-D transient hydraulic tomography in unconfined aquifers with fast drainage response. Water Resour Res 47:W12518. https://doi.org/10.1029/2010wr010367

Chen XH, Burbach M, Cheng C (2008) Electrical and hydraulic vertical variability in channel sediments and its effects on streamflow depletion due to groundwater extraction. J Hydrol 352(3–4):250–266. https://doi.org/10.1016/j.jhydrol.2008.01.004CrossRef

Chen XH, Song JX, Wang WK (2010) Spatial variability of specific yield and vertical hydraulic conductivity in a highly permeable alluvial aquifer. J Hydrol 388(3–4):379–388. https://doi.org/10.1016/j.jhydrol.2010.05.017CrossRef

Coleman TF, Li YY (1996) An interior trust region approach for nonlinear minimization subject to bounds. SIAM J Optim 6(2):418–445. https://doi.org/10.1137/0806023CrossRef

Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via EM algorithm. J Royal Stat Soc Series B-Methodol 39(1):1–38. https://doi.org/10.1111/j.2517-6161.1977.tb01600.xCrossRef

Dietrich P, Butler JJ, Faiss K (2008) A rapid method for hydraulic profiling in unconsolidated formations. Ground Water 46(2):323–328. https://doi.org/10.1111/j.1745-6584.2007.00377.xCrossRef

Freeze RA, Cherry JA (1979) Groundwater. Prentice Hall, Englewood Cliffs, NJ

Gottlieb J, Dietrich P (1995) Identification of the permeability distribution in soil by hydraulic tomography. Inverse Problems 11(2):353–360. https://doi.org/10.1088/0266-5611/11/2/005CrossRef

Heinz J, Aigner T (2003) Hierarchical dynamic stratigraphy in various quaternary gravel deposits, Rhine glacier area (SW Germany): implications for hydrostratigraphy. Int J Earth Sci 92(6):923–938. https://doi.org/10.1007/s00531-003-0359-2CrossRef

Heinz J, Kleineidam S, Teutsch G, Aigner T (2003) Heterogeneity patterns of quaternary glaciofluvial gravel bodies (SW-Germany): application to hydrogeology. Sediment Geol 158(1–2):1–23. https://doi.org/10.1016/S0037-0738(02)00239-7CrossRef

Hochstetler DL, Barrash W, Leven C, Cardiff M, Chidichimo F, Kitanidis PK (2016) Hydraulic tomography: continuity and discontinuity of high-K and low-K zones. Ground Water 54:171–185. https://doi.org/10.1111/gwat.12344CrossRef

Klammler H, Hatfield K, Nemer B, Mathias SA (2011) A trigonometric interpolation approach to mixed-type boundary problems associated with permeameter shape factors. Water Resour Res 47:W03510. https://doi.org/10.1029/2010wr009337

Klammler H, Layton L, Nemer B, Hatfield K, Mohseni A (2017) Theoretical aspects for estimating anisotropic saturated hydraulic conductivity from in-well or direct-push probe injection tests in uniform media. Adv Water Resour 104:242–254. https://doi.org/10.1016/j.advwatres.2017.04.010CrossRef

Koltermann CE, Gorelick SM (1996) Heterogeneity in sedimentary deposits: a review of structure-imitating, process-imitating, and descriptive approaches. Water Resour Res 32(9):2617–2658. https://doi.org/10.1029/96wr00025CrossRef

Kruseman GP, de Ridder NA (1994) Analysis and evaluation of pumping test data, vol 47. International Institute for Land Reclamation and Improvement, Wageningen, The Netherlands

Lessoff SC, Schneidewind U, Leven C, Blum P, Dietrich P Dagan G (2010) Spatial characterization of the hydraulic conductivity using direct-push injection logging. Water Resour Res 46:W12502. https://doi.org/10.1029/2009wr008949

LGRB (2004) Geological map of Baden-Württemberg, Germany. Landesamt für Geologie, Rohstoffe und Bergbau Baden-Württemberg, Germany

Maier R, Gonzalez-Nicolas A, Leven C Nowak W, Cirpka OA (2020) Joint optimization of measurement and modeling strategies with application to radial flow in stratified aquifers. Water Resour Res 56 (7):2019WR026872. https://doi.org/10.1029/2019WR026872

Paradis D, Gloaguen E, Lefebvre R, Giroux B (2015) Resolution analysis of tomographic slug test head data: two-dimensional radial case. Water Resour Res 51(4):2356–2376. https://doi.org/10.1002/2013wr014785CrossRef

Paradis D, Gloaguen E, Lefebvre R, Giroux B (2016) A field proof-of-concept of tomographic slug tests in an anisotropic littoral aquifer. J Hydrol 536:61–73. https://doi.org/10.1016/j.jhydrol.2016.02.041CrossRef

Sanchez-Leon E, Leven C, Haslauer CP, Cirpka OA (2016) Combining 3D hydraulic tomography with tracer tests for improved transport characterization. Groundwater 54(4):498–507. https://doi.org/10.1111/gwat.12381CrossRef

University of Tübingen (2022) Forschungsdatenportal FDAT [Research data portal FDAT]. http://hdl.handle.net/10900.1/a67994b5-19be-4934-9bdb-12fdc0aaef23. Accessed January 1, 2022

Wirsing G,Luz A (2007) Hydrogeologischer Bau und Aquifereigenschaften der Lockergesteine im Oberrheingraben (Baden-Württemberg) [Hydrogeological structure and aquifer properties of the unconsolidated rocks in the upper Rhine rift (Baden-Württemberg)]. Freiburg im Breisgau, Germany

Yeh TCJ, Liu SY (2000) Hydraulic tomography: development of a new aquifer test method. Water Resour Res 36(8):2095–2105. https://doi.org/10.1029/2000wr900114CrossRef

Zhao ZF, Illman WA (2018) Three-dimensional imaging of aquifer and aquitard heterogeneity via transient hydraulic tomography at a highly heterogeneous field site. J Hydrol 559:392–410. https://doi.org/10.1016/j.jhydrol.2018.02.024CrossRef

Zlotnik V, Ledder G (1996) Theory of dipole flow in uniform anisotropic aquifers. Water Resour Res 32(4):1119–1128. https://doi.org/10.1029/95wr03813CrossRef

Zschornack L, Bohling GC, Butler JJ, Dietrich P (2013) Hydraulic profiling with the direct-push permeameter: assessment of probe configuration and analysis methodology. J Hydrol 496:195–204. https://doi.org/10.1016/j.jhydrol.2013.05.036CrossRef

Title: Revealing vertical aquifer heterogeneity and hydraulic anisotropy by pumping partially penetrating wells
Authors: Ruth Maier
Carsten Leven
Emilio Sánchez-León
Daniel Strasser
Maximilian Stoll
Olaf A. Cirpka
Publication date: 09-02-2022
Publisher: Springer Berlin Heidelberg
Published in: Hydrogeology Journal / Issue 2/2022
Print ISSN: 1431-2174
Electronic ISSN: 1435-0157
DOI: https://doi.org/10.1007/s10040-022-02458-9

Parameter	1-layer model	3-layer model	5-layer model
\( {K}_r^{\mathrm{eff}} \) [m/s]	1.1 × 10⁻³ ± 1.1 × 10⁻⁵	2.0 × 10⁻³ ± 6.6 × 10⁻⁵	2.2 × 10⁻³ ± 8.3 × 10⁻⁵
\( {K}_z^{\mathrm{eff}} \) [m/s]	1.8 × 10⁻⁴ ± 1.6 × 10⁻⁵	1.1 × 10⁻⁴ ± 1.5 × 10⁻⁵	1.0 × 10⁻⁴ ± 1.5 × 10⁻⁵
ϑ [−]	6.21 ± 0.48	18.0 ± 2.62	21.1 ± 3.25
s_ref,I/s_ref,II/s_ref,III [m]	0.071/0.064/0.062	0.032/0.028/0.022	0.028/0.023/0.019
a [m]	0.003	7.8 × 10⁻⁴	0.001
b [−]	0.93	0.24	0.24
c [m]	0.29	0	0

Springer Professional

Abstract

Supplementary Information

Publisher’s note

Introduction

Field application

Field site

Hydrogeological setting

Monitoring network

Hydraulic tests

Data processing

Model setups

Governing equations

Conceptual models

Model calibration

Results and discussion

Field measurements

Measurement reproducibility and horizontal heterogeneity

Goodness of model calibration

Fitted parameter values

Conclusions

Acknowledgements

Declarations

Conflict of interest

Publisher’s note

Supplementary Information

Other articles of this Issue 2/2022

Estimating groundwater mean transit time from SF6 in stream water: field example and planning metrics for a reach mass-balance approach

Domestic-well failure mitigation and costs in groundwater management planning: observations from recent groundwater sustainability plans in California, USA

The influence of layer and voxel geological modelling strategy on groundwater modelling results

High-efficiency and high-resolution numerical modeling for two-dimensional infiltration processes, accelerated by a graphics processing unit

Impact of mining-induced bed separation spaces on a cretaceous aquifer: a case study of the Yingpanhao coal mine, Ordos Basin, China

Correction: Stratigraphic and structural controls on groundwater salinity variations in the Poso Creek Oil Field, Kern County, California, USA