Introduction

Imposex is the imposition of male sexual characteristics on female gastropods and has been found to be one of the most sensitive indicators of environmental tri-butyl tin (TBT) exposure. The incidence of imposex was found to be caused by TBT when used as a biocide in marine antifouling paints for both yachts and large ships in the 1960s (Waldock et al. 1988). In Nucella lapillus, the effect is dose related and severe imposex can lead to sterility in females and detrimental reproductive effects on both individuals and populations. The process and characterisation of imposex have been well documented (see, for example, Gibbs et al. 1987; Oehlmann et al. 1991; Santos et al. 2002). Dogwhelk monitoring is also an established biological effect monitoring tool used by many Oslo and Paris Convention (OSPAR 2013a) contracting countries to assess good environmental status by 2020 for the Marine Strategy Framework Directive.

Over the last two decades, legislation has been introduced, firstly to control the use of TBT-based antifouling paints on yachts and, more recently, the prohibition of the use of these biocides on large ships. Imposex (in N. lapillus) surveys like those undertaken in 1992 and 1997 for England and Wales were conducted to establish data to support and justify United Kingdom (UK) and European Union (EU) legislation restricting the use of organotin compounds. In 2003, the International Maritime Organization (IMO) introduced legislation banning the use of organotin on all ships. As seagoing vessels may be repainted with antifouling paint only every 5 years, this ban was implemented during 2003–2008. Surveys conducted in 2004, 2007 and 2010 span the period prior to and during the implementation of the IMO worldwide ban (2003–2008) on the use and application of TBT antifouling paints on all commercial shipping, and therefore serve as a time series allowing us to measure the effectiveness of the legislation. The 2014 survey aimed to study the effectiveness of these measures, 6 years after TBT-containing antifouling paints were fully removed from use.

The surveys of 1992, 1997, 2004 and 2007 established a network of 102 sampling sites (Table 1) across England and Wales. The 2007 survey used 86 sites, reduced in number either because dogwhelks had disappeared from some sites or because some sites were too close to each other, i.e. duplicate sampling. The strategy of the 2010 survey was to revisit all of the 2007 sampling sites, while the primary aim of the 2014 survey was to apply a more risk-based approach by only visiting sites that either indicated effects of TBT at or above (OSPAR) assessment class C in the 2010 or 2007 survey, or where the Environment Agency had recorded TBT levels in the water column above the EQS (personal communication with Ali Miles and Graham Phillips from the EA; sites indicated in Table 1).

Table 1 All 102 imposex sampling sites and when they were sampled

This paper firstly looks at a spatial assessment of the 18 sites sampled in 2014 and secondly examines the results of the 1997 to 2014 surveys with a direct comparison between the same stations sampled more than once between 1997 and 2014.

Materials and methods

Sampling sites

The aim of the limited 1992 survey (14 stations) was to measure the effects TBT had on dogwhelks, to establish the geographic spread of it around England and Wales and to assess the need for a wider study. Due to the high level of TBT exposure around these coastlines, the network of sites increased from 14 sites in 1992 to 102 in 2007 (Fig. 1, Table 1). These sites include areas close to ports, e.g. Dover, Southampton Water and Milford Haven; areas close to shipping lanes (traffic separation zones), e.g. Dover Straits, Isle of Wight and Holyhead; and areas away from the influence of shipping, including “background reference” sites.

Fig. 1
figure 1

All 102 imposex sampling sites in England and Wales

Sample collection

At each sampling site, up to 100 dogwhelks were collected from the foreshore between spring low water and mid tide levels. The location of the site was recorded using national grid references in the early survey years and in latitude and longitude using the global positioning system (GPS) in more recent years. Date and time of collection were also recorded for ICES reporting purposes.

From the 100 animals collected, 50 specimens (young adults, with toothed shells) were taken back to the laboratory for analysis; animals with worn shells were excluded from the sample as they could be older than 4 years. If no individuals were found within the first 25 min of sampling, the station was abandoned. Each sample was stored in a sealed net bag in a cool box with sea water-wetted seaweed (taken from the sampling site) until the sample was transferred to aquarium facilities in the laboratory. On transfer to these facilities (cool [4–15 °C], aerated and filtered sea water), the animals were stored for as short a period of time as possible before analysis. In general, this was less than 36 h and always less than 7 days.

Sample processing and quality assurance

Of the 50 individuals brought back from the shore, 40 were chosen at random and analysed. If an individual was parasitised or dead, it was excluded from the analysis and a fresh dogwhelk was added. Prior to the analysis, the length of each dogwhelk was measured to the nearest 0.01 mm with Vernier callipers and then cracked open with a vice for further analysis, which included sex determination according to Fioroni et al. (1991) and Gibbs et al. (1987), penis length measurements in males and females (if present) and the associated imposex stage in females. To identify the stages of imposex, the methods described by Gibbs et al. (1987) were used.

Quality assurance was implemented over the years by ensuring that the analysts have undergone in-house training and participated in ring tests with other organisations, such as Marine Scotland. The analysts also participated in the Quality Assurance of Information for Marine Environmental Monitoring in Europe laboratory proficiency rounds for imposex in marine snails since 2003. The highest possible score of “satisfactory” was achieved every time an analyst participated. The last round for imposex in marine snails took place in 2012. The same analyst of the 2010 and 2014 surveys participated in the 2012 round.

Calculation of imposex parameter

After all the individuals were measured, the Vas Deferens Sequence Index (VDSI) was calculated as the mean of the Vas Deferens Sequence stages (VDS) observed in one sample to assess the Oslo and Paris Convention (OSPAR) class for each site (Table 2, OSPAR 2004).

Table 2 Interpretations of the assessment classes, referring to Nucella lapillus used in the Oslo and Paris Convention (OSPAR) monitoring guideline (OSPAR 2004)

Statistical analysis

The data were analysed to see whether there was evidence of a trend in mean VDS (VDSI) over time. Trends were assessed by station to avoid biases due to not all stations being sampled on every survey occasion. The survey years used were 1997, 2004, 2007, 2010 and 2014. Stations were included if they were sampled on at least two survey years. This meant that 88 stations were included and only 15 omitted. The 1992 survey could not be included in the assessment as the original VDS values for each dogwhelk were not available (similar to the paper published by Oliveira et al. 2009).

A non-parametric Mann-Kendall (MK) statistic was used to investigate trend. For trend at a single station, the Mann-Kendall method is used as follows. For each VDS observation (except for those in the final survey), the Mann-Kendall (MK) statistic (Mann 1945; Kendall 1975) is the sum of the number of observations from later surveys that are greater than it (+1 per observation) and the number of observations that are less than it (−1 per observation). This is repeated for all time points and the cumulative sum is calculated. We denote this as the MK statistic.

To assign a p value to the observed MK statistic, a two-sided randomisation test (Manly 2006) was used. Under the null hypothesis of no trend, the VDS observations can be randomly permuted (keeping the survey year vector fixed). Then, the MK statistic can be calculated as above. This procedure was carried out 999 times in total to estimate the null distribution of the MK statistic. The number of the absolute values of these observations that are greater than the absolute value of the observed MK statistic is noted (call this BIGGER). Following Manly, the p value is calculated as (BIGGER + 1)/1000.

For the trend analysis, we have assumed that the samples taken at each station were a reasonably representative spatial sample and that the VDS values within a station were independent. Because we cannot be sure whether this assumption holds fully, we also looked at trends between 1997 and 2010 but by analysing the mean VDS values at each station—and hence negating the need for independence between samples within a station. We considered the 56 stations that were sampled in both 1997 and 2010. For each station, we calculated the mean level of VDS. We then compared the 56 VDSI sampled in 1997 with the same 56 VDSI sampled in 2010 using a paired (between stations) non-parametric Wilcoxon test. The same approach was carried out for the 6 stations that were sampled in both 1997 and 2014. While this is a less powerful comparison, it allows us to examine the most recent data.

Results

The results of the 2014 survey show that 2 sites (St. Mawes and Gurnard Bay) still exceed the Ecotoxicological Assessment Criteria (EAC) for TBT exposure with an OSPAR assessment class C, while 3 and 13 sites are at OSPAR assessment class A and B, respectively (Fig. 2 for spatial overview; Table 3 for summary information). Comparing the same 12 sites sampled between 2010 and 2014 shows that three sites (Gurnard Bay, Roa Island and Bovisand) increased in the VDSI (Table S1), with Gurnard Bay and Roa Island also changing to a higher OSPAR assessment class (Table 4). The other nine sites sampled in 2010 and 2014 (Table 1) showed reductions in VDSI with 8 also changing to a lower OSPAR assessment class (Table 4). By comparing the summary results shown in Table 4 of the TBT OSPAR assessment classes, it seemed that there were large decreases in the effects of TBT on N. lapillus between 1992 and 2014 in England and Wales.

Fig. 2
figure 2

Oslo and Paris Convention classification of dogwhelks from 2014 England and Wales sampling sites; OSPAR classifications go from A (background incidence of imposex) to F (populations of the more sensitive gastropod species are absent/expired)

Table 3 Summary results of the 2014 imposex survey separated into strata
Table 4 Nucella lapillus Vas Deferens Sequence Index (VDSI) results since 1992 separated into strata

While recognising that comparisons between years are not strictly appropriate because the same stations are not always sampled in the same year, it is interesting to see how the percentages in the various OSPAR assessment classes changed over years. Not one station out of 14 of the 1992 England and Wales survey was at background imposex level, class A on the OSPAR assessment scale, whereas 100 % of stations were at classes C or D. In contrast, in 2010, 43 % of the 77 sites were class A, followed by 47, 9 and 1 % for B, C and D, respectively. In the intervening surveys, 1997, 2004 and 2007, there was a gradual improvement in the level of imposex in each class recorded as shown by the changes in the proportion of imposex within each class (Table 5). The summary statistics of the imposex development in England and Wales between 1992 and 2014 indicate an increase in the number of sites at OSPAR assessment classes A and B (from 0 % for A and B in 1992 to 17 and 72 % in 2014, respectively; Table 5).

Table 5 Percentage of sites in each OSPAR assessment class for imposex for each reporting year between 1992 and 2014 for England and Wales

A VDSI trend assessment between the 1997 and 2014 surveys was carried out with the non-parametric Mann-Kendall statistic. The results highlighted in Table 4 show that 65 sites out of 88 reduced statistically significantly in VDSI. The Gurnard Bay site showed a significant increase in VDSI. Twenty sites showed not significant reductions and two not significant increases in VDSI (Table 4). A direct comparison of the same 56 sites sampled in the 1997 and 2010 surveys showed that the VDSI reduced from 2.89 and 0.42, respectively—this difference was strongly statistically significant (p > 0.001). Another comparison of the same 6 sites sampled in 1997 and 2014 showed a statistically significant VDSI reduction from 3.14 in 1997 to 1 in 2014 (p = 0.03). The summary information of the observed VDSI can be found in Table S1 in the supplementary section.

Discussion

Over the past two decades of regular surveys, there has been a consistent decrease in imposex levels corresponding to the implementation of different legislation controlling the use of TBT (Birchenough et al. 2002; Bryan et al. 1993). The results described in the 2014 survey show a continued decline in the incidence of imposex since the 1997 survey when looking at the same 6 stations.

The decline has been most marked in 2007 and 2010, probably as a result of the phasing out and complete supposed removal of TBT antifouling paints from boats on 17 September 2008. These results are similar to a study for sites around Portugal by Galante-Oliveira et al. (2011). A general decrease was observed in Portugal: whereas 56 % of dogwhelks had a VDSI value of 2 or higher in 2006, in 2008, only 17 % showed a similar level of effect. This correlates well with our study, where 35 % of dogwhelks had VDSI values greater than 2 in 2007 and only 10 % in 2010. The OSPAR monitoring community also observed a downward trend in VDSI. Over 80 % of sites showing a downward trend and 16 % showed a non-improvement, while 4 % show a stable background concentration level (OSPAR 2014).

Currently, the only source of TBT in the marine environment is from historical/residual contamination in sediments in estuaries and harbours (Sousa et al. 2009) and arising from dredged material disposal, but this activity is tightly controlled and is unlikely to be a significant source of contamination in the future as disposal sites are chosen to be dispersive and insuring that sediments will not stay on one site within England and Wales. Nevertheless, two sites (St. Mawes and Gurnard Bay) measured OSPAR assessment class C in 2014, failing the EAC that was derived for TBT, meaning that there is a risk of adverse effects, such as reduced growth and recruitment (OSPAR 2004). Langston et al. (2015) observed a VDSI of 2.76 in St. Mawes in 2012 which is between the 2010 and 2014 survey result of these surveys (3.73 and 2.57, respectively). Currently, it is difficult to understand why there has not been a significant decrease at these sites. Local illegal use of TBT antifouling paint cannot be ruled out, but it is more likely that historic pollution in sediments may act as a source of TBT for a longer time period in certain areas (OSPAR 2013b). Gibbs (2009) observed little improvement in the reproductive capacity of the population at a shipyard near Falmouth. The TBT pollution in his study area was probably a result of discharges from the shipyards or release or re-suspension from sediments. This could be another explanation for the continuing TBT impacts. His results correlate with the constantly high levels at St. Mawes station Falmouth in our study. As imposex is irreversible in dogwhelks (Bryan et al. 1986) and since they live for up to 7 years, it will take time for the most affected animals (class C and above) to disappear from the population. Nevertheless, clear evidence that dogwhelk populations are recovering at all sites is shown by the presence of juveniles and eggs recorded at each site. Eggs were present at many of the sites, but juveniles were observed at all 77 sites in the 2010 survey and all 18 sites in the 2014 survey. Because dogwhelks do not have a pelagic dispersal phase in their life cycle, it indicates that all populations are actively and successfully reproducing. In 1997, N. lapillus was deemed extinct from the Bill of Portland, but due to the ability to re-colonise areas where extinction has occurred (Huet et al. 2004), these organisms were observed again from 2007 onwards.

Conclusions

Imposex in dogwhelks was attributed to TBT-based antifouling paints used on yachts and ships in the UK in the 1980s (Gibbs and Bryan 1986). Since then, the UK Government has actively pursued legislation to control the use of TBT in the marine environment and supported UK-wide marine monitoring surveys of imposex. As a result, the evidence shown here clearly demonstrates the efficacy of the policy in improving dogwhelk health and in many England and Wales foreshores. The data set shows a good example of science and policy working hand-in-hand for environmental benefit. The continued measurement of imposex in marine molluscs offers the most clear-cut tool for monitoring of a contaminant-specific pollution effect under the Marine Strategy Framework Directive (OSPAR 2014).