-
An analysis of 45 large-scale wastewater sites in England to estimate SARS-CoV-2 community prevalence
Accurate surveillance of the COVID-19 pandemic can be weakened by under-reporting of cases, particularly due to asymptomatic or pre-symptomatic infections, resulting in bias. Quantification of SARS-CoV-2 RNA in wastewater can be used to infer infection prevalence, but uncertainty in sensitivity and considerable variability has meant that accurate measurement remains elusive. Here, we use data from 45 sewage sites in England, covering 31% of the population, and estimate SARS-CoV-2 prevalence to within 1.1% of estimates from representative prevalence surveys (with 95% confidence). Using machine learning and phenomenological models, we show that differences between sampled sites, particularly the wastewater flow rate, influence prevalence estimation and require careful interpretation. We find that SARS-CoV-2 signals in wastewater appear 4–5 days earlier in comparison to clinical testing data but are coincident with prevalence surveys suggesting that wastewater surveillance can be a leading indicator for symptomatic viral infections. Surveillance for viruses in wastewater complements and strengthens clinical surveillance, with significant implications for public health.
-
Minimizing the Expected Posterior Entropy Yields Optimal Summary Statistics
Extracting low-dimensional summary statistics from large datasets is essential for efficient (likelihood-free) inference. We propose obtaining summary statistics by minimizing the expected posterior entropy (EPE) under the prior predictive distribution of the model. We show that minimizing the EPE is equivalent to learning a conditional density estimator for the posterior as well as other information-theoretic approaches. Further summary extraction methods (including minimizing the $L^2$ Bayes risk, maximizing the Fisher information, and model selection approaches) are special or limiting cases of EPE minimization. We demonstrate that the approach yields high fidelity summary statistics by applying it to both a synthetic benchmark as well as a population genetics problem. We not only offer concrete recommendations for practitioners but also provide a unifying perspective for obtaining informative summary statistics.
-
Wastewater catchment areas in Great Britain
Wastewater catchment area data are essential for wastewater treatment capacity planning and have recently become critical for operationalising wastewater-based epidemiology (WBE) for COVID-19. Owing to the privatised nature of the water industry in the United Kingdom, the required catchment area datasets are not readily available to researchers. Here, we present a consolidated dataset of 7,537 catchment areas from ten sewerage service providers in the Great Britain, covering more than 96% of the population of England and Wales. We develop a geospatial method for estimating the population resident within each catchment from small area population estimates generated by the Office for National Statistics. The method is more widely applicable to matching electronic health records to wastewater infrastructure. Population estimates are highly predictive of population equivalent treatment loads reported under the European Urban Wastewater Treatment Directive. We highlight challenges associated with using geospatial data for wastewater-based epidemiology.
-
Understanding and managing uncertainty and variability for wastewater monitoring beyond the pandemic: Lessons learned from the United Kingdom national COVID-19 surveillance programmes
The COVID-19 pandemic has put unprecedented pressure on public health resources around the world. From adversity, opportunities have arisen to measure the state and dynamics of human disease at a scale not seen before. In the United Kingdom, the evidence that wastewater could be used to monitor the SARS-CoV-2 virus prompted the development of National wastewater surveillance programmes. The scale and pace of this work has proven to be unique in monitoring of virus dynamics at a national level, demonstrating the importance of wastewater-based epidemiology (WBE) for public health protection. Beyond COVID-19, it can provide additional value for monitoring and informing on a range of biological and chemical markers of human health. A discussion of measurement uncertainty associated with surveillance of wastewater, focusing on lessons-learned from the UK programmes monitoring COVID-19 is presented, showing that sources of uncertainty impacting measurement quality and interpretation of data for public health decision-making, are varied and complex. While some factors remain poorly understood, we present approaches taken by the UK programmes to manage and mitigate the more tractable sources of uncertainty. This work provides a platform to integrate uncertainty management into WBE activities as part of global One Health initiatives beyond the pandemic.
-
Current Environmental Monitoring Cannot Constrain the Effect of Vaccines on SARS-CoV-2 Transmission
This paper presents summary statistics of wastewater data and a Bayesian hierarchical log-linear regression model developed to predict weekly COVID-19 case rates (NHS Pillar 1 and 2) based on wastewater surveillance data. Outputs are analysed to investigate whether the AstraZeneca and Pfizer/BioNTech vaccines inhibit SARS-CoV-2 infection and transmission in addition to preventing symptomatic disease. No significant deviation was observed between reported case rates and SARS-CoV-2 RNA concentrations in wastewater. However, three confounding factors have been identified that limit the interpretation of this analysis: changes in NPI, the emergence of B.1.1.7, and a change in laboratory methodology. Therefore, the results presented in this paper cannot be considered evidence of COVID-19 vaccines preventing transmission of SARS-CoV-2. While the insight provided by wastewater in interrogating the impact of vaccines on SARS-CoV-2 transmission is limited, the Environmental Monitoring for Health protection programme has, and will continue to, provide surveillance and outbreak support in the COVID-19 response.