BST228: Applied Bayesian Analysis
BST228 Applied Bayesian Analysis is a practical introduction to the Bayesian analysis of biomedical data taught in the Department of Biostatistics at the Harvard T.H. Chan School of Public Health taught by Prof Stephenson and Dr Hoffmann. It is an intermediate graduate-level course in the philosophy, analytic strategies, implementation, and interpretation of Bayesian data analysis. Specific topics that will be covered include: the Bayesian paradigm; Bayesian analysis of basic models; Markov Chain Monte Carlo for posterior inference; Stan R software package for Bayesian data analysis; linear regression; hierarchical regression models; generalized linear models; meta-analysis; models for missing data.
Lectures
- Introduction (taught by Prof Stephenson)
- The Bayesian Paradigm (taught by Prof Stephenson)
- Mechanics of Bayes’ Theorem and Prior Distributions (taught by Prof Stephenson)
- Posteriors, Prediction, and Simple Models (taught by Prof Stephenson)
- Binomial, Poisson, and Normal Models (taught by Dr Hoffmann, slides): Similarities and differences between binomial and Poisson models; what constitutes a “non-informative” prior; normal likelihood.
- Normal and Multivariate Models (taught by Dr Hoffmann, slides): Choosing hyperparameters for weakly-informative priors; posterior for location parameter of normal likelihood given known precision parameter and vice versa.
- Joint Inference (taught by Dr Hoffmann, slides): Joint and marginal distributions; normal-gamma conjugate prior for normal data with unknown location and precision; marginal posterior for location and precision parameters.
- Introduction to MCMC (taught by Prof Stephenson)
- Gibbs Sampler (taught by Prof Stephenson)
- MCMC Diagnostics (taught by Prof Stephenson)
- Linear Regression (taught by Dr Hoffmann, slides): Review of MCMC diagnostics; generic Metropolis sampler implementation in R; linear regression likelihood; constructing regression features from data; conditional distributions for Gibbs sampling regression parameters.
- Regression Case Study (taught by Dr Hoffmann, slides): Limiting cases of conditional distributions for regression parameters; funnels in coefficient-precision space; posterior correlation for regression coefficients for features with non-zero mean; interpreting regression coefficients; posterior predictive distribution for linear regression.
- Generalized Linear Models & Stan (taught by Dr Hoffmann, slides): Heteroskedastic regression; generalized linear models applied to an example of incumbency advantage in United States House of Representatives elections; introduction to Stan to decouple model definition and posterior sampling.
- Hierarchical Models I (taught by Prof Stephenson)
- Hierarchical Models II (taught by Prof Stephenson)
- Hierarchical Regression (taught by Prof Stephenson)
- Midterm Review
- Midterm Exam
- Bayesian Model Averaging (taught by Prof Stephenson)
- Model Checking (taught by Dr Hoffmann, slides): sensitivity of Bayesian model averaging to priors, posterior predictive replication, identifying problems with models using replicated data.
- Missing Data (taught by Dr Hoffmann, slides): missing data as parameters of the model, missing data mechanisms (missing completely at random, at random, not at random), ignorability of the missing data mechanism, examples.
- Networks (taught by Dr Hoffmann, slides): statistical and mechanistic network models, Erdős–Rény model, stochastic block models, conditionally independent edge models, application to social isolation, mechanistic models for sexual contact networks.
- Sensitivity Analysis (taught by Prof Stephenson) and Probabilistic Programming with Large Language Models (external presentation by Dr Du Phan from Google)
rjags
and Distributed Computing for Bayesian Computation (taught by Dr Daniel Schwartz)- Bayesian Causal Inference (taught by Dr Heejun Shin)
- Thanksgiving
- Variational Inference (taught by Dr Hoffmann, slides)
Future materials will be posted after the next lecture.