Instructor: Richard McElreath
Lectures: Uploaded <Playlist> and pre-recorded, two per week
Discussion: Online, Fridays 3pm-4pm Central European Time
This course teaches data analysis, but it focuses on scientific models first. The unfortunate truth about data is that nothing much can be done with it, until we say what caused it. We will prioritize conceptual, causal models and precise questions about those models. We will use Bayesian data analysis to connect scientific models to evidence. And we will learn powerful computational tools for coping with high-dimension, imperfect data of the kind that biologists and social scientists face.
Online, flipped instruction. The lectures are pre-recorded. We'll meet online once a week for an hour to work through the solutions to the assigned problems.
We'll use the 2nd edition of my book, <Statistical Rethinking>. I'll provide a PDF of the book to enrolled students.
Registration: Please sign up via <[COURSE IS FULL SORRY]>. I've also set aside 100 audit tickets at the same link, for people who want to participate, but who don't need graded work and course credit.
There are 10 weeks of instruction. Links to lecture recordings will appear in this table. Weekly problem sets are assigned on Fridays and due the next Friday, when we discuss the solutions in the weekly online meeting.
Lecture playlist on Youtube: <Statistical Rethinking 2022>
Week ## | Meeting date | Reading | Lectures |
---|---|---|---|
Week 01 | 07 January | Chapters 1, 2 and 3 | [1] <The Golem of Prague> <(Slides)> [2] <Bayesian Inference> <(Slides)> |
Week 02 | 14 January | Chapters 4 and 5 | [3] <Basic Regression> <(Slides)> [4] <Categories & Curves> <(Slides)> |
Week 03 | 21 January | Chapters 5 and 6 | [5] <Elemental Confounds> <(Slides)> [6] <Good & Bad Controls> <(Slides)> |
Week 04 | 28 January | Chapters 7, 8 and 9 | [7] <Overfitting> <(Slides)> [8] <Markov chain Monte Carlo> <(Slides)> |
Week 05 | 04 February | Chapters 10 and 11 | [9] <Logistic and Binomial GLMs> <(Slides)> [10] <Sensitivity and Poisson GLMs> <(Slides)> |
Week 06 | 11 February | Chapters 12 and 13 | [11] <Ordered Categories> <(Slides)> [12] <Multilevel Models> <(Slides)> |
Week 07 | 18 February | Chapters 13 and 14 | [13] <Multi-Multilevel Models> <(Slides)> [14] <Correlated varying effects> <(Slides)> |
Week 08 | 25 February | Chapter 14 | [15] <Social Networks> <(Slides)> [16] <Gaussian Processes> <(Slides)> |
Week 09 | 04 March | Chapter 15 | [17] <Measurement Error> <(Slides)> [18] <Missing Data> <(Slides)> |
Week 10 | 11 March | Chapters 16 and 17 | [19] <Beyond GLMs> <(Slides)> [20] <Horoscopes> <(Slides)> |
This course involves a lot of scripting. Students can engage with the material using either the original R code examples or one of several conversions to other computing environments. The conversions are not always exact, but they are rather complete. Each option is listed below. I also list conversions <here>.
For those who want to use the original R code examples in the print book, you need to install the rethinking
R package. The code is all on github https://github.com/rmcelreath/rethinking/ and there are additional details about the package there, including information about using the more-up-to-date cmdstanr
instead of rstan
as the underlying MCMC engine.
The <Tidyverse/brms> conversion is very high quality and complete through Chapter 14.
The <Python/PyMC3> conversion is quite complete. There are also at least two NumPyro conversions: <NumPyro1> <NumPyro2>. And there is this <TensorFlow Probability>.
The <Julia/Turing> conversion is not as complete, but is growing fast and presents the Rethinking examples in multiple Julia engines, including the great <TuringLang>.
The are several other conversions. See the full list at https://xcelab.net/rm/statistical-rethinking/.
I will also post problem sets and solutions. Check the folders at the top of the repository.