Section 8 Required Texts

8.1 Professor Love’s Materials

Dr. Love maintains a set of Course Notes, titled Data Science for Biological, Medical and Health Research: Notes for 431. Professor Love revises the Notes every year, and so they appear in pieces as the semester progresses.

Although these Notes share some of the features of a textbook, they are neither comprehensive nor completely original. The main purpose is to give 431 students a set of common materials on which to draw during the course, providing a series of examples using R to work through issues that are likely to come up during the semester, and in later work.

In addition, slides and video recordings from each of Professor Love’s lectures, plus other in-class materials from each session of the class will be posted for your use in a timely fashion throughout the semester.

Once class begins, access all materials at the main course website.

8.1.1 The Book you need to purchase

During the course, we will read David Spiegelhalter’s The Art of Statistics, which was first published by Penguin in March 2019 (and February 2020) in the UK and then by Basic Books in the US in September 2019. You can purchase any of the available versions (hard-cover, paperback or e-reader) online or in your local bookstore for about $20.

  • Dr. Spiegelhalter’s website has lots of useful information.
  • The book’s website contains R code, corrections and other materials.
  • You are welcome to read this book before class starts, if you’d like to get a jump on things, but that’s not necessary: we’ll link readings to the syllabus when that becomes available.

Everything else that you will need is free, and will be described in detail on the main course website once the class begins. Some highlights follow…

8.2 Three Books to Download

There are three additional free books that you will definitely need to obtain during the semester and may be interested in looking at before class begins. Simply visit the links below.

  1. Introduction to Modern Statistics by Mine Cetinkaya-Rundel and Johanna Hardin.
  2. R for Data Science by Garrett Grolemund and Hadley Wickham.
  3. Biostatistics for Biomedical Research (pdf) by Frank E. Harrell Jr and James C Slaughter. This is regularly updated by the authors, as indicated on their course website, so get the most recent version occasionally.

8.3 Key Articles and Posts

While I will recommend dozens, perhaps hundreds of articles, blog posts and the like to you over the course of the year, these are especially important in 431.

  1. Several of the guides prepared by Jeff Leek and his group, including:
  2. Data Organization in Spreadsheets by Karl W. Broman and Kara H. Woo in The American Statistician, 2018 Special Issue on Data Science, or you can read the PeerJ preprint version.
  3. Project-oriented workflow at tidyverse.org from Jenny Bryan.
  4. From the Ten Simple Rules series at PLOS Computational Biology:
  5. Statistical Inference in the 21st Century: A World Beyond p < 0.05 from 2019 in The American Statistician
  6. The American Statistical Association’s 2016 Statement on p-Values: Context, Process and Purpose.

See the main course website for other recommendations as the semester goes on.