6  Required Texts

6.1 Course Notes

Professor Love maintains a set of Course Notes, titled Data Science for Biological, Medical and Health Research: Notes for 432. Professor Love revises the Notes annually, and they will appear in pieces as the semester progresses.

Although these Notes share some of the features of a textbook, they are neither comprehensive nor completely original. The main purpose is to give 432 students a set of common materials on which to draw during the course, providing a series of examples using R to work through issues that are likely to come up during the semester, and in later work.

In addition, slides and video recordings from each of Professor Love’s lectures, plus other in-class materials will be posted for your use throughout the semester.

Once class begins, you’ll be able to access all materials (including the Course Notes) through the main course website at https://thomaselove.github.io/432-2024/.

6.2 Buy This Book!

You need to buy Jeff Leek’s How to be a Modern Scientist, available electronically through https://leanpub.com/modernscientist. The suggested cost is $10, but you can pay what you want.

You can read the book in about four hours. We will expect you to have done so by the end of January.

6.3 Other Books to Download

There are three additional free books that you will definitely want to obtain during the semester and may be interested in looking at before class begins. Simply visit the links below.

  1. Introduction to Modern Statistics by Mine Cetinkaya-Rundel and Johanna Hardin.
  2. R for Data Science by Garrett Grolemund and Hadley Wickham (first edition). Note that the second edition (work in progress) may be better for our purposes, and is found at https://r4ds.hadley.nz/.
  3. Biostatistics for Biomedical Research by Frank E. Harrell Jr. The related set of YouTube videos can be found here.

Our 432 Sources page links to additional resources.

6.4 Key Articles and Posts

While I will recommend dozens, perhaps hundreds of articles, blog posts and the like to you over the course of the year, these are especially important in 432.

  1. Several of the guides prepared by Jeff Leek and his group, including:
  2. Data Organization in Spreadsheets by Karl W. Broman and Kara H. Woo in The American Statistician, 2018 Special Issue on Data Science, or you can read the PeerJ preprint version.
  3. Project-oriented workflow at tidyverse.org from Jenny Bryan.
  4. From the Ten Simple Rules series at PLOS Computational Biology:
  5. Statistical Inference in the 21st Century: A World Beyond p < 0.05 from 2019 in The American Statistician
  6. The American Statistical Association’s 2016 Statement on p-Values: Context, Process and Purpose.

Don’t forget that our 432 Sources page links to these and additional resources. Professor Love’s class-specific READMEs will provide additional links to recommended readings as the semester goes on.