RR Skills: Data Manipulation

Andrey Koval
October 21, 2014

The Laboratory for Integrative Lifespan Research
Department of Psychology
University of Victoria

Overview of the Series

  • Oct 14 – Introduction to Reproducible Research
  • Oct 21 – RR Basic Skills (1): Data Manipulation
  • Oct 28 – Intro to Latent Class and Latent Transition Models
  • Nov 4 – RR Basic Skills (2): Graph Production
  • Nov 11
  • Nov 18 – RR Basic Skills (3): Statistical Modeling
  • Nov 25 – RR Basic Skills (4): Dynamic Reporting
  • Dec 2 – Migrating into R from other Statistical Software

Recap: Introduction to RR

Toolbox

  • R
  • RStudio
  • GitHub

Skillset

  • Data Manipulation
  • Graph Production
  • Statistical Modeling
  • Dynamic Reporting

Traditional

[Figure: traditional]

Reproducible

[Figure: reproducible]

Goals of RR

  • Ultimate: Answering a research question
  • Practical: Publishing a manuscript
  • Technical: Producing a dynamic document

TODAY

Making “messy” data “tidy”

  1. Each variable forms a column
  2. Each observation forms a row
  3. Each type of observational unit forms a table

See Hadley Wickham's paper "Tidy Data" (Journal of Statistical Software, 2014)
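
As a rough illustration of the first two rules, here is a minimal sketch in base R. The data frame `messy`, its column names, and its values are all made up for this example; they are not from the workshop data. The "messy" table stores one variable (the year) in its column names, and the "tidy" version moves it into a column of its own so that each row is one person-year observation.

```r
# Hypothetical "messy" (wide) data: one row per person,
# with the year of measurement hidden in the column names.
messy <- data.frame(
  id         = c(1, 2, 3),
  score_2012 = c(10, 12, 9),
  score_2013 = c(11, 14, 8)
)

# Tidy (long) form: each variable (id, year, score) is a column,
# and each row is a single observation of one person in one year.
tidy <- data.frame(
  id    = rep(messy$id, times = 2),
  year  = rep(c(2012, 2013), each = nrow(messy)),
  score = c(messy$score_2012, messy$score_2013)
)

# Sort by person and year, then reset the row names.
tidy <- tidy[order(tidy$id, tidy$year), ]
rownames(tidy) <- NULL

print(tidy)
```

This sketch stacks the columns by hand to keep every step visible. In practice a reshaping function (for example base R's `reshape()`) would likely do the same job with less typing.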

Big Picture

[Figure: big picture]