20
data wrangling in R @cjlortie

Data wrangling in R: be a wrangleR

Embed Size (px)

Citation preview

Page 1: Data wrangling in R: be a wrangleR

data wrangling in R

@cjlortie

Page 2: Data wrangling in R: be a wrangleR

more than half the battle

Page 3: Data wrangling in R: be a wrangleR

benefit of wrangling in R versus Excel

Page 4: Data wrangling in R: be a wrangleR

reproducibility

transparent

Page 5: Data wrangling in R: be a wrangleR

know thy data

Page 6: Data wrangling in R: be a wrangleR

values, variables, and observations

Page 7: Data wrangling in R: be a wrangleR

tidy your data with variables in columns

vectors of values are easier to work with in R

Page 8: Data wrangling in R: be a wrangleR

column headers should be variable names

Page 9: Data wrangling in R: be a wrangleR

no special signs such as $#*/ within dataframe

Page 10: Data wrangling in R: be a wrangleR

one variable per column and one class

Page 11: Data wrangling in R: be a wrangleR

one observational unit per table

Page 12: Data wrangling in R: be a wrangleR

missing values

happen

Page 13: Data wrangling in R: be a wrangleR

is.na()na.omit

na.rm=TRUE

Page 14: Data wrangling in R: be a wrangleR

simplify & selections from dataframes

Page 15: Data wrangling in R: be a wrangleR

drplyr

Page 16: Data wrangling in R: be a wrangleR

select for columns

Page 17: Data wrangling in R: be a wrangleR

filter for rows

Page 18: Data wrangling in R: be a wrangleR

summarise & mutate

Page 19: Data wrangling in R: be a wrangleR

%>% to build logical wrangling pipelines

Page 20: Data wrangling in R: be a wrangleR

tidy data semantics from Journal of Statistical Softwarereading such as ‘tidy data’ idea paper