Skip to content

Repo for a draft book on social data science methods with R

License

Notifications You must be signed in to change notification settings

datalorax/sds-r

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Netlify Status

Social Data Science with R

This is basically course notes that may eventually turn into a book. It's a collaborative effort with Brendan Cullen and Ouafaa Hmaddi and we'll see where it goes. The plan is to have multiple chapters of interactive exercises where you won't have to always leave the book to try things out. I'll update this README as we go and content is developed to provide a better overview of what we're covering (and link to any material we have created).

Code of Conduct

Please note that the sds-r project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.

Book outline

Section 1: Foundations

Getting Started
Producing your first plot
Basic R Markdown
Basic data wrangling
Types of data
Tidy data
Joins
Collaborating with git and GitHub
A collaborative exploratory data analysis example
Collaborating with git and GitHub A collaborative exploratory data analysis example

Section 2: Data visualization & Communication

Introduction to visualizations
Visual perception
Color
Refining your plots
Geographic data
Visualizing uncertainty
Tables & fonts
Websites in R Markdown
Flex dashboards
Shiny

Section 3: Functional Programming

Data Types
Iteration
Batch load and processing dat
List columns
Parallel iterations
Writing functions
Package development

Section 4: Machine Learning

Inference vs. Prediction
Ethics in Machine Learning
Cross validation
Cloud computing
Extending lm: Ridge, Lasso, Elastic net
Feature engineering
K-nearest neighbor
Decision trees
Bagged trees & Random forests
Boosted Trees
Model Stacking

About

Repo for a draft book on social data science methods with R

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Contributors 4

  •  
  •  
  •  
  •