Coder Social home page Coder Social logo

r-for-data-science's Introduction

title author
Materials for D-Lab's R for Data Science
Dillon Niederhut

This repository contains the instructor materials for the D-Lab's R intensive.

If you are a student:

You can download the contents of this repository with:

git clone https://github.com/dlab-berkeley/r-for-data-science.git

or, by clicking the "Download Zip" button and then extracting the .zip file.

The instructor of this workshop series will lead you through the activities for each day.

If you are a D-Lab instructor

You'll see accumulated teaching notes and examples for each day's topics in the instructor folder. For your convenience, these are available as .Rmd, commented .R files, PDF documents, and HTML slides. The meta-document for this workshop series, which explains the logic behind the structure and topics, can be viewed at the D-Lab guides repository

For information on contributing to this repository, see CONTRIBUTING.md

If you are a D-Lab facilitator

The standard Drupal workshop descriptions and facetweet postings for this workshop series are in PUBLICITY.md

Description

  • data/ : data necessary for interactive coding examples
  • examples/
    • save_console_output.R : R code for saving console output to pdf
  • instructor/ : teaching notes
  • scripts/
    • feedback_cleaner.R : used to clean data for use in Day 3
    • regenrate_files.R : for regenerating .R and .pdf files from .Rmd

Topics:

This workshop series covers:

  1. Interacting with R
  2. Datatypes
  3. Data structures
  4. Reading data
  5. Sanitizing data
  6. Missing data
  7. Reshaping data
  8. Summary statistics
  9. Plotting
  10. Linear models
  11. Non-parametric models
  12. Functions
  13. Loops
  14. Parallelization
  15. Packages

Libraries

This workshop uses the following packages:

  • Amelia
  • devtools
  • dplyr
  • foreign
  • ggplot2
  • parallelMap
  • RCurl
  • roxygen2
  • stringr
  • tidyr
  • XML

D-Lab == Data Intensive Social Science, For All!

r-for-data-science's People

Contributors

deniederhut avatar schoi05 avatar henchc avatar rochelleterman avatar

Stargazers

maura+mcdonagh avatar  avatar Paola Gabriela Villa Paro avatar  avatar Che avatar Rafael de Mendonça avatar Mathias Gibson avatar Somkiat Puisungnoen avatar Reid Whitaker avatar Perumal Kabali avatar Sustainable John avatar Evan Muzzall avatar Renata Barreto avatar Travis Taylor avatar Sergio Castellanos avatar  avatar Nico Tripcevich avatar Bill Eger avatar Jong-Kai Yang avatar  avatar Sebastian Barfort avatar Chris Kennedy avatar

Watchers

Sebastian Benthall avatar James Cloos avatar Patty Frontiera avatar Jack Burris avatar Jon D Stiles avatar Guadalupe Tuñón avatar Nick Adams avatar  avatar Juan Shishido avatar  avatar  avatar  avatar  avatar

r-for-data-science's Issues

Re-smooth discussion flow

Days with lots of modifications feel a little choppy in terms of jumps between topics and examples.

Add staff warning to workshop description

University employees who attend the workshop with university-provisioned Windows laptops are running into permission issues when trying to install libraries. There should be a warning somewhere that they will need to have their sysadmins install R and RStudio with the Library path set to a non-protected location.

Review of Day Two

Glad to see modern libraries covered. I think it's useful to show the figure from Hadley's dplyr workshop when you introduce dplyr. His paper with nice figures for this is here.

It could also be nice to introduce tidyr, and I'd point people towards reshape2, available on cran as reshape2.

data/dirty.csv character encoding issue

On computers with Chinese character encoding sets, the ' in "5'9" in data/dirty.csv expands to an incorrect bytestring when imported into R with read.csv.

regenerate_files.R evaluates code in text files

Specifically, running the script seems to call less in the terminal during the parts of the code that demonstrate pulling documentation, e.g. ?exists causes the terminal to enter:

exists                  package:base                   R Documentation

Is an Object Defined?

Description:

     Look for an R object of the given name and possibly return it

Usage:

     exists(x, where = -1, envir = , frame, mode = "any",
            inherits = TRUE)

     get0(x, envir = pos.to.env(-1L), mode = "any", inherits = TRUE,
          ifnotfound = NULL)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.