Coder Social home page Coder Social logo

uwescience / datasci_course_materials Goto Github PK

View Code? Open in Web Editor NEW
915.0 332.0 2.4K 75.87 MB

Public repository for course materials for the Data Science at Scale Specialization at Coursera

Home Page: https://www.coursera.org/learn/datasci

Python 4.53% Java 23.84% R 0.20% HTML 69.81% PigLatin 1.62%

datasci_course_materials's Introduction

datasci_course_materials

Public repository for course materials for the Data Science at Scale specialization offered by Coursera and the University of Washington.

datasci_course_materials's People

Contributors

billhowe avatar ilirmaci avatar jeff-winchell avatar ksuszka avatar sachu avatar t101jv avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

datasci_course_materials's Issues

This repository is over its data quota

When cloning the repository:

Downloading capstone/blight/data/detroit-311.csv (6.2 MB)
Error downloading object: capstone/blight/data/detroit-311.csv (542a42d): Smudge error: Error downloading capstone/blight/data/detroit-311.csv (542a42df00192aec8198bf895aad6cdf4ddab3b4c00242af2dfec09322fc42bf): batch response: This repository is over its data quota. Purchase more data packs to restore access.

Errors logged to /home/lieu/dev/datasci_course_materials/.git/lfs/logs/20190722T100133.191553284.log
Use `git lfs logs last` to view the log.
error: external filter 'git-lfs filter-process' failed
fatal: capstone/blight/data/detroit-311.csv: smudge filter lfs failed
warning: Clone succeeded, but checkout failed.
You can inspect what was checked out with 'git status'
and retry the checkout with 'git checkout -f HEAD'

Assignment 1 for Windows

I'm putting this here for posterity. No real action required.

Issue Summary:
Windows users not using the course's image may encounter an infuriatingly frustrating issue if they out their file using the command:

python twitterstream.py > output.txt

This defaults the encoding Windows-1252 and the output looks something like:

This will inevitably cause JSON parsing to fail.

Solution:

In Powershell, pipe in Out-File and force UTF-8. So something like:

python twitterstream.py | Out-File -Encoding "UTF8" output.txt

In command prompt, you actually need to convert the default encoding temporarily:

chcp 65001

Then run the command as normal. Then convert back:

chcp 1252

Assignment 1 - twitterstream.py

Sorry if this isn't the correct way to do this, I'm new to GitHub. The twitterstream.py needs to have the fetchsamples() function updated. The url defined in the function is an old url. Per the post below, this is needs to be updated as follows:

Current: url = "https://stream.twitter.com/1/statuses/sample.json"
Should be: url = "https://stream.twitter.com/1.1/statuses/sample.json"

Post showing new URL: https://twittercommunity.com/t/retiring-old-v1-streaming-api-endpoints-on-march-31st-2016/60351

Assignment 2 - Part d - mismatch in instructions

Assignment materials in github does NOT match the online posted instructions for Data Manipulation course.

Online instructions
https://www.coursera.org/learn/data-manipulation/programming/AxbQn/twitter-sentiment-analysis

(d) count: Write a SQL statement to count the number of unique documents containing the word "law" or containing the word "legal" (If a document contains both law and legal, it should only be counted once)

github instructions:

https://github.com/uwescience/datasci_course_materials

datasci_course_materials/assignment2.html

(d) count: Write a SQL statement to count the number of documents containing the word "parliament"

The grader is looking for counts of law/legal not parliament. Therefore, online instructions are the ones to use if there is a discrepency.

Also typo in Problem 2

Part (g) matrix shows 0 0 -3 but the matrix is showing row at 2,2,-3.

Update of assign1 twitterstream.py needed for py3

Since python 3 is now standard for most environments, it would be helpful to have the twitterstream.py file updated to use urllib instead of urllib2. i've spent a few hours on this and still can't get it to work.

Thank you,
Jennifer

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.