Coder Social home page Coder Social logo

cityofaustin / hack-the-traffic Goto Github PK

View Code? Open in Web Editor NEW
22.0 10.0 9.0 5.45 MB

A Transportation-Themed Hackathon hosted by the City of Austin and the UT Center for Transportation Research

austin transportation open-data bluetooth-sensors traffic

hack-the-traffic's Introduction

ATX Hack the Traffic

Hack the Traffic Logo

ATX Hack the Traffic is a hackathon series that seeks to use data and technology to solve Austin's transportation challenges. The project is a City of Austin initative carried out in parternship with the UT Center for Transportation Research.

Our next event will be at ATX Hack for Change on June 2nd. Visit our Open Source Traffic Sensor repo for details about the Hack for Change project.

Event Schedule

Talk to Us

Catch up with us on Open Austin's Slack channel #HackTheTraffic.

Email us at [email protected].

hack-the-traffic's People

Contributors

itamargal avatar johnclary avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

hack-the-traffic's Issues

Retaining enough of the MAC address to determine device vendor

First, thanks so much for releasing this data! It's an amazing dataset and I anticipate that we only touched the surface of potential solutions it enables at the event yesterday.

One problem I discussed with a few other people at my table was distinguishing cars from bicycles from pedestrians. We could draw thresholds based on the assumption that cars travel faster than bikes, which travel faster than pedestrians, but this is not always true: during rush hour bikes might outstrip cars; in rush hour + accident, pedestrians might even win.

While cars / bikes / peds might all be carrying iPhones, there are likely some devices that are clear indicators of a specific category, like a smart dashboard system in a newer car.

For this purpose, it would be useful to retain the first three octets of the original MAC Addresses, which indicates the vendor of the device, and only anonymize the subsequent octets.

Having even limited vendor cues could be useful for bootstrapping a clustering algorithm to automatically distinguish (to some degree) between cars / bikes / peds.

data.austintexas.gov (Socrata) vs. data.world

  • Which one should be preferred for correctness / timely updates?
  • Are the csv files as available through data.world the 'raw' form that is fed into Socrata and thence only available by the usual Socrata API / export features?

And if relevant, can you briefly describe the ETL process and anticipated frequency of updating those data sources with the latest sensor readings?

MAC address anonymization procedure documentation

Related to #9, I have a few questions about the MAC addresses that appear in the dataset. Can you explain / describe:

  • How are the anonymized addresses generated, precisely?
  • If I see, for example, 8f:bd:9e:e9:18, in one day's data, will that be the same device as every other 8f:bd:9e:e9:18 in the rest of the data?
    • If not, what are the boundaries of identity?
  • Why are the (anonymized) MAC addresses five octets instead of the full six?
  • How is the matching performed?
    • If the sensors themselves truncate the last octet, is the matching process going to confuse similar devices? (it's probably rare, given the size of the MAC address space, but possible)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.