Coder Social home page Coder Social logo

dotapromap's Introduction

This is a work in progress project mapping all the player histories in Dota 2 liquipedia, as well as recreating the database in mongodb, as it appears there is not one.

Thoughts on what needs to be done

General Notes on project

  • Creating a local instance of mongodb from https://docs.mongodb.com/manual/tutorial/install-mongodb-on-ubuntu/
  • Code only runs on full version of ubuntu 16.04 in virtualbox for some reason
  • Pimpmuckl's askfm link is formatted as ask.fm, which cannot be inserted into mongo as a field name, so it has been edited manually
  • On machine restart, be sure to restart mongod service
  • Throw out all bad dates and record them

Data Quality is very very hard

  • Need to throw out all dates before 2010 in some way

  • Need to verify the format of all dates

  • 1128/(5734/2+1128) = 28.2% of history entries are unreadable, ie bad formatting or ??

  • Establish lists of files that are a problem

  • Not sure why MoonMeander DC line does not parse

Immediate Concerns

  • Establish convention for ?? in data before database entry (this is some bad stuff)
  • Improve data quality/robustness of parsing logic
  • Currently 122 player history entries appear as None due to non standard data representaitons, after accounting for hyphen and dash
  • Take notes of data changes
  • Convert the unchanged dates to ints
  • Rounding starting dates is not working correctly
  • It appears the history_list variable does not keep dates after it encounters a '?'

Good Things To Do

  • May establish black list for personality pages, as they have no player history, may create new db for them
  • Establish refresh convention, check update time on webapges, have cached data to fix data quality errors
  • Not accounting for Dota1/Starcraft histories
  • 22 entires have ???? for years, needs to be accounted for

Things that will probably never get done

  • Fix Birthdays (last priority honestly)

dotapromap's People

Contributors

mikemcgo avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.