Coder Social home page Coder Social logo

wikipedia-map-reduce's People

Contributors

andbristol avatar colin-welch avatar nathanielmiller20 avatar shilad avatar

Watchers

 avatar

wikipedia-map-reduce's Issues

User identification dependent on source of user data

Users info can be gathered from <revision> data or from <article> data.  If 
it's from <revision> data, we have either a uID/Name pair or just an IP.  If 
it comes from <article> data, we have a Name or IP and an aID, but no uID.

Original issue reported on code.google.com by [email protected] on 4 Jun 2010 at 11:08

Accessing ALL users, not just those with activity

One of the main issues with user-based analysis of Wikipedia is that it's hard 
to capture all user information.  In our system, if users don't have IDs 
attached to the, we have to ignore them; furthermore, we can't get information 
on a registered user in the first place unless they edit a page.  We can't rely 
on User: namespace pages, because not every user's User: page is defined.

This issue is a first gathering place for thoughts on how best to handle this 
problem.  If some sort of consensus starts to be had, we'll move everything 
over to a Wiki page.

Original issue reported on code.google.com by [email protected] on 24 Jun 2010 at 7:23

isVandalism not being set

Refactor calls to Revision() to include new boolean parameter.

Also create isVandalism handling in PageParser.

Original issue reported on code.google.com by [email protected] on 4 Jun 2010 at 11:35

User identification based on uID/Name pair or IP

Each time a user is created, it can be done using `User(String id)` or 
`User(String name, String id)`.  For now, if the user only has an IP, not a 
uID, that is treated as the users ID.  Users should have either a uID/Name 
pair *or* an IP to be identified by.

Can have User superclass with RegisteredUser and AnonymousUser subclasses.

Original issue reported on code.google.com by [email protected] on 4 Jun 2010 at 11:05

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.