rehamaltamimi / wikipedia-map-reduce
Automatically exported from code.google.com/p/wikipedia-map-reduce
GraphParser's readGraph() method returns an ArrayList of Links. Is the Graph
class not necessary?
Original issue reported on code.google.com by [email protected]
on 4 Jun 2010 at 11:22
getContributors() uses a HashSet to weed out duplicate contributors, but it
doesn't add all, even when they are non-unique.
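The cause is not stated in this issue. One thing worth checking in any HashSet-based deduplication is whether the element type overrides both equals() and hashCode() consistently, since a HashSet relies on both to decide what counts as a duplicate. A minimal sketch, assuming a hypothetical Contributor class keyed on a single id string (neither the class nor the helper below is taken from the project):

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for whatever type getContributors() collects.
final class Contributor {
    private final String id;

    Contributor(String id) {
        this.id = id;
    }

    String getId() {
        return id;
    }

    // Two contributors are the same if their ids match.
    @Override
    public boolean equals(Object other) {
        if (this == other) return true;
        if (!(other instanceof Contributor)) return false;
        return id.equals(((Contributor) other).id);
    }

    // Must agree with equals(), or the HashSet will misplace entries.
    @Override
    public int hashCode() {
        return id.hashCode();
    }
}

final class ContributorSets {
    // Collapses duplicates by copying the list into a HashSet.
    static Set<Contributor> unique(List<Contributor> all) {
        return new HashSet<>(all);
    }
}
```

With this in place, every distinct id ends up in the set exactly once. If equals() and hashCode() disagreed, entries could be dropped or duplicated in ways that look like the behaviour described above.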
Original issue reported on code.google.com by [email protected]
on 6 Jun 2010 at 11:10
User info can be gathered from <revision> data or from <article> data. If
it's from <revision> data, we have either a uID/Name pair or just an IP. If
it comes from <article> data, we have a Name or IP and an aID, but no uID.
Original issue reported on code.google.com by [email protected]
on 4 Jun 2010 at 11:08
Need to refactor code to use new User subclasses.
Original issue reported on code.google.com by [email protected]
on 21 Jun 2010 at 7:44
Check all single-argument User() instantiations for validity.
Original issue reported on code.google.com by [email protected]
on 4 Jun 2010 at 11:32
Should be refactored to a wrapper class for a list of edges. May not be
needed in the end anyway, but it should still happen.
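The issue text does not name the class. If it is the Graph class mentioned alongside GraphParser, a wrapper around a list of edges might look roughly like this. It is a minimal sketch: the Link fields (from, to) and all Graph method names are assumptions, not the project's actual API.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical edge type; the real Link class may carry other fields.
final class Link {
    final String from;
    final String to;

    Link(String from, String to) {
        this.from = from;
        this.to = to;
    }
}

// Thin wrapper so callers depend on Graph rather than on a raw ArrayList.
final class Graph {
    private final List<Link> edges;

    Graph(List<Link> edges) {
        // Defensive copy: later changes to the caller's list do not leak in.
        this.edges = new ArrayList<>(edges);
    }

    void addEdge(Link link) {
        edges.add(link);
    }

    int edgeCount() {
        return edges.size();
    }

    // Read-only view; callers cannot modify the graph through it.
    List<Link> getEdges() {
        return Collections.unmodifiableList(edges);
    }
}
```

Under this design, a parser method such as readGraph() could return a Graph instead of an ArrayList of Links, which would keep the edge representation free to change later.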
Original issue reported on code.google.com by [email protected]
on 4 Jun 2010 at 11:29
One of the main issues with user-based analysis of Wikipedia is that it's hard
to capture all user information. In our system, if users don't have IDs
attached to them, we have to ignore them; furthermore, we can't get information
on a registered user in the first place unless they edit a page. We can't rely
on User: namespace pages, because not every user's User: page is defined.
This issue is a first gathering place for thoughts on how best to handle this
problem. If some sort of consensus emerges, we'll move everything
over to a Wiki page.
Original issue reported on code.google.com by [email protected]
on 24 Jun 2010 at 7:23
Refactor calls to Revision() to include new boolean parameter.
Also create isVandalism handling in PageParser.
Original issue reported on code.google.com by [email protected]
on 4 Jun 2010 at 11:35
Each time a user is created, it can be done using `User(String id)` or
`User(String name, String id)`. For now, if the user only has an IP, not a
uID, the IP is treated as the user's ID. Users should have either a uID/Name
pair *or* an IP to be identified by.
We could have a User superclass with RegisteredUser and AnonymousUser subclasses.
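A rough sketch of how that hierarchy could look. Only the three class names come from this issue; the method names (getIdentifier, isRegistered, getName) and the constructor shapes are assumptions. The sketch covers only the case of a uID/Name pair or a bare IP.

```java
import java.util.Objects;

// Common base type: every user is identified by exactly one string,
// either a uID (registered) or an IP address (anonymous).
abstract class User {
    abstract String getIdentifier();

    abstract boolean isRegistered();

    // Users are equal when they are the same kind and share an identifier,
    // so a registered uID never collides with an identical-looking IP string.
    @Override
    public boolean equals(Object other) {
        if (this == other) return true;
        if (other == null || getClass() != other.getClass()) return false;
        return getIdentifier().equals(((User) other).getIdentifier());
    }

    @Override
    public int hashCode() {
        return Objects.hash(getClass(), getIdentifier());
    }
}

// Registered users carry a uID/Name pair.
final class RegisteredUser extends User {
    private final String name;
    private final String id;

    RegisteredUser(String name, String id) {
        this.name = Objects.requireNonNull(name);
        this.id = Objects.requireNonNull(id);
    }

    String getName() {
        return name;
    }

    @Override
    String getIdentifier() {
        return id;
    }

    @Override
    boolean isRegistered() {
        return true;
    }
}

// Anonymous users are known only by IP.
final class AnonymousUser extends User {
    private final String ip;

    AnonymousUser(String ip) {
        this.ip = Objects.requireNonNull(ip);
    }

    @Override
    String getIdentifier() {
        return ip;
    }

    @Override
    boolean isRegistered() {
        return false;
    }
}
```

Separate constructors make the single-argument case explicit: `new AnonymousUser(ip)` can only ever mean an IP, so an IP no longer has to double as a uID.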
Original issue reported on code.google.com by [email protected]
on 4 Jun 2010 at 11:05