Comments (11)
I think that depends on the metric. I don't think "contributors" would be far less useful without having usernames to further explore a user's timeline of contributions, but "committer_locations" would probably be just as useful.
Either way, we are only using public data so I don't think there's a huge privacy risk. Users who are concerned about privacy probably don't use personally identifiable information on GitHub.
from augur.
Good point about the ability to drill down to how each metric is informed by the data. Maybe we can limit the level of detail though to avoid exposing usernames. This would be an ethical decision.
A concern beyond privacy is how the data could be used against contributors, for example for job performance evaluation. There will be many things contributors do that cannot be reliably captured through our metrics and thus any conclusion drawn about contributors will be skewed. We do not want to provide a tool that incentivises contributors to think about gaming the metrics which distracts from meaningful contributions.
The request is about staying abstract from individual contributors.
from augur.
I think that the metrics that operate at the individual level are useless without usernames, if we anonymize the data only the aggregate metrics will be usable. I think that it's useful for projects to be able to understand how individuals are contributing, and our tools will help make individuals who contribute in ways other than committing more visible
from augur.
If on the call it sounded pretty certain that we want to anonymize it though it can certainly be done
from augur.
Yes, during the 2017-04-08 call, several people voiced concerns with measuring individual users.
The meeting minutes from 2017-04-18 read:
We might not create metrics that are human centered
I acknowledge that I wrote those minutes and maybe someone else can chip in how they understood what we talked about during that call.
from augur.
Sounds good! We'll anonymize or aggregate the metrics that currently return usernames.
from augur.
from augur.
Is this done?
from augur.
Is this done?
I don't know if this will ever be done because I think it can serve as a constant reminder.
Does GHdata currently comply with this issue: yes, we could close it.
from augur.
Yes and no - Yes because GHData's frontend does not display any personally identifiable information. No because GHData has API requests that when made with the "raw" parameter will return all of the rows relevant to a given query, one of which is names (if the data source is GitHub). GHData visualizations will never use names, so I would feel comfortable closing it.
from augur.
We do not expose individual users. The API provides information about users, bur our front end does not expose information about useres.
from augur.
Related Issues (20)
- Create an Admin set of Groups HOT 8
- Dealing with individual repos added .. and then a repo group later. HOT 1
- Create a pretty 500 error response on front end HOT 2
- add multiple repos and groups in a return separated list HOT 1
- table relationship question
- New Dependency Language Parsers HOT 1
- Explicitly represent platform organizations in Augur
- Explore incorporatating softcite/softcite_kb data into Augur for Academic Metrics HOT 1
- "Hello World" for DBT Analytical Transformation Workflows HOT 2
- Repo path in seed data is not in the correct format
- Augur tries to purge celery and redis when collection is disabled
- Toasts pop up behind repo tracker div HOT 1
- Raise exception when API key is out of calls during the addition of a repo or org HOT 1
- Collection Weight doesn't penalize already collected repos until 1 day later HOT 1
- Secondary tasks never run when repo count is small HOT 1
- Facade incorrectly parses repo name for .github repos HOT 1
- Repo set to initializing forever HOT 1
- Augur View Registration Form Behavior HOT 5
- Convert datetime fields to datetimetz where github returns data in that format HOT 1
- Documentation of Login API HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from augur.