snoww / twitchoverlap Goto Github PK

View Code? Open in Web Editor NEW

44.0 44.0 5.0 1.38 MB

stats.roki.sh, polls twitch every 30 minutes to calculate intersection of all channels above 1,000 viewers

Home Page: https://stats.roki.sh

License: MIT License

C# 58.21% CSS 0.81% JavaScript 0.66% TypeScript 40.33%

chatters community graphs intersections twitch twitch-api visualization

twitchoverlap's People

Contributors

Stargazers

Watchers

Forkers

illuminautical gangchains meadowsd13 askepticaldreamer retrobricker

twitchoverlap's Issues

Improving similarity measure

Hi,

I actually did something similar to this a while back at https://channel-similarity.johnpyp.com/ but it didn't generate much interest, presumably due to among other things a lack of visualization which is a core part of this project and which makes it cool and interesting to look at and is well executed.

However, I do think that the similarity measure that I used is better in the sense of capturing similarity between channel communities, and could be implemented here without too much issue. Mathematically what I did is outlined at https://channel-similarity.johnpyp.com/details but it essentially boils down to a couple of differences from what you currently have going on right now:

The weight of a viewer should be normalized according to the number of channels that they're in. The reason behind this is that we want the relative weight of that user to be determined by how much of the percent of their viewing is dedicated to that channel.
For example: channel A and channel B sharing a viewer that ONLY views these two channels on the entire site should account for more than if channel A and channel B happen to share Nightbot that's present on a large chunk of channels on the site. Currently their relative weight for similarity is the same.
(this should be fairly simple to implement, doesn't require any scraping changes, and can also be used for the realtime channel page view)
The weight of a viewer should be determined not only by if they happened to be in that channel during the time period collected, but by the amount of time spent there (i.e the number of scrapes they appeared in).
An example of the shortcoming of the current approach: if channel A happens to host a channel B during the period collected, then all those chatters appearing momentarily in channel B's chat currently provide as much weight to similarity as chatters that spend long periods of time in both of these channels.
(this could require scraping changes to store these values)

Similarity over time for channel pages

It'd be nice to be able to view the ranking of similar channels over time (say, the same period as the Atlas) from within the channel page/view.

Venn Diagrams, user selection of streamers to compare etc.

The system you have built is impressive. I just wish it used Venn Diagrams for representation. Eg. If I select 2 streamers - Bugha, Clix and BuckeFPS, then I can get the overlap between these three streamers, unique viewers that only watch one of these streamers, the ones that watch only 2 out of the other 2 streamers and then a universal set of users(also include the major streamers that fall in the universal set, that don't watch any of the three streamers).

Names missing from probability list of overlapping streamers

Maybe this isn't a bug, so this might just be a question on my part, but why are some streamer's names missing from the list of overlapped streamers? is it because there isn't enough overlap for them to warrant being on the list below the graph?

for example, Hirona has ~400 overlapping chatters to RatedEpicz channel at one point according to this graph:

yet, Hirona is not listed in the probability list below, at any percent. why is this? is the list below the graph, probability based on chatting trends over the past month or something? so a single day isn't enough to matter?

a 2nd example is Hasan overlapping with moonmoon. There's 1600 overlapping chatters, but Hasanabi is not listed in the probability list.

this site is awesome, i'm just trying to better understand the data in front of me.

oh and 2nd question, what timezone are the graphs using? is it my local timezone or a set one?

snoww / twitchoverlap Goto Github PK

twitchoverlap's People

Contributors

Stargazers

Watchers

Forkers

twitchoverlap's Issues

Improving similarity measure

Similarity over time for channel pages

Venn Diagrams, user selection of streamers to compare etc.

Names missing from probability list of overlapping streamers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent