hatespeech's People
Forkers
shashankg7 saumopal97 vrmpx iridiva pratiknarang wachmann guptaarn zewdie2010 ollmor akasunic ji1kang decpaul bdouralzeer frankey419 gsig123 sjnorth yrachel dennissun1 aditi138 seafire1991 kamalravi jonas04 didwiz deepanwitadatta iamweiweishi nazaninsa tonydeep preesee nidhimoore phamtuanmis cozek jworsf01 divyanshu1994 dhavalpotdar naushadzaman minuszer0 huynhduchuydp36 akshayparseja anabel19 dingyunxia vicbotce chhavijain2212 pastatimes molaeiali mmarabella ajfrai scottspace ramashelke 100rabh1401 doehae chatsdude fscavone1 chenjing-825 sidneykung jxyjxyjxy ji-xiaoyu henhhalpert jaqujaqu lsussman1 jushladnik meizhang101 tasneem94 zanasaed lei56 habibi-zz bhup20 djwei96 guoanhu12146 aad3sh kennenvi ben-a-21 hweber01 harshsinghs1058 harshilpatel99 semaahmed shahriarshayesteh davidnol hibalubbad thelaurenxavier aqhali park-jay pamelaferreiralimahatespeech's Issues
.csv file dataset can't be downloaded
@zeeraktalat I am a final year student of Computer Science at Jadavpur University . As a final year student I have chosen Hate Speech as my dissertation topic . So it is very important for me to get the dataset of fully hydreated english tweets of this repo . But i am not able to download it . So , if you provide me the csv file I will be greatful to you .
Same tweetids but with different labels
Hi,
I have found many tweets ids which are available in both datasets (NAACL_SRW_2016 and NLP+CSS_2016). Some of these tweet ids are
572342978255048705
572341498827522049
Both tweet ids are labelled with "Racism" in NAACL_SRW_2016 however different label in NLP+CSS_2016 ("neither or sexism").
Please advise tackling this issue while using this dataset for classification.
Thanks,
Piush Aggarwal
cannot find the annotations.tsv
I cannot find the annotations.tsv file which you guys had mentioned in your readme.md. Could you provide me the link to download the annotations?
Can't access the tweets
Hi,
I am doing a post-graduation project and I want to use this dataset but I couldn't access the full dataset. Can you please provide the full dataset with the tweets. That would be very helpful for my project.
Thank you.
annotations.tsv file missing
Dear researchers/developers
I cannot find the annotations.tsv file which you guys had mentioned in your readme.md. Could you provide me the link to download the annotations?
How to get the full twitter ?
I do not know how to get the original twitter with ids, could you share the complete twitter?
@wvs2 if you e-mail me I can send you the data. :)
@wvs2 if you e-mail me I can send you the data. :)
Originally posted by @ZeerakW in #2 (comment)
Most tweets are not accessible
While trying to fetch tweets from both files, this is what I receive as a response for most tweets:
Twitter Error [200] : [{"errors":[{"value":"551659627872415744","parameter":"ids","resource_type":"tweet","section":"data","title":"Authorization Error","detail":"Sorry, you are not authorized to see the Tweet with ids: [551659627872415744].","type":"https://api.twitter.com/2/problems/not-authorized-for-resource"}]}]
Is there any other way to get access to the data?
Thanks
Tweets not available through API, some marked as both "racist" and "none"
Hello,
I am currently working on a similar project at university, using your data and paper as a comparison. I have tried to fetch all tweets from NAACL_SRW_2016.csv
but a lot of tweets, in particular the racist ones, have been removed. Is there perhaps an offline version containing the tweets?
Another problem is that some tweets in the file are marked either both sexism and none or racism and none. These are not many, but does cause issues. An example is id 572340476503724032. Do you have any solution for these?
Thank you!
Best,
Filip
Missing Tweets in dataset
Hi, I'm doing a research project and I got only a few thousand tweets using twitter API. I think this is because user deleted that tweets or twitter removed them. So, where can I get the file with the downloaded tweets?
Thanks in Advance
Provide dataset
you could make a copy of the data available because it has errors to recover?
Regarding file NLP+CSS_2016.csv
Hey,
Thanks for providing the datasets!
For the file NLP+CSS_2016.csv, opening it using excel or using any default delimiter seems to put most of all labels from "amateur" participants under the first three columns. Were most of the posts labelled by participants represented by these columns, or is there a specific delimiter I need to use to see which participant labelled which post?
Thanks,
Vijay
Was this training data set labeled manually?
Hi,
I am working on an assignment which involves multiple topics, i.e. racism, profanity, alcohol abuse, etc. Creating positive and negative training data sets for binary classification is very time consuming for each topic. Is there any solution to minimize human intervention to label tweets once creating training data sets?
Full Data
Hello,
Can you please share the full dataset? I need the data for my research where I can compare the deleted tweet.
Labels
Hey, the paper mentions that data will be uploaded in ids and labels, but there are only IDs here? Most actual hate tweets have been taken down now & the IDs left are misclassified more often than not. Do you still have the label versions?
Twitter texts not available through twitter API as they were deleted by twitter
Hi, most of the ids or the tweets were deleted by twitter as they were against their policies. If you have those tweet texts please send or upload them
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.