franciellevargas Goto Github PK
Name: Francielle Vargas
Type: User
Bio: Ph.D. Candidate in Computer Science - Natural Language Processing
Location: ICMC - University of São Paulo
Name: Francielle Vargas
Type: User
Bio: Ph.D. Candidate in Computer Science - Natural Language Processing
Location: ICMC - University of São Paulo
Multilingual discourse-annotated dataset for fake news detection
1st International Workshop on Deceptive AI @ECAI2020
Portuguese discourse markers
This is a subjective lexicon of baseline emotion (alegria, desgosto, medo, negativo, neutro, positivo, raiva, supresa e tristeza) em Português.
FactNews is the first dataset to predict sentence-level factuality of news reporting. Furthemore, we provide baseline results for sentence-level factuality and media bias predicition in Portuguese. The FactNews is composed of 6,191 annotated sentences by factuality and media bias definitions by AllSides.
Explainable Fact-Checking and News Credibility Verification System for Portuguese
Um dataset com 92 reviews falsas e 92 reviews verdadeiras sobre livros em Português
The Spanish Fake News Corpus contains a collection of 971 news divided into 491 real news and 480 fake news. The corpus covers news from 9 different topics: Science, Sport, Economy, Education, Entertainment, Politics, Health, Security, and Society
This is a dataset for fake news detection research
Repository for the Georgetown University Multilayer Corpus (GUM)
HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection on the web and social media.
Catalog of abusive language data (PLoS 2020)
HausaHate is a benchmark dataset for Hausa hate speech detection task. it was extracted from West African Facebook pages and comprises 2,000 comments annotated according to a binary class (offensive and non-offensive) and hate speech targets (race, gender and none).
A multilingual lexicon of words to hurt.
A multilayer perceptron (MLP) manually developed.
Multilingual Offensive Lexicon consists of the first contextual lexicon for abusive language detection, which is composed of 1,000 explicit and implicit terms and expressions with any pejorative connotation annotated with contextual information
Crawler for Portuguese online news
A Brazilian Portuguese Text Offensiveness Analysis System
An automatic opinion implicit and explicit aspect identification and clustering tool for aspect-based opinion mining / sentiment analysis applications. Opcluster-PT also allows the customization for other languages, being minimally necessary a lexical language resource as such as WordNet, deverbal, foreign, diminutive and enhancing Lexicon. OpCluster-PT is currently customized for Brazilian Portuguese.
Automatic identification and clustering of opinion explicit and implicit aspects in product reviews for Portuguese language
Fine-grained opinion identification and polarity classification of Covid-19 tweets in Portuguese.
Ontologies of aspects- groups of (hierarchically organized) explicit and implicit opinion aspects for supporting opinion mining and text summarization tasks, including the domains of smartphones, digital cameras and books, in OWL format.
The SentiAspect-pt comprises 180 product reviews annotated according to implicit and explicit fine-grained opinions, which were hierarchically organized for aspect-based sentiment analysis and opinion summarization applications.
SSA is a post-hoc explanation method by stereotypes and counter-stereotypes to assess social bias in hate speech classifiers
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.