Coder Social home page Coder Social logo

konstantinschulz / covid19-fact-checking Goto Github PK

View Code? Open in Web Editor NEW

This project forked from elip06/covid19-fact-checking

0.0 0.0 0.0 15.8 MB

A fact-checking system of short to medium-sized documents on the topic of COVID-19

Jupyter Notebook 57.88% Python 24.49% JavaScript 0.91% HTML 0.85% Vue 12.59% TypeScript 0.95% Shell 0.43% Dockerfile 1.88%

covid19-fact-checking's Introduction

A two-step fact-checking procedure for medium-sized text documents in English language on the topic of Covid-19

Particularly during a time of a global pandemic, it is crucial to find efficient ways of determining the credibility of information. From fake news to conspiracy theories, it is hard to fight the “infodemic” which eventually makes the pandemic even more dangerous. Currently, fact-checking websites such as Snopes, FactCheck.org, etc. perform manual claim validation from articles, speeches, or even social media posts from known figures, etc. Of course, they cannot cover all the dubious claims that can be found on the internet, as they focus mainly on the ones that "go viral". For the general user, however, it is impossible to fact check every single statement or sentence on a specific topic he finds on the internet. A lot of research has been invested in both claim verification and fact-check-worthiness, but there is no work done yet on the detection and extraction of dubious claims, combined with their fact checking using external information sources, such as knowledge graphs and knowledge bases, especially on the COVID-19 domain. The solution to this problem is a two-step claim verification procedure, consisting of sequence classification using Transformer models and fact checking using the Google Fact Check Tools. The goal of this work is to develop a high-performance component for fact checking of small- to medium-sized documents in English language on the topic of COVID-19.
We curate a dataset from existing COVID-19 related datasets and perform multiple preprocessing steps to ensure all the data is uniform. The data pre-processing procedure and the final datasets can be found here.
The training code for 4 Transformer models and a simple LSTM (as a baseline) is available here.
For a quick tutorial on how to start and use the fact-checking app, go to the Streamlit app directory.

covid19-fact-checking's People

Contributors

elip06 avatar konstantinschulz avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.