tariq60 / liar-plus
License: Creative Commons Zero v1.0 Universal
Hi, I was looking at your v2 datasets and found that some justifications are wrong. For example, id 6890 has a justification that does not make sense and does not appear in the original PolitiFact (PF) article. I am switching to your v1 dataset, but thought I should let you know.
Can you please add a license to the repository to clarify usage?
Thanks!
Hi Tariq.
There are a few bugs in the dataset, such as malformed lines (a missing leading index column) or missing justifications:
['1280', '1606.json', 'mostly-true', '"Hospitals, doctors, MRIs, surgeries and so forth are more extensively used and far more expensive in this country than they are in many other countries.\'\'', 'health-care', 'mitt-romney', 'Former governor', 'Massachusetts', 'republican', '34', '32', '58', '33', '19', 'a Fox News Sunday interview']
['9874.json', 'barely-true', 'Obamacare cuts seniors Medicare.', 'health-care,medicare', 'ed-gillespie', 'Republican strategist', 'Washington, D.C.', 'republican', '2', '3', '2', '2', '1', 'a campaign email.']
I've had to correct it with the following script:
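from typing import Any, Dict

JsonDict = Dict[str, Any]  # stand-in alias; the snippet did not show where JsonDict is defined

def load_line(self, line: str) -> JsonDict:
    sections = line.rstrip("\n").split("\t")
    # Some rows lack the leading index column, so the first field is "<id>.json".
    if ".json" in sections[0]:
        sections.insert(0, None)
    # Pad rows that lack trailing fields (e.g. the justification) to 16 columns.
    while len(sections) < 16:
        sections.append(None)
    return {
        # int(None) would raise a TypeError, so keep None when the index was missing.
        "id": int(sections[0]) if sections[0] is not None else None,
        "label": sections[2],
        "claim": sections[3],
        "topics": sections[4].split(","),
        "originator": sections[5],
        "title": sections[6],
        "party": sections[8],  # column 7 holds the state (e.g. 'Massachusetts'); party is column 8
        "justification": sections[15],
    }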
Hopefully this will help other users of the dataset, but could you please release a fixed version if possible?
James Thorne
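For anyone who wants to see which rows are affected before patching them, a minimal sketch along these lines may help. It assumes the layout implied by the script above (16 tab-separated columns, with the justification at index 15); the name `find_problems` and the problem labels are made up for illustration and are not part of the dataset's tooling.

```python
from typing import List

# Assumption: a well-formed row has 16 tab-separated columns
# (index, "<id>.json", label, ..., context, justification).
EXPECTED_COLUMNS = 16


def find_problems(line: str) -> List[str]:
    """Return a list of human-readable problems found in one TSV row."""
    fields = line.rstrip("\n").split("\t")
    problems = []
    # Rows without the leading index start directly with the "<id>.json" field.
    if fields[0].endswith(".json"):
        problems.append("missing index column")
        fields.insert(0, "")
    if len(fields) > EXPECTED_COLUMNS:
        problems.append("extra columns")
    elif len(fields) < EXPECTED_COLUMNS or not fields[15].strip():
        problems.append("missing justification")
    return problems


# The second malformed row quoted above, rebuilt as a tab-separated line.
row = "\t".join([
    "9874.json", "barely-true", "Obamacare cuts seniors Medicare.",
    "health-care,medicare", "ed-gillespie", "Republican strategist",
    "Washington, D.C.", "republican", "2", "3", "2", "2", "1",
    "a campaign email.",
])
print(find_problems(row))  # ['missing index column', 'missing justification']
```

Running this over each line of a split file and collecting the non-empty results gives a quick list of rows that need attention.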
There are some entries in the training data for which the justification contains CSS.
I found this while searching with the regex "@". I expected to get Twitter handles but was surprised to find CSS media queries.
1319 12435.json mostly-true In 2006, Donald Trump was hoping for a real estate crash. bankruptcy,candidates-biography,debt,economy,history,housing,income,jobs,stimulus,wealth hillary-clinton Presidential candidate New York democrat 40.0 29.0 69.0 76.0 7.0 an advertisement "Trump said, ""I don't know anything about David Duke. ""Trump seems to have forgotten that he knew enough about Duke to denounce him twice this election cycle. In 2000, he knew enough to criticize Dukes racism. And in 1991, he knew enough to say he hated what votes for Duke represented. . factbox { font-family: 'Roboto', 'Helvetica', sans-serif; color: rgba(0, 0, 0, 0. 86); font-weight: 400; margin: 0px; margin-top:40px; padding: 0; background-color:#f5f5f5; padding:32px; padding-bottom:24px; display: block; margin: auto; margin:0 auto; background-color: #ffffff; width:75%; height:auto; max-width:600px; min-width:260px; box-shadow: 0 2px 2px 0 rgba(0,0,0,. 34),0 3px 1px -2px rgba(0,0,0,. 4),0 1px 5px 0 rgba(0,0,0,. 32); position: relative; } . factbox . factbox-header { height: 25px; margin-top: -20px; border-bottom-width: 1px; border-bottom-color: silver; border-bottom-style: solid; margin-bottom: 20px; font-size: 16px; color: rgba(0,0,0,0. 86); } . factbox . name { display: inline-block; float:left; } . factbox . name img { display: inline-block; float:left; padding-bottom: 10px; padding-right: 20px; } . factbox . speaker{ float:left; padding-top: 10px; width:300px; } . factbox . speaker . speaker-name { font-weight: 500; font-size: 20px; padding-bottom: 8px; } . factbox . speaker . speaker-title { color: rgba(0,0,0,0. 54); font-size:14px; } . factbox . rating { padding-left:32px; float: right; width:120px; height:auto; display: inline; max-height: 200px; } . factbox . rating img { width: 100%; padding-bottom:24px; } . factbox h1 { display: inline-block; font-weight: 500; font-size: 20px; line-height: 1. 35em; -webkit-margin-before: . 
35em; -webkit-margin-after: 0em; -webkit-margin-start: 0px; -webkit-margin-end: 0px; } . factbox p { margin-top: 80px; width: 100%; display: block; font-size: 16px; line-height: 26px; } . factbox . dateline { font-size: 14px; color: rgba(0,0,0,0. 54); font-weight: 400; margin-top: 8px; } . factbox hr { border-bottom: 1px; border-left: 0px; border-right: 0px; border-top: 0px; border-style:solid; border-color:#e0e0e0; padding-top: 0px; padding-bottom: 0px; } . factbox . links { display:block; padding-top: 20px; } . factbox a:-webkit-any-link { color: #2196f3; text-decoration: none; cursor: pointer; } . factbox . links a { margin-right: 32px; font-size: 14px; text-transform: uppercase; color: #2196f3; text-decoration: none; font-weight: 500; } . factbox . links a:hover { cursor: pointer; } . factbox a :hover { cursor: pointer; text-decoration: none; } . factbox ul { padding-left: 0px; display:block; padding-top: 10px; padding-bottom: 0px; } . factbox li { display: inline; margin-right: 32px; font-size: 14px; text-transform: uppercase; color: #2196f3; text-decoration: none; font-weight: 500; cursor: pointer; } . factbox li a { font-size: 14px; text-transform: uppercase; color: #2196f3; text-decoration: none; font-weight: 500; } . factbox li a:visited { color: #2196f3; } . factbox . menu . share { display: none; margin-top: 32px; } . factbox . menu . share . share-button { display: inline-block; height: 64px; width: 64px; border-radius: 0px; position: relative; } . factbox . menu . share . share-button . share-label{ position: absolute; top: 50%; left: 50%; transform: translate(-50%, -50%); font-size: 18px; color: white; } . factbox . menu . share . share-button . share-label a:-webkit-any-link{ color:white; } . factbox . menu . share . share-button . share-label a{ color:white; } . factbox . menu . share . facebook { background-color: rgb(64,94,145); } . factbox . menu . share . twitter { background-color: rgb(104,172,216); } . factbox . menu . share . 
google-plus { background-color: rgb(215,84,81); } . factbox . menu . share . embed-button { display: inline-block; height: 64px; width: 275px; background: #ffffff; box-shadow: 0px 0px 2px 0px rgba(0,0,0,. 12), 0px 2px 2px 0px rgba(0,0,0,. 24); border-radius: 2px; font-weight: 500; font-size: 14px; color: #2196f3; letter-spacing: 0. 5px; line-height: 16px; position: relative; margin-left: 24px; } . factbox . menu . share . embed-button . embed-label { position: absolute; top: 50%; left: 50%; transform: translate(-50%, -50%); } . factbox . menu . share . embed-button . embed-label a { text-decoration: none; } . factbox . menu . share . embed-button . embed-label a:visited { color: #2196f3; } . factbox . share . embed_box { display: none; height: 300px; margin-top: 16px; width: 100%; } . factbox . menu li { color: rgba(0, 0, 0, 0. 87); text-transform: none; cursor: default; display: inline; } . factbox . menu li span { margin-right: 0px; } . factbox . menu li img { width: 100%; display: -webkit-flex; display: flex; } . factbox . menu ul. dropit-submenu { padding-top:32px; width: 100%; } . factbox . menu ul. dropit-submenu a { font-size: 14px; line-height: 25px; padding: 0 18px; } . factbox a { } @media screen and (min-width: 560px){ . factbox { width: 600px; } } @media screen and (max-width: 560px) { . factbox . rating { padding-left: 24px; float: right; width: 100px; height: auto; display: inline; } . factbox . name { padding-bottom: 8px; } . factbox p { clear: left; } . factbox . speaker { float: none; width: 100%; } } @media screen and (max-width: 460px) { . factbox . menu . share . embed-button { margin-top: 24px; margin-left: 0px; width: 100%; } . factbox . name { padding-bottom: 20px; } . factbox . speaker { float: none; } . factbox . speaker . speaker-name { font-size: 14px; } } @media screen and (max-width: 364px) { . 
factbox { padding: 24px; display: block; margin: auto; margin: 0 auto; background-color: #ffffff; width: 75%; height: auto; max-width: 600px; min-width: 260px; box-shadow: 0 2px 2px 0 rgba(0,0,0,. 14),0 3px 1px -2px rgba(0,0,0,. 2),0 1px 5px 0 rgba(0,0,0,. 12); position: relative; } . factbox . rating { padding-left: 24px; float: right; width: 90px; height: auto; display: inline; } . factbox . name { float: left; } . factbox . name img { float: left; } . factbox . speaker . speaker-title { font-size: 9px; } . factbox p { font-size: 16px; line-height: 24px; } } function toggleEmbedBox(e) { var embed_box = e. parentElement. parentElement. parentElement. getElementsByClassName('embed_box')[0]; if(embed_box. style. display == 'block'){ embed_box. style. display = 'none'; } else { var factbox = e; while (!factbox. classList. contains('factbox')) { factbox = factbox. parentElement; } var container = document. createElement(""div""); container. appendChild(factbox. innerHTML = container. innerHTML; embed_box. style. display = 'block'; } } function toggleShareVisibility(e) { var e = e. parentElement. parentElement. getElementsByClassName('share')[0]; if (e. style. display === 'block') { e. style. display = 'none'; } else { e. style. display = 'block'; } } //adopted from function facebookShare(url, picture, winWidth, winHeight) { openShare(' + url + '&picture=' + picture, winWidth, winHeight); } function twitterShare(tweet_id, winWidth, winHeight){ openShare(' + tweet_id, winWidth, winHeight); } function openShare(url, winWidth, winHeight){ var winTop = (screen. height / 2) - (winHeight / 2); var winLeft = (screen. width / 2) - (winWidth / 2); window. open(url, 'sharer', 'top=' + winTop + ',left=' + winLeft + ',toolbar=0,status=0,width=' + winWidth + ',height=' + winHeight); } Share The Facts Donald Trump Presidential candidate ""I don't know anything about David Duke. ""On CNN's State of the Union."
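Rows like the one above could be flagged automatically. The following is only a heuristic sketch: the patterns are guesses based on this single example (CSS declarations inside braces, `@media` queries, and JavaScript `function` definitions), and the name `has_markup_residue` is hypothetical. It will not catch every kind of leftover markup.

```python
import re

# Heuristic patterns derived from the example row above; assumptions, not a spec.
CSS_RULE = re.compile(r"\{\s*[a-z-]+\s*:\s*[^;{}]+;")  # e.g. "{ font-weight: 400;"
MEDIA_QUERY = re.compile(r"@media\b")                   # e.g. "@media screen and (...)"
JS_FUNCTION = re.compile(r"\bfunction\s+\w+\s*\(")      # e.g. "function openShare("


def has_markup_residue(justification: str) -> bool:
    """Return True if the text appears to contain leftover CSS or JavaScript."""
    return any(
        pattern.search(justification)
        for pattern in (CSS_RULE, MEDIA_QUERY, JS_FUNCTION)
    )


print(has_markup_residue(". factbox { font-weight: 400; margin: 0px; }"))  # True
print(has_markup_residue("@realDonaldTrump tweeted about it."))            # False
```

Unlike a bare "@" search, this does not match ordinary Twitter handles, so the flagged rows should be mostly the ones with embedded page markup.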