askplatypus / wikidata-simplequestions
Mapping of the SimpleQuestions dataset to Wikidata
License: Other
The size of 'answerable.py' is 0 Bytes. The content is missing.
When I did not use the get_reverse() method, the result was property mappings with 323 matches. How can I get the other mappings? Thank you!
I have three questions:
How did you construct mid_to_qid.tsv? Is there a property that links Freebase and Wikidata?
What is the purpose of https://www.wikidata.org/wiki/Wikidata:WikiProject_Freebase/Mapping? It seems to contain only thousands of mappings from Freebase to Wikidata.
Is there any way to convert a Wikidata property to DBpedia?
I would really appreciate any help in this regard. Thanks!
Dear authors,
Thank you very much for your work.
Do you have a version of the data with explicitly the string label of the entity and property?
Like this:
Alex Golfis \t place of birth \t Athens \t what city was alex golfis born in
Instead of this:
Q16330302 \t P19 \t Q1524 \t what city was alex golfis born in
Thank you for your attention.
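A labeled version like the one requested could be produced with a small script, assuming a lookup table from IDs to labels is available. The sketch below is only an illustration: the `labels` dictionary and the `label_triples` helper are hypothetical and not part of this repository, and the labels themselves would have to be obtained separately (for example from a Wikidata dump).

```python
def label_triples(lines, labels):
    """Replace IDs with labels in tab-separated lines.

    Each line is assumed to hold: subject, property, object, question.
    IDs without a known label are left unchanged.
    """
    labeled = []
    for line in lines:
        subject, prop, obj, question = line.rstrip("\n").split("\t", 3)
        fields = [labels.get(subject, subject),
                  labels.get(prop, prop),
                  labels.get(obj, obj),
                  question]
        labeled.append("\t".join(fields))
    return labeled


# Hypothetical lookup table, filled with the example from this issue.
labels = {
    "Q16330302": "Alex Golfis",
    "P19": "place of birth",
    "Q1524": "Athens",
}
rows = ["Q16330302\tP19\tQ1524\twhat city was alex golfis born in"]
labeled_rows = label_triples(rows, labels)
print(labeled_rows[0])
```

Falling back to the raw ID when no label is known keeps the line count unchanged, so the output stays aligned with the original file.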
The files ending with "_answerable" contain only triples that are also in Wikidata.
In the first few lines of annotated_wd_data_test_answerable.txt, there are several issues:
m/01htzx (Action) is mapped to Q11272426 (some church in the Ukraine)
So, in these first ten lines, there are three or four correct entries, five which are not answerable and one where the mapping is incorrect.
Scaling that up would mean that I can trust about 40% of all the 'answerable' examples. That's not a lot and makes the dataset unusable in my opinion.
This work is very interesting and helpful.
My question is: what do the files ending with 'answerable' mean?
The files ending with "_full" contain only triples that are also in Wikidata.
And there is no file ending with "_full".
Thanks again, and merry Christmas!
Hi,
thanks for this data!
I noticed that annotated_wd_data_test_answerable.txt contains 5621 questions, whereas qald-format/annotated_wd_data_test.json contains 5721 (jq ".questions[].query.answers" annotated_wd_data_test.json | grep entity). Does the qald-format directory contain the same data as the *_answerable.txt files? Furthermore, the qald data contains multiple answers to a question (if applicable), but in the *_answerable.txt files there is always exactly one answer, and not always the same one as in the qald-format files. For example, "what is the film genre for snow falling on cedars?" has as answers the entities Q1054574, Q1257444, Q130232 and Q3072039 in qald-format/annotated_wd_data_test.json (and on wikidata.org) but only Q1257444 in annotated_wd_data_test.txt (possibly old data from Freebase?).
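A comparison like the one described above could be scripted roughly as follows. This is a minimal sketch: it assumes the .txt lines are tab-separated (subject, property, object, question) and that the QALD answers have already been extracted into a dictionary from question text to a set of entity IDs. The helper names and the placeholder IDs `Q_SUBJECT` and `P_GENRE` are hypothetical.

```python
def parse_txt(lines):
    """Map each question to its single answer (the object column)."""
    answers = {}
    for line in lines:
        _subject, _prop, obj, question = line.rstrip("\n").split("\t", 3)
        answers[question] = obj
    return answers


def compare(txt_answers, qald_answers):
    """Report questions that differ between the two sources."""
    txt_q, qald_q = set(txt_answers), set(qald_answers)
    return {
        # Questions present in only one of the two sources.
        "only_in_txt": sorted(txt_q - qald_q),
        "only_in_qald": sorted(qald_q - txt_q),
        # Shared questions whose .txt answer is absent from the QALD set.
        "answer_missing": sorted(
            q for q in txt_q & qald_q
            if txt_answers[q] not in qald_answers[q]
        ),
    }


question = "what is the film genre for snow falling on cedars?"
# Q_SUBJECT and P_GENRE are placeholders, not real dataset values.
txt_lines = ["Q_SUBJECT\tP_GENRE\tQ1257444\t" + question]
qald_answers = {
    question: {"Q1054574", "Q1257444", "Q130232", "Q3072039"},
    "hypothetical extra question": {"Q1"},
}
report = compare(parse_txt(txt_lines), qald_answers)
print(report)
```

Run over the full files, the lengths of `only_in_txt` and `only_in_qald` would show where the 5621 versus 5721 difference comes from.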
Concerning the qald-format directory: what is the difference between annotated_wd_data_*_full.json and annotated_wd_data_*.json? For instance, annotated_wd_data_train_full.json is much larger than annotated_wd_data_train.json, while for valid and test it is the opposite.
It seems that, in some cases, the queries in the 'qald-format' directory point to the answer entities instead of the question entities. E.g., in the first example the query contains the link for Saving Shiloh rather than for Warner Bros.