Coder Social home page Coder Social logo

seukgcode / melbench Goto Github PK

View Code? Open in Web Editor NEW
77.0 4.0 4.0 9 MB

Multimodal entity linking (MEL) aims to utilize multimodal information to map mentions to corresponding entities defined in knowledge bases. We release three MEL datasets: Weibo-MEL, Wikidata-MEL and Richpedia-MEL, containing 25,602, 18,880 and 17,806 samples from social media, encyclopedia and multimodal knowledge graphs respectively. A MEL dataset construction approach is proposed, including five stages: multimodal information extraction, mention extraction, entity extraction, triple construction and dataset construction. Experiment results demonstrate the usability of the datasets and the distinguishability between baseline models.

License: MIT License

multimodal entity-linking knowledge-graph

melbench's Introduction

Multimodal Entity Linking Datasets Benchmark

1. Abstract

Multimodal entity linking (MEL) aims to utilize multimodal information to map mentions to corresponding entities defined in knowledge bases. We release three MEL datasets: Weibo-MEL, Wikidata-MEL and Richpedia-MEL, containing 25,602, 18,880 and 17,806 samples from social media, encyclopedia and multimodal knowledge graphs respectively. A MEL dataset construction approach is proposed, including five stages: multimodal information extraction, mention extraction, entity extraction, triple construction and dataset construction. Experiment results demonstrate the usability of the datasets and the distinguishability between baseline models.

2. Introduction

Multimodal entity linking (MEL) aims to utilize multimodal information to map mentions to corresponding entities defined in knowledge bases. We release three MEL datasets: Weibo-MEL, Wikidata-MEL and Richpedia-MEL, containing 25,602, 18,880 and 17,806 samples from social media, encyclopedia and multimodal knowledge graphs respectively. A MEL dataset construction approach is proposed, including five stages: multimodal information extraction, mention extraction, entity extraction, triple construction and dataset construction. Experiment results demonstrate the usability of the datasets and the distinguishability between baseline models.

To construct large-scale MEL datasets, we propose a MEL dataset construction approach, including five stages. In Multimodal Information Extraction, we select multimodal data sources and extract textual and visual information. In Mention Extraction, we extract mentions from textual information and keep the mentions which corresponding entities may exist. In Entity Extraction, we query the knowledge bases with the filtered mentions, gather the entity lists, and save the correct entities. In Triple Construction, we merge the corresponding mentions and entities into mention-entity (M-E) pairs, and combine them into triples with textual and visual information. Then, we keep the correct triples as the samples of the MEL dataset. Finally, in Dataset Construction stage, we partition the dataset into training set (70%), validation set (10%) and testing set (20%). The overview of the approach is illustrated in figure below.

image-20210704105704949

3. Visual Information and Knowledge Graph Resources

Due to the large size of the visual information and knowledge graph resources, we deposited these resources in the Baidu Cloud Disk.

Visual information download addresses:

Knowledge graph download addresses:

The extraction codes are 2021.

4. Samples of the MEL datasets

4.1 Weibo-MEL dataset

Visual information:

  • image-20210703210612299

Textual information:

  • 【感谢30年的坚守!#辽宁舰首位一级军士长退役# 】近日,辽宁舰为首位一级军士长刘德波举行了隆重的退役仪式。刘德波,1990年12月入伍,服役期间,荣获全军士官优秀人才奖二等奖1次,荣立二等功1次,三等功4次。入伍30年,他坚守在不见阳光的深舱,始终与高温和热浪相伴,穿梭于管路和设备之间,把最美好的青春年华都献给了心爱的战舰。致敬!(北海舰队)#我为**军人点赞# [组图共9张] 原图

Mention-entity pairs:

  • "刘德波": "刘德波"
  • "**": "**(世界四大文明古国之一)"
  • "辽宁舰": "**人民解放军海军辽宁舰"

4.2 Wikidata-MEL dataset

Visual information:

  • image-20210703212658556

Textual information:

  • Seattle Mayor Charles L. Smith (left) with Will Rogers, circa 1935.

Mention-entity pairs:

  • "Charles L. Smith": "Charles L. Smith (Seattle politician)"
  • "Will Rogers": "Will Rogers"

4.3 Richpedia-MEL dataset

Visual information:

  • image-20210703212710693

Textual information:

  • Washington resigned his commission after the Treaty of Paris in 1783.

Mention-entity pairs:

  • "Washington": "George Washington"

melbench's People

Contributors

bluestragglers avatar npubird avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

melbench's Issues

Where is the password of Baidu Cloud Disk

I don't find the password of Baidu Cloud Disk, could you add it in the README.md ?
another issue is that whether the database of Weibo-MEL is Wikidata Knowledge Graph

baseline 分数

Hi,校友们~

我最近在尝试用你们提供的mel数据做实验,非常好的工作!
想请问,这几个数据集你们有baseline的方案和F1可以report吗?我想知道自己做到什么程度了。

Thanks

train/valid/test split

Hello,

Thank you for releasing this well-formatted multimodal entity linking dataset! I'm wondering when will the training/valid/test splits be shared?

数据集图片

你好,我关注到您的相关数据集图片部分发布在百度网盘中,但并无提取码,请问怎么获取?

数据集实体标注疑问

您好!感谢您提供的数据集。
我发现在句子中提及的entity并没有被全部标注出来。例如readme中例子中的“北海舰队”和“Seattle”,都是entity。但是没有标注。请问这是什么原因呢?
谢谢~

RichpediaMEL里每条数据的图片怎么找?

您好,我是汪老师的学生。RichpediaMEL里每条数据并未给出实体对应的图片名或者路径,从网盘下载图片也未能从图片名中看出端倪,请问实体与图片之间是怎么对应的?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.