vis-nlp / chartqa
License: GNU General Public License v3.0
Hi @vis-nlp,
Thank you for the useful benchmark and insightful paper!
Could you point me to a reference implementation of the relaxed accuracy computation?
In particular, I am interested in the logic that discriminates between numeric and non-numeric answer types (is it based on casting behavior?).
Have a great day!
Yana
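For reference, here is my current understanding of the metric — a sketch only, assuming a 5% relative tolerance for numeric answers and exact string match otherwise; the casting-based type check is my guess at the intended logic, not the repo's actual code:

```python
def _to_float(s):
    # Hypothetical normalization: strip percent signs and thousands separators.
    return float(s.replace("%", "").replace(",", "").strip())

def is_numeric(s):
    # An answer is treated as numeric if it casts to float.
    try:
        _to_float(s)
        return True
    except ValueError:
        return False

def relaxed_match(pred, target, tolerance=0.05):
    """Relaxed accuracy for one (prediction, gold) pair.

    Numeric answers match within a relative tolerance; everything else
    falls back to case-insensitive exact match.
    """
    if is_numeric(target) and is_numeric(pred):
        t, p = _to_float(target), _to_float(pred)
        if t == 0:
            return p == 0
        return abs(p - t) / abs(t) <= tolerance
    return pred.strip().lower() == target.strip().lower()
```

Is this roughly what the official evaluation does?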
Hi,
Sorry to disturb you. I am trying to retrain the chart visual feature extractor on other chart datasets. Based on the instructions, it seems we need the train_dict.pickle and val_dict.pickle files, which contain the ground-truth mask annotations.
Would you mind sharing your preprocessing code for producing these pickle files from the ChartQA dataset (or other chart datasets)?
Best
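(For anyone else trying to reproduce this: below is my guess at the pickle schema — the key names, mask shapes, and labels here are pure assumptions on my part, not the repo's actual format.)

```python
import pickle
import numpy as np

# Hypothetical schema: image filename -> per-element binary masks plus labels.
# This is only a guess at what train_dict.pickle / val_dict.pickle contain.
train_dict = {
    "chart_0001.png": {
        "masks": np.zeros((2, 64, 64), dtype=np.uint8),  # one mask per chart element
        "labels": ["bar", "bar"],
    }
}

with open("train_dict.pickle", "wb") as f:
    pickle.dump(train_dict, f)

# Round-trip to confirm the file can be read back as-is.
with open("train_dict.pickle", "rb") as f:
    loaded = pickle.load(f)
```

If you could confirm or correct the expected structure, that alone would already help.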
Hi,
Thank you for sharing the code and dataset of your interesting research. I am currently doing some research that also requires extracting tables from chart images.
In the paper, you mentioned that you extended the ChartOCR method to extract the full table output.
I wonder if you could also share this code, as it would help my current research a lot.
Thank you for taking the time to consider this request.
We have a few queries for which we would appreciate your help.
In section 4.1, you mentioned that gold data tables are not available therefore you set up the extraction mechanism to get the underlying tables. We have downloaded the dataset from (https://drive.google.com/file/d/17-aqtiq_KJ16PIGOp30W0y6OJNax6SVT/view) which has table annotations present. Are these extracted tables referred to as 'Gold Data Tables' in the paper?
If there is a separate set of 'Gold Data Tables' not available from the link mentioned above, can you also share those for reproducibility purposes?
And if the extracted tables are the same as the gold data tables, what do the results in Table 5 of the paper imply? How can TaPas predict answers if the tables themselves are not provided?
Hi,
Since the checkpoint for VisionTapas is not available, I am trying to train this model from scratch. The code directory does not have any instructions for running the code. It would be great if you could add a small README file with the steps to train this model. If you could also list the required library versions, that would be appreciated.
Currently, I am using torch==1.13.0 with CUDA==11.6.2, and transformers==4.24.0. When I run the training script, it seems the fixed_vocab_file is missing. Can you please explain how to obtain it?
I understand you might be short on time, but I would appreciate it if you could at least share one of the training commands you used. Currently, my command looks like this:
python train_question_answering.py --train-folder ../data/chartqa/train/ --validation-folder ../data/chartqa/val/ --qa-train-file ../data/chartqa/train/train_augmented.json --qa-val-file ../data/chartqa/val/val_augmented.json --out-dir vision-tapas-model
Hoping for a quick reply.
Thanks,
-- ashim
This paper is great work.
The bounding-box annotations of the chart elements would be helpful for training a chart element detection model or an end-to-end ChartQA model.
I wonder when the annotations will be released?
Hello, I have a question regarding the results for pre-trained VLT5 and VisionTapas reported in Table 6. How did you do the pre-training? Did you pre-train on all the other chart QA datasets with the QA objective and then fine-tune on ChartQA?
I see that the repo contains a GPL-3.0 license.
Does that apply to the dataset or only to the source code?
As the compute and time requirements for training these models are high, it would be of great help if you could share trained checkpoints on the ChartQA data for the following models:
VisionTAPAS, T5, and VLT5