[QUESTION] Is comet download still a supported command? about comet HOT 8 CLOSED

unbabel commented on May 23, 2024

[QUESTION] Is comet download still a supported command?

from comet.

Comments (8)

ricardorei commented on May 23, 2024

Hi @chanberg

This is the link for the 2020 DAs (direct assessments):

wget https://unbabel-experimental-data-sets.s3.eu-west-1.amazonaws.com/wmt/2020-da.csv.tar.gz

2020 DA Relative-Ranks:

wget https://unbabel-experimental-data-sets.s3.eu-west-1.amazonaws.com/wmt/2020-daRR.csv.tar.gz

The MQM data is available here, but I'll try to upload the exact files we used after splitting the data and creating the z-scores. I'll try to do that later today or tomorrow.

ricardorei commented on May 23, 2024

@chanberg I also prepared the WMT20 MQM annotated data.

The entire dataset, with MQM sentence scores and the corresponding z-scores:

wget https://unbabel-experimental-data-sets.s3.eu-west-1.amazonaws.com/wmt/2020-MQM.csv.tar.gz

The train split we used:

wget https://unbabel-experimental-data-sets.s3.eu-west-1.amazonaws.com/wmt/2020-MQM.train.csv.tar.gz

The corresponding test split:

wget https://unbabel-experimental-data-sets.s3.eu-west-1.amazonaws.com/wmt/2020-MQM.test.csv.tar.gz
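For convenience, the WMT20 links posted in this thread could be fetched and unpacked in one go. The following is only a minimal sketch: the function names `archive_url` and `fetch_all` and the target directory are hypothetical, and it assumes each `.csv.tar.gz` file is a gzip-compressed tar archive. The thread does not say what the files inside are called, so the sketch does not rely on their names.

```shell
# Sketch: download and unpack the WMT20 archives linked in this thread.
# Assumption: each *.csv.tar.gz is a gzip-compressed tar archive.

BASE_URL="https://unbabel-experimental-data-sets.s3.eu-west-1.amazonaws.com/wmt"

# Archive names copied from the wget links in this thread.
ARCHIVES="2020-da 2020-daRR 2020-MQM 2020-MQM.train 2020-MQM.test"

# archive_url NAME: print the full download URL for one archive.
archive_url() {
  printf '%s/%s.csv.tar.gz\n' "$BASE_URL" "$1"
}

# fetch_all DEST: download every archive into DEST and extract it there.
# Stops at the first failed download or extraction.
fetch_all() {
  dest="$1"
  mkdir -p "$dest" || return 1
  for name in $ARCHIVES; do
    url="$(archive_url "$name")"
    wget -q -P "$dest" "$url" || return 1
    tar -xzf "$dest/$name.csv.tar.gz" -C "$dest" || return 1
  done
}

# Usage (hypothetical target directory):
#   fetch_all ./wmt20-data
```

Nothing is downloaded until `fetch_all` is called, so the functions can be sourced into an existing script and run only when the data is actually needed.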

ricardorei commented on May 23, 2024

Don't forget to cite Markus Freitag's paper if you use this MQM data from 2020:

@article{50397,
title	= {Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation},
author	= {Markus Freitag and George Foster and David Grangier and Viresh Ratnakar and Qijun Tan and Wolfgang Macherey},
year	= {2021},
URL	= {https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00437/108866/Experts-Errors-and-Context-A-Large-Scale-Study-of},
journal	= {Transactions of the Association for Computational Linguistics},
pages	= {1460-1474},
volume	= {9}
}

And cite the WMT Metrics/News Translation tasks if you use the direct assessments!

ricardorei commented on May 23, 2024

The download command is not supported anymore. I'll add a README with download links for the data.

I also have to do that for this year's shared task models.

Meanwhile, you can use the links from the previous version:

Apequest:

wget https://unbabel-experimental-data-sets.s3-eu-west-1.amazonaws.com/comet/hter/apequest.zip

QT21:

wget https://unbabel-experimental-data-sets.s3-eu-west-1.amazonaws.com/comet/hter/qt21.zip

WMT17 to WMT19 (this includes relative ranks and DA scores):

wget https://unbabel-experimental-data-sets.s3-eu-west-1.amazonaws.com/comet/da/wmt-metrics.zip

isabelcachola commented on May 23, 2024

@ricardorei This is exactly what I needed. Thank you!

chanberg commented on May 23, 2024

Hi @ricardorei,

Are there already any similar download links for the 2021 shared task?
Thank you!

Cheers,
Chantal

chanberg commented on May 23, 2024

@ricardorei thank you so much!

chanberg commented on May 23, 2024

@ricardorei this is great! thank you so much!
