Coder Social home page Coder Social logo

Comments (7)

ricardorei avatar ricardorei commented on May 26, 2024 2

Ok, the scores make sense! HTER and DA's have different scales. HTER is a measure that you want to minimize. It reflects the effort required to "correct" the translation output in order to be semantically equivalent to the reference (higher HTER reflects more effort).

DA is a continuous scale of "how good is a translation" (a high DA score means that the translation is good).

Both models are telling you that your MT is not good. For a SOTA MT system, you should expect your HTER score to be close to 0 while the DA score should be between 0.6 and 1

Its all here: https://unbabel.github.io/COMET/html/models.html

from comet.

ricardorei avatar ricardorei commented on May 26, 2024 1

If you want to read more about HTER: Snover et al., 2006

and DA's: Graham et al., 2013

from comet.

ricardorei avatar ricardorei commented on May 26, 2024

This issue label is exactly for this type of questions! I am happy to help

What are the scores exactly?

Sometimes when comparing two systems with similar quality these two models (wmt-large-da-estimator-1719 and wmt-large-hter-estimator) can differ regarding "which model is better". Yet, when scoring a single MT the scores should point into the same direction...

from comet.

ricardorei avatar ricardorei commented on May 26, 2024

You are testing the model with 70k translations? can you compute a Pearson correlation between wmt-large-da-estimator-1719 and wmt-large-hter-estimator scores?

from comet.

george2seven avatar george2seven commented on May 26, 2024

This issue label is exactly for this type of questions! I am happy to help

What are the scores exactly?

Sometimes when comparing two systems with similar quality these two models (wmt-large-da-estimator-1719 and wmt-large-hter-estimator) can differ regarding "which model is better". Yet, when scoring a single MT the scores should point into the same direction...

Please find below the results:

  wmt-large-da-estimator-1719 wmt-large-hter-estimator emnlp-base-da-ranker
Score -0.21418807 0.212977027 0.145221945
Translations Count (same MT) 70544 70544 70544

Thanks for the support!

from comet.

george2seven avatar george2seven commented on May 26, 2024

You are testing the model with 70k translations? can you compute a Pearson correlation between wmt-large-da-estimator-1719 and wmt-large-hter-estimator scores?

Unfortunately we don't have within our team experience with this type of computation but I will ask our engineers to have a look.

from comet.

george2seven avatar george2seven commented on May 26, 2024

Thank you very much Ricardo! Makes sense now.

from comet.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.