Coder Social home page Coder Social logo

Use T5 as a highlighter about covidex HOT 6 OPEN

rodrigonogueira4 avatar rodrigonogueira4 commented on September 17, 2024
Use T5 as a highlighter

from covidex.

Comments (6)

daemon avatar daemon commented on September 17, 2024

Writing down some notes:

  • Eyeballing the T5-based highlighter as implemented seems to yield worse results (e.g., more super random highlightings of completely unrelated material, such as extraneous CC-BY attribution text).
  • We tried dynamic query representation as well as fixed query representation, i.e., f'Query: {query} Document: {document} Relevant:' vs '{query}' and '{document}'.
  • If we want to take advantage of caching the reranker, the T5-based highlighter is limited to a maximum sequence length of 256, whereas BioBERT to 512.
  • The T5-based highlighter is 0-25% faster with reranker caching and 10-20% slower without.

from covidex.

rodrigonogueira4 avatar rodrigonogueira4 commented on September 17, 2024

I think the main problem is that we are using 256 tokens for the reranker. Could you please try increasing to 512 tokens? There might be only a small increase in latency because we were underutilizing the GPU when feeding it with 256 tokens..

Also, since we will then have a spare GPU, we can use it to cut the inference time by half (but that we can leave for another PR)

from covidex.

daemon avatar daemon commented on September 17, 2024

Sure, but the results won't be the same as the TensorFlow implementation. Is that okay?

from covidex.

daemon avatar daemon commented on September 17, 2024

I guess I can evaluate it on R04.

from covidex.

rodrigonogueira4 avatar rodrigonogueira4 commented on September 17, 2024

Yeah, evaluating on R04 is an even better idea

from covidex.

santhoshkolloju avatar santhoshkolloju commented on September 17, 2024

can you throw some light on what do u mean highlighting...
can i get link to this BioBERT's highlighter?

from covidex.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.