Coder Social home page Coder Social logo

huggingface-to-tfhub's Issues

Convert MiniLM models

  • L6xH768 MiniLMv2 distilled from RoBERTa-Large
  • L12xH384 MiniLMv2 distilled from RoBERTa-Large
  • L6xH384 MiniLMv2 distilled from RoBERTa-Large
  • L12xH384 MiniLMv2 distilled from XLMR-Large
  • L6xH384 MiniLMv2 distilled from XLMR-Large

https://github.com/microsoft/unilm/tree/master/minilm

-> to use in personal project (multilingual L6xH384 + "Load what you need")

Allow bert_pack_inputs to have a configurable seq_length

Hey there! First off I'd like to thank you on all the work you've done to convert these HF models into TF Hub models, we've found your work really useful within my organization.

We were trying to use the DistilBERT uncased model, and wanted to set a different seq_length for bert_pack_inputs. We noticed on the official tensorflow site, it's conventional to support an optional seq_length for the preprocessor's bert_pack_inputs implementation. Indeed in your underlying implementation, there is the option, but it's not exposed in the call function, which only allows the default length of 128.

For now, we have copied our own version of DistilBertPackInputs, but it would be nice to just use the TF Hub preprocessor as-is.

I was wondering if there was a reason you avoided including it - I'd be happy to open a PR to add it, but I don't have a good means of testing to make sure that the change is backwards compatible with anything else you may be using it for.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.