Coder Social home page Coder Social logo

Comments (5)

aerdem4 avatar aerdem4 commented on June 9, 2024

Thanks for your interest. Pre-trained embeddings would be basically for each Quora question, therefore I don't see a general use case for it. How do you plan to use them? In which problem? On the other hand, current model doesn't need any expensive hardware. It can run on a laptop with Nvidia 950m and 8 GB RAM.

from kaggle-quora-dup.

aerdem4 avatar aerdem4 commented on June 9, 2024

Ah my bad, you didn't mean the sentence embeddings but the model weights. It is indeed possible. Nowadays, my GPU is busy with some Kaggle competitions. I can train a model and add its weights to the repo when I find spare computing time. But if you want to use it for a real life problem, it should be trained without non-nlp features.

from kaggle-quora-dup.

jembishop avatar jembishop commented on June 9, 2024

The application was for a text adventure game where the user could input a question and it could be matched (using a model such as this) to a list of predefined questions. Don't know how well that will work in practice as but this seems like a good starting point, as it is lightweight and therefore should (?) be able to process ~100 question pairs in ~1s.

Yes I should have read the requirements my bad, I can actually handle this on my laptop. I still think it would be valuable to have the weights on the repo though. Thanks for helping me out.

from kaggle-quora-dup.

aerdem4 avatar aerdem4 commented on June 9, 2024

For your use case, non-NLP features (graph features) should be discarded. They were only helping for utilizing Kaggle's sampling bias: https://github.com/aerdem4/kaggle-quora-dup/blob/master/model.py#L149

Btw, I have noticed that my code needs serious refactoring. It was 2 years ago and I forgot to convert it to a reusable ML pipeline just after the competition. Hopefully, when I find time, I can convert it to something that can be used in real life problems directly.

from kaggle-quora-dup.

jembishop avatar jembishop commented on June 9, 2024

Cool, that would be awesome if you get round to that.

from kaggle-quora-dup.

Related Issues (8)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.