Coder Social home page Coder Social logo

Use with Llama-2-70b-hf? about alpaca_farm HOT 4 CLOSED

mensch72 avatar mensch72 commented on August 17, 2024
Use with Llama-2-70b-hf?

from alpaca_farm.

Comments (4)

lxuechen avatar lxuechen commented on August 17, 2024

Hi, thanks for reporting this.

Size mismatch would definitely be one big issue. Since you're trying to run the recovery script, which is essentially running weights addition under the hood, we expect the base checkpoint to be exactly the original 7b checkpoint in hugging face format.

from alpaca_farm.

mensch72 avatar mensch72 commented on August 17, 2024

OK, then I need to find a way to obtain Llama-1 rather than Llama-2. Any hints are welcome.

from alpaca_farm.

lxuechen avatar lxuechen commented on August 17, 2024

I think there are third-party weights on spaces if you search for llama there. Let me know how it goes!

from alpaca_farm.

mensch72 avatar mensch72 commented on August 17, 2024

I tried the llama-7b-hf from https://huggingface.co/decapoda-research/llama-7b-hf which looks like it is a copy of the original. Still, I get the exact same error as in my first comment above.
Can you point me to any version of llama that should work, so that I can rule out that it is a problem with the model?

from alpaca_farm.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.