Comments (7)

jeffra commented on May 4, 2024

This feature is definitely on our radar, but I don’t have a concrete timeline yet. We’ll be sure to update this thread when we have more info.

from deepspeed.

samyam commented on May 4, 2024

@Michiel29 as Jeff mentioned, this is something we definitely want, but it's not at the top of our priority list at the moment. We would be more than happy to accept contributions on this from the community. If you would like to contribute, we can definitely share our discussions on what would be needed to support LAMB with ZeRO. Please let us know.

LiweiPeng commented on May 4, 2024

@jeffra I'd like to use LAMB with ZeRO too. Can you share any updates on using LAMB + ZeRO?

e-sha commented on May 4, 2024

@Michiel29 as Jeff mentioned, this is something we definitely want, but it's not at the top of our priority list at the moment. We would be more than happy to accept contributions on this from the community. If you would like to contribute, we can definitely share our discussions on what would be needed to support LAMB with ZeRO. Please let us know.

Why is LAMB not in the list of supported optimizers?
Please share your ideas on what would be needed to support LAMB with ZeRO.

e-sha commented on May 4, 2024

I realized that we have to add two updates to use LAMB with ZeRO:

  1. Partition the model parameters so that each whole tensor stays on a single node (here)
  2. Preserve the split of the flattened parameter vector into tensors at the optimizer initialization stage (here)

Am I right?
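
To make the two points concrete, here is a minimal sketch in plain Python. It is not DeepSpeed code: the function names `partition_whole_tensors`, `flatten_with_offsets`, and `trust_ratios` are hypothetical, and the greedy assignment is just one possible way to balance whole tensors across ranks. It assumes the usual per-tensor LAMB trust ratio (weight norm divided by update norm), which is why a tensor must not be cut in the middle of a partition and why the tensor boundaries inside the flat buffer have to be remembered.

```python
import math


def partition_whole_tensors(sizes, world_size):
    """Assign whole tensors to ranks (hypothetical helper, point 1).

    Greedy balancing: the largest tensors are placed first, each on the
    rank that currently holds the fewest elements. No tensor is split.
    """
    loads = [0] * world_size
    owners = [[] for _ in range(world_size)]
    for idx in sorted(range(len(sizes)), key=lambda i: -sizes[i]):
        rank = loads.index(min(loads))
        owners[rank].append(idx)
        loads[rank] += sizes[idx]
    return owners


def flatten_with_offsets(tensors):
    """Flatten tensors into one buffer but keep their boundaries (point 2)."""
    flat, offsets = [], []
    for values in tensors:
        offsets.append((len(flat), len(flat) + len(values)))
        flat.extend(values)
    return flat, offsets


def trust_ratios(flat_weights, flat_updates, offsets):
    """Per-tensor LAMB trust ratio ||w|| / ||update|| from flat buffers.

    Falls back to 1.0 when either norm is zero. The norms are only correct
    because every (start, end) range covers one complete tensor.
    """
    ratios = []
    for start, end in offsets:
        w_norm = math.sqrt(sum(w * w for w in flat_weights[start:end]))
        u_norm = math.sqrt(sum(u * u for u in flat_updates[start:end]))
        ratios.append(w_norm / u_norm if w_norm > 0 and u_norm > 0 else 1.0)
    return ratios


# Four tensors with 4, 1, 3 and 2 elements, shared between two ranks.
owners = partition_whole_tensors([4, 1, 3, 2], world_size=2)

# One rank's local tensors, flattened with their boundaries recorded.
weights, offsets = flatten_with_offsets([[3.0, 4.0], [1.0]])
updates, _ = flatten_with_offsets([[0.6, 0.8], [0.5]])
ratios = trust_ratios(weights, updates, offsets)
```

If the flat buffer were instead cut into equal-sized chunks, a rank could end up holding only part of a tensor, and the two norms above would need an extra cross-rank reduction per tensor.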

gongjingcs commented on May 4, 2024

I realized that we have to add two updates to use LAMB with ZeRO:

  1. Partition the model parameters so that each whole tensor stays on a single node (here)
  2. Preserve the split of the flattened parameter vector into tensors at the optimizer initialization stage (here)

Am I right?

Hi, I am not very familiar with LAMB. Can you explain in detail your thoughts on combining LAMB with ZeRO? Thanks a lot.

gongjingcs commented on May 4, 2024

@Michiel29 as Jeff mentioned, this is something we definitely want, but it's not at the top of our priority list at the moment. We would be more than happy to accept contributions on this from the community. If you would like to contribute, we can definitely share our discussions on what would be needed to support LAMB with ZeRO. Please let us know.

Hi, I'd like to use LAMB with ZeRO stage 1. Could you share your ideas on what would be needed to support LAMB with ZeRO?
Do we need to modify the implementation of fused LAMB or the implementation of ZeRO?
If I know the detailed idea, I can contribute the code. Looking forward to your reply.
