Finetune Problem about refact (OPEN)

ChinnYu commented on May 30, 2024
Finetune Problem

from refact.

Comments (12)

hazratisulton commented on May 30, 2024

Hello, @ChinnYu!
Can you provide logs?
Are you using the release Docker image or building from source?
What message do you see when you click "Run Filter"?

ChinnYu commented on May 30, 2024

Hello, @hazratisulton. I built the image from the source code using the provided Dockerfile. The resulting image reports an error when I press 'Run Filter'.
image

JegernOUTT commented on May 30, 2024

@hazratisulton have you managed to reproduce it?

hazratisulton commented on May 30, 2024

JegernOUTT wrote: "@hazratisulton have you managed to reproduce it?"

No, I couldn't. I asked @mitya52 to take a look, maybe he could offer some ideas.

ChinnYu commented on May 30, 2024

Hi @hazratisulton @JegernOUTT, I tried building the image from the latest source code (12/28) on the 'dev' branch, and the same issue persists. If you need a specific log for analysis, please let me know and I'll provide it.

image

ChinnYu commented on May 30, 2024

After numerous code modifications, I found that renaming 'aux' to '_aux' lets Python locate the module. However, the change introduces two new issues:

1. 'Index out of bounds' occurs when pressing 'Run Filter' with 'codellama' selected.
2. 'AssertionError: You have to have more files to process than processes' is raised at the start of Finetune, even though my number of files does exceed the number of processes.

The first issue is resolved by switching to transformers==4.34.0. The second requires modifying the allocation rules.

image

image
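
The thread does not show the allocation code, so the following is only a minimal sketch of one way such a rule could be relaxed. It assumes the file list is sharded across worker processes and that the assertion guards against empty shards. `split_files` and `max_workers` are hypothetical names, not identifiers from refact.

```python
from typing import List, Sequence


def split_files(files: Sequence[str], max_workers: int) -> List[List[str]]:
    """Shard `files` round-robin across at most `max_workers` workers.

    Hypothetical helper: instead of asserting that there are more files
    than processes, it lowers the worker count so no shard is empty.
    """
    if max_workers < 1:
        raise ValueError("max_workers must be at least 1")
    if not files:
        return []
    # Never start more workers than there are files to hand out.
    workers = min(max_workers, len(files))
    shards: List[List[str]] = [[] for _ in range(workers)]
    for index, path in enumerate(files):
        shards[index % workers].append(path)
    return shards


if __name__ == "__main__":
    for shard in split_files(["a.py", "b.py", "c.py"], max_workers=8):
        print(shard)
```

Capping the worker count trades some parallelism for robustness on small datasets. Whether that matches the intended behavior of the finetune filter is an open question for the maintainers.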

olegklimov commented on May 30, 2024

Interesting! But it works in nightly without any changes 🤔 Let's ask what @JegernOUTT and @mitya52 think.

ChinnYu commented on May 30, 2024

Hi, I'd like to ask another question. I'm trying to integrate 'deepseek-ai/deepseek-coder-33b-base' into refact. I added the 33B model to two files: 'refact/known_models_db/refact_known_models/huggingface.py' and 'self_hosting_machinery/finetune/configuration/supported_models.py', following the existing 'deepseek-coder/5.7b/mqa-base' entry, and I also added it to 'known_models' in 'refact-lsp'. However, Visual Studio Code keeps reporting the following errors, even though the main page shows that the model was initialized successfully. Do you have any debugging suggestions for this issue?
image

Additionally, there is a warning as shown in the second image. What should be configured in this case? Thank you.

image

olegklimov commented on May 30, 2024

@ChinnYu awesome that you are trying this! You might need a change in refact-lsp: add the model there by analogy with the other models.

There was an idea to let users try new models through a "works like this other known model" option in settings. It turned out to be not very practical: the best setting is no setting, because settings get outdated, need tech support to remove once they are no longer necessary, require server-side changes, and so on. Or maybe we could return to this idea, because it makes it possible to try a model quickly without recompiling the LSP.
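
The "works like this other known model" idea could look roughly like the sketch below. Everything in it is hypothetical: `KNOWN_MODELS`, `works_like`, and the settings fields are illustrative names and values, not refact-lsp's real data structures. It only shows an unknown model borrowing the settings of a known one named in user configuration.

```python
from typing import Dict, Optional

# Hypothetical registry; the field names and values are illustrative only.
KNOWN_MODELS: Dict[str, Dict[str, object]] = {
    "known-model-a": {"n_ctx": 4096, "supports_fim": True},
}


def resolve_settings(
    model: str,
    works_like: Optional[Dict[str, str]] = None,
) -> Dict[str, object]:
    """Return settings for `model`, borrowing from a known model if needed.

    `works_like` maps an unknown model name to a known one, mimicking a
    user-side "works like this other known model" setting.
    """
    if model in KNOWN_MODELS:
        return dict(KNOWN_MODELS[model])
    alias = (works_like or {}).get(model)
    if alias in KNOWN_MODELS:
        # Copy so callers cannot mutate the shared registry entry.
        return dict(KNOWN_MODELS[alias])
    raise KeyError(f"unknown model {model!r} and no usable 'works like' entry")
```

Under these assumptions, passing `works_like={"deepseek-ai/deepseek-coder-33b-base": "known-model-a"}` would resolve the 33B model to the known entry's settings without a rebuild, at the cost of one more setting that can go stale.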
