Comments (12)
Hello, @ChinnYu!
Can you provide logs?
Are you using release docker or using source code?
What message you see when clicking on "Run Filter"?
from refact.
Hello, @hazratisulton. I used the source code and the provided Dockerfile to build the image. I noticed that the built image has an error when I press 'run filter.
from refact.
@hazratisulton have you managed to reproduce it?
from refact.
@hazratisulton have you managed to reproduce it?
No, I couldn't. I asked @mitya52 to take a look, maybe he could offer some ideas.
from refact.
HI @hazratisulton @JegernOUTT , I attempted to build the image using the latest version of the source code (12/28) from the 'dev' branch, and it seems that the same issue persists. If there's a specific log for analysis that you need, please let me know, and I'll provide it.
from refact.
After numerous code modifications, I discovered that changing 'aux' to '_aux' allows locating the Python module. But it also brings about two issues. 1. 'Index out of bounds' occurs when pressing 'run filter' and selecting 'codellama.' 2. 'AssertionError: You have to have more files to process than processes' happens at the beginning of Finetune. Indeed, the number of my files exceeds the number of processes. The first one is resolved by changing to transformers==4.34.0, and for the second one, the allocation rules need to be modified.
from refact.
Interesting! But it works in nightly without any changes 🤔 Let's ask what @JegernOUTT and @mitya52 think.
from refact.
Hi, I'd like to ask another question. I'm attempting to integrate 'deepseek-ai/deepseek-coder-33b-base' into the refact. I added the 33B model to these two files: 'refact/known_models_db/refact_known_models/huggingface.py' and 'self_hosting_machinery/finetune/configuration/supported_models.py'. The modification process is similar to 'deepseek-coder/5.7b/mqa-base', and I've also added 'known_models' in 'refact-lsp'. However, Visual Studio Code (vscode) continues to report the following errors. I have checked the main page and confirmed that the model has been successfully initialized. Do the experts have any debugging suggestions for this issue?
Additionally, there is a warning as shown in the second image. What should be configured in this case? Thank you.
from refact.
@ChinnYu awesome that you are trying this! You might need a change in refact-lsp
, just add the model there by analogy like the other models.
There was this idea to try new models using "works like this other known model" in settings. But then it appeared not very practical (the best settings is no settings, because it gets outdated, needs tech support to remove unnecessary settings once it's there and server side changes, etc). Or maybe we could return to this idea, because it allows to try a model quickly without recompiling the lsp.
from refact.
Related Issues (20)
- Support for HTTP proxies
- Add DeepSeek Coder models HOT 4
- Add Code LLaMA HOT 4
- Could not scan repo with Refact/1.6B selected HOT 2
- Refactoring of the finetuning script HOT 1
- [CICL] cache flash_attn
- add check for the minumum number of files for fine-tuning job
- Without DB, web UI should still open, tell what the problem is
- When Using Self-Hosted Server VS Code Plugin Does Not Send Requests HOT 2
- Simple API key for OSS version, so people can expose docker port via reverse proxy HOT 1
- Host Model for Embeddings for RAG
- Finetune failed with "No train files provided"
- GPU Filtering improvement
- Finetune improvement for better performance HOT 2
- Self Hosted Chat Times Out VSCode
- docker image fails to start on mac m3 HOT 8
- Database not starting? HOT 2
- Finetune of deepseek-coder fails HOT 7
- refact refuses to finetune if finds weird bytes in files HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from refact.