Comments (2)
Have you tried Continue.dev, an extension for VSCode? You can load the model with llama.cpp and add Deepseek as the model to use.
Here's my config (in the continue.dev text box, type /config to access) Got it from YearZero on TheBloke's discord.
{
"models": [
{
"title": "CodeLlama-34b-Instruct",
"provider": "llama.cpp",
"model": "codellama-34b",
"api_base": "http://localhost:8080"
},
{
"title": "CodeLlama-7b-Instruct",
"provider": "llama.cpp",
"model": "codellama-7b",
"api_base": "http://localhost:8080",
"system_message": ""
},
{
"title": "DeepSeek",
"provider": "llama.cpp",
"model": "deepseek-33b",
"api_base": "http://localhost:8080"
}
],
"model_roles": {
"default": "DeepSeek",
"chat": "DeepSeek",
"edit": "DeepSeek",
"summarize": "DeepSeek"
},
"system_message": "",
"slash_commands": [
{
"name": "edit",
"description": "Edit highlighted code",
"step": "EditHighlightedCodeStep"
},
{
"name": "config",
"description": "Customize Continue",
"step": "OpenConfigStep"
},
{
"name": "comment",
"description": "Write comments for the highlighted code",
"step": "CommentCodeStep"
},
{
"name": "clear",
"description": "Clear step history",
"step": "ClearHistoryStep"
},
{
"name": "share",
"description": "Download and share this session",
"step": "ShareSessionStep"
},
{
"name": "cmd",
"description": "Generate a shell command",
"step": "GenerateShellCommandStep"
}
],
"custom_commands": [
{
"name": "test",
"prompt": "Write a comprehensive set of unit tests for the selected code. It should setup, run tests that check for correctness including important edge cases, and teardown. Ensure that the tests are complete and sophisticated. Give the tests just as chat output, don't edit any file.",
"description": "Write unit tests for highlighted code"
}
],
"context_providers": [
{
"name": "diff"
},
{
"name": "url"
},
{
"name": "terminal"
}
]
}
from deepseek-coder.
Oh, looks like this is a dup of #6 where code shell was recommended.
from deepseek-coder.
Related Issues (20)
- Code to generate data HOT 1
- Pretraining code HOT 2
- 模型推理完成后怎么一直占用显存呢? HOT 1
- Catastrophic forgetting problem HOT 2
- chat completion任务时输出大量<|EOT|> token HOT 3
- Trying to finetune DeepSeek-Coder on custom Dataset HOT 13
- 33B AWQ量化+vLLM部署问题
- 如何构建微调的CoT数据 HOT 1
- 官方提供的微调训练脚本是否支持33B模型训练?(及训练相关问题) HOT 1
- Leetcode数据集的构建脚本请问可以开源吗
- 33B inference too slowly HOT 1
- Fail to fine-tune V1.5 model with custom llama script HOT 1
- How can I do continue pretraining? HOT 1
- Are NTP and FIM 2 separate stages of training, or are they combined? HOT 4
- clarification on the sentinel token format
- 使用react调用接口错误
- Does DeepSeek-Coder have wasm related knowledge? HOT 1
- Why generate "GGGGG...." ,when the input string is longer than a certain length in GGUF model? HOT 1
- What is the base context length of the model before extension to 16k? HOT 1
- 请问支持function call吗?支持在RAG中实现inline citations吗?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deepseek-coder.