Comments (3)
OK, thanks! How much memory is available on the system? Is it possible that it is just running out of RAM?
Also how long is the file that you're trying to autocomplete? Is it hundreds of lines long? Does the system work on a short or new file? There might be an issue with how much context is being fed through via vscode-fauxpilot
from turbopilot.
I believe it is highly unlikely that it was running out of memory. The system has 24 GB of RAM, and the test was simply run on a single-line Python file containing def hello_world():
from turbopilot.
I'm using MODEL="/models/santacoder-q4_0.bin" and have the same issue:
[2023-08-21 10:01:54.792] [info] Initializing Starcoder/Wizardcoder type model for 'starcoder' model type
[2023-08-21 10:01:54.792] [info] Attempt to load model from starcoder
load_model: loading model from '/models/santacoder-q4_0.bin'
load_model: n_vocab = 49280
load_model: n_ctx = 2048
load_model: n_embd = 2048
load_model: n_head = 16
load_model: n_layer = 24
load_model: ftype = 2002
load_model: qntvr = 2
load_model: ggml ctx size = 1542.88 MB
load_model: memory size = 768.00 MB, n_mem = 49152
load_model: model size = 774.73 MB
[2023-08-21 10:01:56.353] [info] Loaded model in 1561.73ms
(2023-08-21 10:01:56) [INFO ] Crow/1.0 server is running at http://0.0.0.0:18080 using 2 threads
(2023-08-21 10:01:56) [INFO ] Call `app.loglevel(crow::LogLevel::Warning)` to hide Info level logs.
(2023-08-21 10:03:55) [INFO ] Request: 192.168.0.1:53945 0x7f27b5a5f040 HTTP/1.1 POST /v1/engines/codegen/completions
Segmentation fault (core dumped)
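To rule out the editor plugin, it may help to send one short request straight to the server and see whether it still segfaults. Below is a minimal sketch, not a confirmed reproduction. The port and path are copied from the log above. `localhost` and the JSON field names (`prompt`, `max_tokens`, OpenAI-style) are assumptions about what the server accepts. The helper names `build_completion_request` and `send` are made up for illustration.

```python
import json
import urllib.request

# Port taken from the server log above; localhost is an assumption.
BASE_URL = "http://localhost:18080"
# Path copied from the request line in the log.
COMPLETIONS_PATH = "/v1/engines/codegen/completions"


def build_completion_request(prompt, max_tokens=16, base_url=BASE_URL):
    """Build (but do not send) a POST request for a short completion.

    The field names "prompt" and "max_tokens" follow the OpenAI-style
    completions format; that the server expects exactly these is an assumption.
    """
    body = json.dumps({"prompt": prompt, "max_tokens": max_tokens}).encode("utf-8")
    return urllib.request.Request(
        base_url + COMPLETIONS_PATH,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Same one-line prompt as the test described earlier in the thread.
req = build_completion_request("def hello_world():")
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
# To actually hit a running server, call: print(send(req))
```

If the server still crashes on this one-line prompt, the amount of context sent by the client is probably not the cause. If it survives, comparing this body with what the plugin sends would be the next thing to look at.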
from turbopilot.