Comments (4)
if you are using "ngl" with value ranging from 0-3x and using a distribution of binary with cuda , it's using gpu, you can tell from the log also when model is loaded up
from nitro.
There seems to be a problem with the install script since version 0.2.5
where the incorrect download URL is used. There are now 2 different cuda
releases:
But the download URL leaves off the -11-7
or -12-0
and only appends -cuda
. We were able to navigate around this but updating the install.sh
manually to include the -12-0
postfix. Either the script needs to be updated to allow for more granular selection or one of the created cuda
releases needs to be assigned as a default for the script to work.
from nitro.
i'd recommend not use install script since it's not recently updated, using pure binary file will give you the best experience
from nitro.
@hiento09 can also have a look at the install script
from nitro.
Related Issues (20)
- feat: Migration from coupled server to engine - Python Sub System HOT 1
- feat: Migration from coupled server to engine - llamacpp HOT 1
- feat: Migration from coupled server to engine - trt-llm
- feat: Cortex Backend Auth
- feat: Allow transforming sse chunk response
- feat: Support for multilingual embedding models and simultaneous hosting of LLM & embedding models HOT 1
- feat: /eingines - model engines listing endpoint
- bug: I'm trying to get JAN on avx with ERROR AVX2. HOT 3
- feat: integrate python-runtime as an engine
- bug: Importing Nitro/Cortex OpenAPI Specification (YAML) Files Fails in Postman, While OpenAPI Specification Files Import Successfully
- feat: Multible Documents(PDF) for Embedding HOT 1
- bug: Cannot able to build in windows
- feat: cortex-cpp: endpoint to returns all models
- Quickstart setup step 4 (not working) HOT 1
- Discussion: cortex benchmark cli HOT 3
- feat: Corββtex Storage has File System Support with YAML
- epic: Cortex supports preset configuration list
- bug: Duplicate BOS Token in Hugging Face Chat Templates
- model id required
- feat: cortex supports onnx engine
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nitro.