Related Issues (20)
- TGI on NVIDIA GH200 (Arm64) HOT 3
- Newer HF Mamba model is not supported HOT 1
- Tokenizer's unset `eos_token_id` causes Galactica model to fail when using grammar HOT 1
- The "/health" is so slow when generating extra-long textγ HOT 1
- AttributeError: 'RWConfig' object has no attribute 'num_ln_in_parallel_attn' HOT 1
- Recent issues building text-generation-server with torch+cu118 HOT 3
- TGI does not support DeepSeekCoderV2-gptq HOT 2
- AttributeError: 'Idefics2ForConditionalGeneration' object has no attribute 'model' HOT 1
- Build Intel CPU optimized image automatically HOT 1
- Add /v1/models API endpoint to be compatible with OpenAI APIs HOT 3
- RuntimeError: "weight lm_head.weight does not exist" When Loading qwen2-0.5B-Instruct HOT 3
- Tools Not Passed in Prompt Leading to Incorrect Function Calls in TGI HOT 6
- Get opentelemetry trace id from request headers instead of creating a new trace HOT 6
- Disable logging of "grammar" parameter HOT 1
- [BUG] Running FP8 quantized model fails on NVIDIA L4 (repack_fp8_for_marlin) HOT 4
- InternLM2.5 support HOT 1
- Llama3.1-8b with LoRa: This model does not support adapter loading.
- fp8 weight load failed IndexError: list index out of range HOT 2
- Tool call performs worse on v2.2.0 as compared to latest HOT 6
- torch.cuda.OutOfMemoryError: CUDA out of memory. Why isn't it handle by the queue system ?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from text-generation-inference.