Comments (3)
I'd say being able to infer beyond EOT is a feature some might want, even if it's just to run an experiment and see what happens. But I'm OK with making the API harder to misuse, as long as it's still possible to request inference for a new token after EOT.
Returning some sort of "EOT" error in infer_next_token sounds like it might be what we want?
I wonder if A[EOT]B infers differently to AB...
Yes, for sure. EOT is an important token and the transformer will interpret both strings completely differently 👍
from llm.
Yeah, that's what I was thinking. It wouldn't allow your use-case, though: if you returned an EOT error, you wouldn't be able to get the inferred token back.
from llm.
Discussed this on the Discord - it makes much more sense to return the EOT token itself as an error, and then let users continue inferring if they want.
from llm.
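A minimal sketch of the design agreed on above, where EOT comes back as an error that still carries the token, so callers stop by default but can keep inferring if they want. Everything here is hypothetical and for illustration only: `TokenId`, the `EOT` value, `InferError`, and the scripted `Session` are stand-ins, not the crate's actual types or signatures.

```rust
/// Hypothetical token id type (an assumption, not the crate's definition).
type TokenId = u32;

/// Hypothetical end-of-text id chosen for this sketch.
const EOT: TokenId = 2;

#[derive(Debug, PartialEq)]
enum InferError {
    /// The model produced EOT. The token travels inside the error,
    /// so the caller can still inspect it and decide what to do next.
    EndOfText(TokenId),
}

/// Stand-in for a real inference session: it replays a scripted token stream.
struct Session {
    scripted: Vec<TokenId>,
    pos: usize,
}

impl Session {
    fn new(scripted: Vec<TokenId>) -> Self {
        Session { scripted, pos: 0 }
    }

    /// Returns the next token, or `Err(EndOfText(token))` when it is EOT.
    /// The session stays usable after the error.
    fn infer_next_token(&mut self) -> Result<TokenId, InferError> {
        // An exhausted script is treated as EOT to keep the sketch total.
        let token = self.scripted.get(self.pos).copied().unwrap_or(EOT);
        self.pos += 1;
        if token == EOT {
            Err(InferError::EndOfText(token))
        } else {
            Ok(token)
        }
    }
}

/// Default caller: stops at the first EOT, which is hard to get wrong.
fn generate_until_eot(session: &mut Session, max_tokens: usize) -> Vec<TokenId> {
    let mut out = Vec::new();
    for _ in 0..max_tokens {
        match session.infer_next_token() {
            Ok(token) => out.push(token),
            Err(InferError::EndOfText(_)) => break,
        }
    }
    out
}

/// Experimental caller: records EOT and deliberately keeps inferring.
fn generate_past_eot(session: &mut Session, max_tokens: usize) -> Vec<TokenId> {
    let mut out = Vec::new();
    for _ in 0..max_tokens {
        match session.infer_next_token() {
            Ok(token) | Err(InferError::EndOfText(token)) => out.push(token),
        }
    }
    out
}
```

With this shape, continuing past EOT has to be an explicit choice in the caller's `match`, which seems to cover both the "harder to misuse" goal and the experiment use-case.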