Comments (8)
> Yeah I won't merge such large changes
Fair enough. I think at some point you're going to have to clean up the code and it's going to be more difficult the larger and more complicated it becomes. Your choice though.
> The best way to understand something is to write your own version.
I actually already did. I wrote a Rust based implementation here: https://github.com/KerfuffleV2/smolrsrwkv
It's CPU only and actually runs about as fast as ChatRWKV in cpu fp32 and cpu fp32i8 mode.
If you think it could help other people in the community, please feel free to link it if you want to. (Also, I'd be happy to receive any constructive criticism about it.)
My code is a horrible mess (too busy to refactor)
You can use the RWKV pip package (https://pypi.org/project/rwkv/) to write your own version from scratch.
I'm getting an error after converting the model.
Make sure you are using the latest rwkv (0.7.2):
pip install rwkv --upgrade
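To confirm which version is actually installed before or after upgrading, a small check along these lines may help. This is only a sketch: the helper names `parse_version`, `is_at_least`, and `installed_version` are made up for illustration, and it assumes plain dotted numeric versions such as 0.7.2 (a suffix like `rc1` would need a proper version parser).

```python
from importlib import metadata


def parse_version(text):
    """Turn a dotted release string such as '0.7.2' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def is_at_least(installed, required):
    """Return True when `installed` is the same as or newer than `required`."""
    return parse_version(installed) >= parse_version(required)


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("rwkv")
    if found is None:
        print("rwkv is not installed; run: pip install rwkv")
    elif is_at_least(found, "0.7.2"):
        print(f"rwkv {found} is recent enough")
    else:
        print(f"rwkv {found} is older than 0.7.2; run: pip install rwkv --upgrade")
```

Comparing integer tuples rather than raw strings avoids treating 0.10.0 as older than 0.7.2.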
> My code is a horrible mess (too busy to refactor)
Just checking. You're still not interested in letting someone else clean it up. Correct?
(Mostly talking about the actual model, not the chat example.)
> Just checking. You're still not interested in letting someone else clean it up. Correct?
> (Mostly talking about the actual model, not the chat example.)
Please feel free to fork and write a better version :)
I will clean it up when I am less busy (I'm now training lots of models every day).
I'd be willing to do that, but it's something that would be a significant amount of work. So it basically would only be worth putting the time into if there was a pretty good chance of the changes getting accepted.
Obviously it wouldn't be reasonable to expect you to promise to merge a pull request before even seeing it, but there's still a way to find out if something is generally the kind of change you're looking for or would accept. You can see my previous pull request #47 for the type of changes I'm talking about.
> I'd be willing to do that, but it's something that would be a significant amount of work. So it basically would only be worth putting the time into if there was a pretty good chance of the changes getting accepted.
Yeah, I won't merge such large changes, but it will be great for understanding how the chat mode works, if you are interested :) The best way to understand something is to write your own version.
And it will be beneficial for the community because people can look at different implementations.