Comments (3)
we haven't the computed parameters, please refer to the original implementation
from fastedit.
We didn't add this constant because it needs to be computed for each model on the wiki data, reducing the convenience of the library. We also found that removing it does not significantly affect the effectiveness of the Rome algorithm. Nevertheless, adding a hyperparameter to provide the value of this constant should be a better choice.
from fastedit.
I see. For now, is there any ready-made constant? If not, I think I may check the original Rome code to compute them by myself.
from fastedit.
Related Issues (20)
- Would you consider supporting ChatGLM2-6B? HOT 5
- 编辑完的baichuan-13b该如何保存 HOT 3
- A little mistake in HyperParams HOT 1
- 使用在线量化的baichuan 13b chat 报错 LookupError: model.layers.5.mlp.down_proj.weight HOT 3
- 这种编辑的方式有副作用吗?比如模型遗忘问题 HOT 2
- 单张80G卡编辑7B模型 报显存不足 想请教一下如何单机多卡去run HOT 2
- 数据集格式
- LLaMA-2-7b-chat Editing failed
- this is not a good idea, may lead to severe overfitting. HOT 1
- 显存占用 HOT 1
- qwen support HOT 1
- RuntimeError: computing v Vector
- 运行时报错 HOT 2
- 请问编辑后的模型储存在哪里了 HOT 3
- 训练方式和LLaMA-Efficient-Tuning-main区别
- Llama-2-7b-chat - RuntimeError: Inference tensors cannot be saved for backward HOT 2
- [Llama-2-7b-chat] RuntimeError: expected scalar type Float but found Half HOT 7
- 编辑baichuan13b的时候报错NotImplementedError
- 错误 :TypeError: can't convert cuda:0 device type tensor to numpy. HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from fastedit.