arvinzhuang / dsi-transformers
A huggingface transformers implementation of "Transformer Memory as a Differentiable Search Index"
License: MIT License
I notice the code sets max_steps=1000000 (1000k), but the hits@1 and hits@10 figures only show scores up to 120k steps. Does training continue to the full 1000k steps?
Hello, I can't get step 2 of the official code to run. Here is the error message:
Traceback (most recent call last):
File "train.py", line 153, in
main()
File "train.py", line 77, in main
tokenizer = T5Tokenizer.from_pretrained(model_name, cache_dir='cache')
File "/opt/conda/envs/wyc_308/lib/python3.8/site-packages/transformers/tokenization_utils_base.py", line 1724, in from_pretrained
resolved_vocab_files[file_id] = cached_path(
File "/opt/conda/envs/wyc_308/lib/python3.8/site-packages/transformers/file_utils.py", line 1921, in cached_path
output_path = get_from_cache(
File "/opt/conda/envs/wyc_308/lib/python3.8/site-packages/transformers/file_utils.py", line 2177, in get_from_cache
raise ValueError(
ValueError: Connection error, and we cannot find the requested files in the cached path. Please try again or make sure your Internet connection is on.
wandb: Waiting for W&B process to finish... (failed 1).
wandb: You can sync this run to the cloud by running:
wandb: wandb sync /root/projects/wyc/dsi/wandb/offline-run-20230718_015122-1nx1bm3h
wandb: Find logs at: ./wandb/offline-run-20230718_015122-1nx1bm3h/logs
Do you have any suggestions?
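The traceback shows `from_pretrained` failing because the tokenizer files can't be downloaded. One common workaround, sketched below, is to fetch the files once on a machine with internet access and then run training fully offline. (This assumes the `model_name` in train.py is `t5-base`; adjust to whatever name the script actually uses.)

```shell
# Download the model/tokenizer files once into a local directory
# (requires the huggingface_hub CLI and a working connection).
huggingface-cli download t5-base --local-dir ./t5-base-local

# Then run training without any network access; transformers will
# refuse to hit the network and use only local/cached files.
TRANSFORMERS_OFFLINE=1 python train.py
```

You can also point `T5Tokenizer.from_pretrained("./t5-base-local")` directly at the downloaded directory instead of the hub name.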
Three types of docid representation are introduced in the paper "Transformer Memory as a Differentiable Search Index": Unstructured Atomic Identifiers, Naively Structured String Identifiers, and Semantically Structured Identifiers.
Your code currently implements only the first type, Unstructured Atomic Identifiers: in the decoding phase, only integer docids are generated. I believe one potential cause of the lower performance compared to the source paper might be a suboptimal selection of INT_TOKEN_IDS.
I suggest removing this section and retraining the DSI model.
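For context, the idea behind an `INT_TOKEN_IDS` set is to restrict decoding to vocabulary entries that are purely digits, so the model can only emit integer docids. Below is a minimal sketch of how such a set could be built; the toy `vocab` dict stands in for `tokenizer.get_vocab()`, and `build_int_token_ids` is an illustrative helper name, not necessarily what this repo calls it.

```python
def build_int_token_ids(vocab, eos_token_id):
    """Collect ids of vocab entries that are purely digits, plus EOS.

    `vocab` maps token string -> token id, as returned by
    tokenizer.get_vocab(). SentencePiece marks word-initial pieces
    with a leading "▁", so that prefix is stripped before checking.
    """
    ids = [tid for tok, tid in vocab.items() if tok.lstrip("▁").isdigit()]
    ids.append(eos_token_id)  # decoding must be able to terminate
    return sorted(set(ids))

# Toy vocabulary: both "7" and word-initial "▁7" qualify; "a1" does not.
toy_vocab = {"▁the": 5, "7": 100, "▁7": 101, "42": 102, "a1": 103, "</s>": 1}
print(build_int_token_ids(toy_vocab, eos_token_id=1))  # [1, 100, 101, 102]
```

The resulting list can then be supplied to `model.generate(..., prefix_allowed_tokens_fn=lambda batch_id, input_ids: INT_TOKEN_IDS)` so every decoding step is constrained to digits.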
Hi, maybe a semantic string docid would help improve the performance of DSI? In data/create_NQ_train_vali.py, a random doc id is used.
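For reference, the paper's Semantically Structured Identifiers recursively cluster document embeddings and append the cluster index at each level, so similar documents share docid prefixes. The sketch below illustrates only the id-assignment recursion; for brevity it splits on the median of the first embedding dimension, whereas the paper uses hierarchical k-means, so a real implementation would swap in a proper clustering step. All names here are illustrative.

```python
def assign_semantic_ids(embeddings, indices=None, prefix="", leaf_size=2):
    """Recursively assign prefix-structured string docids.

    embeddings: list of embedding vectors (one per document).
    Returns {doc_index: docid_string}; documents placed in the same
    subtree share a docid prefix.
    """
    if indices is None:
        indices = list(range(len(embeddings)))
    if len(indices) <= leaf_size:
        # Small enough: enumerate remaining docs within this leaf.
        return {doc: prefix + str(i) for i, doc in enumerate(indices)}
    # Stand-in "clustering": split on the median of the first dimension.
    vals = [embeddings[i][0] for i in indices]
    median = sorted(vals)[len(vals) // 2]
    left = [i for i in indices if embeddings[i][0] < median]
    right = [i for i in indices if embeddings[i][0] >= median]
    if not left or not right:  # degenerate split: stop recursing
        return {doc: prefix + str(i) for i, doc in enumerate(indices)}
    ids = {}
    ids.update(assign_semantic_ids(embeddings, left, prefix + "0", leaf_size))
    ids.update(assign_semantic_ids(embeddings, right, prefix + "1", leaf_size))
    return ids

ids = assign_semantic_ids([[0.1], [0.2], [0.9], [0.8], [0.15], [0.85]])
# docs 0, 1, 4 (small first dimension) share the "0" prefix; 2, 3, 5 share "1"
```

Because prefixes carry meaning, the decoder can generalize across docids with shared structure, which is the intuition behind the performance gain reported in the paper.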
When I download the dataset, an error appears:
OSError: Expected to be able to read 451045432 bytes for message body, got 280142879
Hello, I've read your code carefully and have a question about the data loading part (I'm trying to apply it to computer vision, though it may not work there).
One thing I don't understand: the dataset you train on contains both questions and documents, but I don't see any special handling that distinguishes them in the dataset.
Shouldn't the input be one question together with multiple documents, from which the question's docid is autoregressively generated? (From your code I suspect I may have misunderstood this.)
Could you explain this a little?
Many thanks.
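On the question above: in the DSI setup described in the paper, each training example maps one input text to one docid string; there is no question-plus-many-documents input. Indexing examples (document text → docid) and retrieval examples (query → docid) are simply mixed in the same dataset. A minimal sketch, with illustrative names and toy data (not this repo's exact code):

```python
def make_training_pairs(docs, queries):
    """Build (input_text, target_docid) pairs for seq2seq training.

    docs:    {docid: document_text} for the indexing task.
    queries: list of (query_text, docid) for the retrieval task.
    """
    pairs = []
    for docid, text in docs.items():
        pairs.append((text, str(docid)))   # indexing: document -> docid
    for query, docid in queries:
        pairs.append((query, str(docid)))  # retrieval: query -> docid
    return pairs

pairs = make_training_pairs({17: "nile river facts"},
                            [("how long is the nile", 17)])
print(pairs)  # [('nile river facts', '17'), ('how long is the nile', '17')]
```

Each pair is then tokenized independently, so the model sees one input sequence and one docid target per example.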