Comments (4)
Thank you for the questions!
- The retrieval results reported in the papers are all passage retrieval results (i.e., "Pid_Prec@n"), not document retrieval results (i.e., "Doc_Prec@n" in the output). The passage results you got are quite comparable to the last three columns in Table 4. run_eval_rag_re.sh only provides the retrieval results. For text generation evaluation scores (F1, EM, BL in Table 4), please refer to run_eval_rag_e2e.sh.
- D^token-*-ft means that we use fine-tuned DPR encoders for the document index (run_kb_index.sh) and the bi-encoder for the Retriever Module in RAG (run_converter.sh).
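To illustrate why the passage-level and document-level numbers can differ, here is a minimal sketch. It is not the repository's evaluation code: the function name, the ids, and the passage-to-document mapping are hypothetical, and precision@n is assumed to mean the fraction of the top-n retrieved ids that appear in the gold set.

```python
def precision_at_n(retrieved, gold, n):
    """Fraction of the top-n retrieved ids that appear in the gold set (assumed definition)."""
    top = set(retrieved[:n])
    return len(top & set(gold)) / n


# Hypothetical mapping from passage id to the document it belongs to.
pid_to_doc = {"p1": "docA", "p2": "docA", "p3": "docB", "p4": "docC"}

retrieved_pids = ["p2", "p3", "p1", "p4"]  # ranked retrieval output (made up)
gold_pids = ["p1"]                         # gold grounding passage (made up)

# Passage-level score: the top passage p2 is not the gold passage p1.
pid_prec = precision_at_n(retrieved_pids, gold_pids, n=1)

# Document-level score: p2 and p1 both belong to docA, so the document matches.
retrieved_docs = [pid_to_doc[p] for p in retrieved_pids]
gold_docs = [pid_to_doc[p] for p in gold_pids]
doc_prec = precision_at_n(retrieved_docs, gold_docs, n=1)

print(pid_prec, doc_prec)
```

In this toy case the document-level score is higher than the passage-level one, because retrieving a neighbouring passage from the right document counts as a hit only at the document level.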
from multidoc2dial.
Thanks for your reply!
So if I am not mistaken, Table 5 is from run_eval_rag_re.sh, and Tables 4 and 6 are from run_eval_rag_e2e.sh, with the task set as grounding and generation, is that right?
I mistakenly thought D^token-ft was DPR and D^token-rr-cls-ft was RAG, so are they all RAG? I am also still confused about the difference between D^token-ft and the *-rr-* variants.
- Tables 4 and 6 contain retrieval evaluation (run_eval_rag_re.sh) and text generation evaluation (run_eval_rag_e2e.sh) results by RAG models. Table 5 contains DPR retrieval results, not RAG.
- -rr- corresponds to reranking the retrieved passages by the RAG retriever based on the retrieval results by only the current turn, where the embedding of the current turn is based on [CLS] (rr-cls) or pooled (rr-pl). Please see Section 3.2 of the paper and the code as a reference.
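As a rough illustration of the difference between a [CLS]-based and a pooled embedding of the current turn, here is a toy sketch. It is not the repository's implementation: all function names and vectors are hypothetical, "pooled" is assumed here to mean mean pooling over token vectors, and passages are assumed to be scored by dot product.

```python
def cls_embedding(token_vecs):
    """Use the first token's vector (the [CLS] position) as the turn embedding."""
    return token_vecs[0]


def mean_pooled_embedding(token_vecs):
    """Average all token vectors (assumed meaning of 'pooled' in this sketch)."""
    dim = len(token_vecs[0])
    return [sum(v[i] for v in token_vecs) / len(token_vecs) for i in range(dim)]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def rerank(passages, query_vec):
    """Sort (passage_id, vector) pairs by dot-product score, highest first."""
    return sorted(passages, key=lambda p: dot(p[1], query_vec), reverse=True)


# Made-up token vectors for the current turn; index 0 plays the role of [CLS].
turn_tokens = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
# Made-up passage vectors for two already-retrieved passages.
passages = [("p1", [1.0, 0.0]), ("p2", [0.0, 1.0])]

order_cls = [pid for pid, _ in rerank(passages, cls_embedding(turn_tokens))]
order_pl = [pid for pid, _ in rerank(passages, mean_pooled_embedding(turn_tokens))]

print(order_cls, order_pl)
```

With these toy vectors the two turn embeddings produce different passage orders, which is the kind of difference the rr-cls and rr-pl settings are meant to capture.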
Ok, I've got it, thank you for your prompt reply!