Comments (6)
Thank you for raising this! Bill (@yuchenlin) and I will try to find a workaround.
from crossfit.
Hi Sewon,
I'm trying to reproduce this issue but my scripts are working as expected. Could you please provide some extra information for us? Thank you.
- What are the error messages you're getting?
- Could you double-check that your Hugging Face datasets library is version 1.4.0, and try the scripts again after clearing the cache?
Attaching my logs for reference.
from crossfit.
Hi @cherry979988, thank you for your help. Yes, I double-checked that the HF datasets version is 1.4.0, and the error keeps occurring after clearing the cache. Error messages are saved here.
P.S. I think once you have downloaded the data, it is stored in the cache. Perhaps that is why you were not able to reproduce the error?
from crossfit.
Hi @shmsw25
Thank you for providing the logs. I am able to reproduce the errors.
My guess is that the dataset owners updated their files and the checksums in HF datasets have not been updated yet, which is why we're getting this checksum error.
A temporary solution is to pass ignore_verifications=True when loading datasets (e.g., dataset = load_dataset("kilt_tasks", "wow", ignore_verifications=True)). However, this will probably lead to differences in few-shot sampling. I'll discuss with Bill and see if there is a better solution...
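For reference, the workaround above could be wrapped so that verification is skipped only when a checksum mismatch actually occurs, rather than for every dataset. This is a minimal sketch, not code from this repository: the helper name load_with_checksum_fallback is hypothetical, and it assumes the failure surfaces as an exception whose class is named NonMatchingChecksumError (as in the datasets releases of this period). The loader is passed in as an argument, so the helper itself does not import datasets.

```python
import warnings


def load_with_checksum_fallback(loader, *args, **kwargs):
    """Call `loader` normally; retry without verification only on a checksum mismatch.

    `loader` is expected to be `datasets.load_dataset`. The helper name is
    hypothetical and exists only for this illustration.
    """
    try:
        return loader(*args, **kwargs)
    except Exception as err:
        # Assumption: the mismatch is raised as NonMatchingChecksumError.
        # Matching on the class name keeps this sketch free of library imports.
        if type(err).__name__ != "NonMatchingChecksumError":
            raise
        warnings.warn(
            f"Checksum mismatch for {args}; retrying with ignore_verifications=True"
        )
        # dict(...) overrides any ignore_verifications value the caller passed.
        retry_kwargs = dict(kwargs, ignore_verifications=True)
        return loader(*args, **retry_kwargs)


# Intended usage (requires the `datasets` package and network access):
# from datasets import load_dataset
# dataset = load_with_checksum_fallback(load_dataset, "kilt_tasks", "wow")
```

The warning makes it visible which datasets were loaded without verification, which matters here because unverified files may differ from the ones used for the original few-shot sampling.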
from crossfit.
Got it, thank you for taking a look at this!
from crossfit.
@cherry979988 Would you mind sharing your cached copies of the following datasets? I cannot download them because my network is unavailable.
- jeopardy
- kilt_wow
- definite_pronoun_resolution
- wiki_auto
from crossfit.