Comments (5)
From my experience what the load_from_checkpoint
downloads extra is the models--xlm-roberta-large
. You can download it yourself and save alongside the wmt20-comet-da
model and set the environment variable TRANSFORMERS_CACHE
to point to where the models--xlm-roberta-large
is. This will avoid the download.
I'm using comet version 1.1.3
.
from comet.
Yep exactly, The downloads you are observing are the hugging face XLM-R weights. You can also cache those weights to avoid download them multiple times.
from comet.
hmmm thats strange because the load_from_checkpoint function does not perform any download. Can you provide more information? for example a snipped of the script?
from comet.
Thanks, Ricardo!
Here is the whole script that I use:
# Calculate COMET
df = pd.DataFrame({"src":source_sentences, "mt":translations, "ref":references})
data = df.to_dict('records')
model_path = "/content/drive/MyDrive/models/wmt20-comet-da/checkpoints/model.ckpt"
model = load_from_checkpoint(model_path)
seg_scores, sys_score = model.predict(data, batch_size=128, gpus=1)
print("COMET:", round(sys_score*100, 2))
Here is a screenshot of what I see.
I think it does not download wmt20-comet-da
, but it downloads that 2.24GB every time, and I am not sure what it is. Thanks!
from comet.
Thanks, @avaz and @ricardorei! This worked.
First, download XLM-RoBERTa.
from transformers import AutoModelForMaskedLM
model = AutoModelForMaskedLM.from_pretrained("xlm-roberta-large", cache_dir="/content/drive/MyDrive/models/")
Then, set the cache folder for Transformers:
import os
os.environ['TRANSFORMERS_CACHE'] = "/content/drive/MyDrive/models/"
OR
export TRANSFORMERS_CACHE="/content/drive/MyDrive/models/"
Finally, install COMET
pip3 install unbabel-comet -q
from comet.
Related Issues (20)
- [QUESTION] Train UnifiedMetric/XCOMET with word level predictions. HOT 1
- Sparsemax not actually used in COMET-KIWI, XCOMET-XL/XXL HOT 4
- Invalid link reference of reference-free model in readme
- Minimizing cpu RAM vs only use GPU RAM HOT 1
- what is the precision when load_from_checkpoint?
- Runtime error when loading wmt23-cometkiwi-da-xl HOT 1
- Different scores from different COMET package versions 1.1.2 and 2.2.1 HOT 2
- Different versions of COMET code give different scores with the same model and date.
- [QUESTION] large file scoring HOT 3
- [QUESTION] Splitting big models over multiple GPUs HOT 6
- [QUESTION] Memory footprint HOT 21
- [INPUT] Text Length of Input (source, reference, and hypothesis) HOT 2
- Change the global variable logger to comet_logger HOT 1
- Training script for XCOMET HOT 1
- Safetensors Support
- [QUESTION] OOM when load XCOMET-XXL in A100 with 40G memory for prediction HOT 4
- [QUESTION] why num_layers = num_hidden_layers + 1 HOT 1
- [QUESTION] Comet kiwi architecture HOT 11
- Training data and scripts used for wmt22-cometkiwi-da HOT 3
- Add missing library stubs or py.typed marker
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from comet.