Comments (6)
Hi,
I think you mainly need to edit convert_examples_to_features
in data_utils.py to fit the Electra input pattern, and edit modeling_roberta.py
.
from bond.
Sure, Thanks a lot.
Also , my understanding is . first run the stage 1 by run_ner.py , and then on the top of the stage 1 output to run self-learning.py.
Please correct me if i am wrong.
Thanks a lot.
Marcus
from bond.
Hi Marcus, sorry for the confusion. Running run_self_training_ner.py
will directly produce the two-stage results. You don't need to run it on top of the stage 1 output.
from bond.
got it .
Thanks a lot.
What i just wanna know is ...... If i dont have the hp label in my dataset. how to produce it ??
Thanks a lot :)
from bond.
Hi Marcus, we produced hp labels for CoNLL03 using pos tagging and rule-based method, and see little improvement unfortunately. Hence, we decided not to use hp labels for training, by not setting args.self_training_hp_label
to be 2 or 3 (
Line 48 in 3651a92
from bond.
thanks..
If hp label is not so useful in BOND, I can run the Bond training by run_self_training_ner.py and set args.self_training_hp_label not to be 2 or 3 in the line 570 of run_self_training_ner.py.
parser.add_argument('--self_training_hp_label', type = float, default = 0, help = 'use high precision label.')
Thanks for that
Marcus
from bond.
Related Issues (20)
- Testing new dataset HOT 2
- Comparison with Positive-unlabeled learning
- question on stage 2 learning rate
- Trying BOND on new datasets and languages HOT 3
- RuntimeError: copy_if failed to synchronize: cudaErrorAssert: device-side assert triggered HOT 1
- Distant label generation code HOT 15
- Results reproduction HOT 5
- Are pseudo labels with high confidence retained ? HOT 4
- Questions About variable `self_training_hp_label`
- The file `dataset/BC5CDR-chem/turn.py` is missing
- Could you please provide the codes for matching distant labels?
- About the gazetteers information and distant label generation code HOT 4
- one question about "tags_hp" in the preprocessing stage HOT 7
- two questions about your paper HOT 1
- Reproducing distant labels with gazetteer information HOT 1
- Questions about "soft labels" HOT 2
- Question about the results HOT 1
- Can the NER model change from BERT+Linear_layer to BERT+CRF? HOT 7
- what is the format of the dataset? how to convert any new dataset into this format? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bond.