Comments (3)
The English config doesn't work for the sentence, it gives the following results:
deeppavlov_model = build_model(
"ner_ontonotes_bert",
install=True,
download=True)
sentence = 'Today at 13:10 we had a meeting'
output = deeppavlov_model([sentence])
print(output[0])
[['Today', 'at', '13', ':', '10', 'we', 'had', 'a', 'meeting']]
print(output[1])
[['B-DATE', 'O', 'B-TIME', 'O', 'B-TIME', 'O', 'O', 'O', 'O']]
However, I expect 13:10
to be predicted as B-TIME, I-TIME, I-TIME
.
I would also like to use the model for Dutch sentences, therefore I chose to use the multilingual version. In the older version of Deeppavlov these mistakes didn't occur. The problem also occurs for other entities with interpunctions.
Would you happen to have any suggestions on how to solve this problem?
from deeppavlov.
The model ner_ontonotes_bert.json
https://github.com/deeppavlov/DeepPavlov/blob/master/deeppavlov/configs/ner/ner_ontonotes_bert.json
gives correct prediction for your example sentence. I suggest using English config for English sentences.
from deeppavlov.
from deeppavlov.
Related Issues (20)
- 👩💻📞DeepPavlov Community Call #18
- ERROR while training rel_ranking model of KBQA pipeline HOT 1
- Doesn't work with recent version of pytorch-crf HOT 3
- No such file or directory: 'syntax_ru_syntagrus_bert' HOT 2
- CRF model is not saved to correct path when training NER HOT 3
- ModuleNotFoundError: No module named 'torch' HOT 1
- Can't install model (deeppavlov colab example ) HOT 2
- Predictions NER for entities with interpunction HOT 2
- Neural morphological tagging with DeepPavlov: howto
- Insult detection algorithm (additional insult) HOT 1
- Columns and DataType Not Explicitly Set on line 411 of docred_reader.py
- Проблема с парсингом входных данных через api и через socket HOT 3
- Repl: KeyboardInterrupt, EOFError
- requests.exceptions.SSLError: HTTPSConnectionPool(host='files.deeppavlov.ai', port=443): Max retries exceeded with url: /v1/ner/ner_ontonotes_bert_mult_torch_crf.tar.gz (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: certificate has expired (_ssl.c:1091)')))
- The demo for the NER model contains an incorrect model name HOT 6
- Outdated Pandas and other dependencies
- Update to pydantic v2
- NER - “input sequence after bert tokenization shouldn’t exceed 512 tokens” (ner_bert_base) HOT 1
- ValueError: BuilderConfig 'terra' not found. Available: ['default']
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deeppavlov.