manivannanmurugavel / spacy-ner-annotator

Train spaCy NER with a custom dataset

Home Page: https://medium.com/@manivannan_data/how-to-train-ner-with-custom-training-data-using-spacy-188e0e508c6

JavaScript 41.79% Python 33.21% HTML 25.00%
spacy-nlp spacy-nlp-ner spacy-ner spacy-ner-annotator ner-annotator annotator-spacy nlp-annotator

spacy-ner-annotator

Installation

pip3 install spacy

Steps for usage

  1. Open index.html in a browser and load your data into it.
  2. After annotating, download the data and convert it to spaCy format using convert_spacy_train_data.py.
  3. Optionally split the data into train and test sets, then add them to train.py.
  4. Set the hyper-parameters, then run train.py. Iterations and losses are logged in output_log.txt; precision, recall and F1 scores are logged in train_output.txt and test_output.txt.
  5. Check progress by running losses_plotter.py.
  6. To train on top of an existing model, download the model and add its name in train.py.
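
Steps 2 and 3 can be sketched as follows. This is a minimal illustration, not the repository's convert_spacy_train_data.py. It assumes the annotator exports one JSON object per line with a "content" string and an "entities" list of [start, end, label] triples. The "entities" shape matches the output quoted in the issues further down; the "content" field name, the sample sentences, and all helper names are assumptions made for this example. The target shape is the (text, {"entities": [...]}) tuple commonly used as NER training data in spaCy v2-style scripts.

```python
import json
import random


def to_spacy_example(record):
    """Turn one annotated record into a (text, {"entities": [...]}) tuple."""
    text = record["content"]  # assumed field name, not confirmed by the README
    entities = [(int(start), int(end), label)
                for start, end, label in record["entities"]]
    return (text, {"entities": entities})


def load_annotations(lines):
    """Parse an iterable of JSON strings, skipping blank lines."""
    return [to_spacy_example(json.loads(line)) for line in lines if line.strip()]


def split_train_test(examples, test_fraction=0.2, seed=0):
    """Shuffle a copy of the examples and split off a test set."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical export, one JSON object per line.
sample_lines = [
    '{"content": "Apple hired Tim Cook.", "entities": [[0, 5, "ORG"], [12, 20, "PERSON"]]}',
    '{"content": "Paris is in France.", "entities": [[0, 5, "GPE"], [12, 18, "GPE"]]}',
    '{"content": "Google opened an office in Berlin.", "entities": [[0, 6, "ORG"], [27, 33, "GPE"]]}',
    '{"content": "No entities here.", "entities": []}',
    '{"content": "Ada Lovelace wrote programs.", "entities": [[0, 12, "PERSON"]]}',
]

examples = load_annotations(sample_lines)
train_data, test_data = split_train_test(examples, test_fraction=0.2)
print(len(train_data), "training examples,", len(test_data), "test examples")
```

A fixed seed keeps the split reproducible between runs, which makes the scores logged for the train and test sets comparable across experiments.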

Details & Credits

Visit this url:

https://manivannanmurugavel.github.io/annotating-tool/spacy-ner-annotator/

Contributors

apzl, manivannanmurugavel, sakshi2399, varad-more

spacy-ner-annotator's Issues

The json training file isn't saved

Hi, thank you for your work; your interface and code would be very useful for my project!
I tried it, and after annotating a few sentences I saved the JSON file, but it wasn't saved on my computer. Is there an issue?
Edit: It works on Google Chrome ;)

Cannot capture multiple words at once

Hi,

When I try to highlight a sentence, I get an error saying: 'Please select entity inside content'

What I want to do is input the sentence as a whole.

Convert to Spacy format

Hi !
Does your tool convert files generated by WebAnno (UIMA binary CAS, UIMA CAS JSON, UIMA CAS XMI, CoNLL, TSV3) to the spaCy NER training data format?
Thank you in advance !!!

Thanks giving

Great work. I find this helpful. Please ignore this issue if you want; I just wanted to give a big thanks to you guys.

Error : object of type 'NoneType' has no len()

I have used the code exactly as written, changed TRAIN_DATA, and used your NER annotator to get the data, but the error is still thrown as soon as the train_spacy function is called.
I guess it's returning null, hence the error. Please help.
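
The issue does not say which value is None, so the following is only a way to narrow it down, not a diagnosis. It is a minimal sketch that checks a TRAIN_DATA value before it is handed to a training function, assuming the spaCy v2-style shape of a list of (text, {"entities": [(start, end, label), ...]}) pairs. The function name check_train_data is hypothetical and is not part of train.py.

```python
def check_train_data(train_data):
    """Return a list of human-readable problems found in train_data."""
    if train_data is None:
        return ["TRAIN_DATA is None"]
    problems = []
    for i, item in enumerate(train_data):
        # Each item should be a (text, annotations) pair.
        if not (isinstance(item, (tuple, list)) and len(item) == 2):
            problems.append(f"item {i}: expected a (text, annotations) pair")
            continue
        text, annotations = item
        if not isinstance(text, str):
            problems.append(f"item {i}: text is {type(text).__name__}, not str")
            continue
        entities = annotations.get("entities") if isinstance(annotations, dict) else None
        if entities is None:
            problems.append(f"item {i}: missing 'entities' list")
            continue
        for ent in entities:
            if len(ent) != 3:
                problems.append(f"item {i}: entity {ent!r} is not (start, end, label)")
                continue
            start, end, _label = ent
            # Offsets must describe a non-empty slice inside the text.
            if not (0 <= start < end <= len(text)):
                problems.append(f"item {i}: offsets {start}-{end} fall outside the text")
    return problems


good = [("Apple hired Tim Cook.", {"entities": [(0, 5, "ORG")]})]
bad = [("Short text", {"entities": [(0, 50, "ORG")]}), ("No annotations", None)]

print(check_train_data(good))  # an empty list means no problems were found
for problem in check_train_data(bad):
    print(problem)
```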

Annotator is taking the same indices if the two entities have same value

For example, if the text is: Start Date/Time: 4/12/2019 2:00:00 AM EASTERN End Date/Time: 4/12/2019 6:00:00 AM EASTERN Description: Section Roll Expected Impact: 4 Hours Contact Information: Call the SCCC, and reference RFC-0051317 Thank you, Cincinnati Bell Change, it gives the following output:
"entities": [[627,636,"StartDate"],[637,647,"StartTime"],[627,636,"EndDate"],[681,691,"EndTime"],[648,655,"TimeZone"].
Here the start date and end date have the same value (4/12/2019), so it takes the same indices for both entities. Please check it. Thank you in advance.
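
The reported symptom is what one would see if offsets were looked up by the first match of the selected string, although the annotator's actual implementation is not shown here, so that cause is an assumption. The sketch below illustrates the difference between a first-match lookup and listing every occurrence, and adds a small check that flags any span shared by more than one label. The helper names are hypothetical, and the text is shortened from the issue, so the offsets differ from the ones quoted above.

```python
import re
from collections import defaultdict


def find_all_spans(text, value):
    """Return (start, end) for every occurrence of value in text."""
    return [(m.start(), m.end()) for m in re.finditer(re.escape(value), text)]


def labels_sharing_a_span(entities):
    """Map each (start, end) span used by more than one label to those labels."""
    by_span = defaultdict(list)
    for start, end, label in entities:
        by_span[(start, end)].append(label)
    return {span: labels for span, labels in by_span.items() if len(labels) > 1}


text = ("Start Date/Time: 4/12/2019 2:00:00 AM EASTERN "
        "End Date/Time: 4/12/2019 6:00:00 AM EASTERN")
value = "4/12/2019"

# A first-match lookup always returns the earliest occurrence.
first = text.find(value)
spans = find_all_spans(text, value)
print("first match starts at", first)
print("all occurrences:", spans)

# The entity list quoted in the issue: StartDate and EndDate share one span.
reported = [[627, 636, "StartDate"], [637, 647, "StartTime"],
            [627, 636, "EndDate"], [681, 691, "EndTime"],
            [648, 655, "TimeZone"]]
print(labels_sharing_a_span(reported))
```

Running a check like labels_sharing_a_span on exported data before training makes this kind of collision visible, so the affected entries can be corrected by hand.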

How do we start the server

I have a basic question: how do you start the server so that we can open the webpage and train on the data?

The YouTube video has no sound, so I couldn't hear what you were saying.

Selected text is double when tagged

I'm selecting text to associate with an entity label but sometimes when I do so, the text itself (and sometimes other unselected texts) is repeated in an overlay that looks like a bug.

I have seen this in Chrome and Microsoft Edge.

Save into an existing model

I have a question: when
modelfile = input("Enter your Model Name: ")
prdnlp.to_disk(modelfile)
is executed and it asks for the model name, would entering the name of an already existing model directory save the model there, or would it conflict?

Thanks
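
One way to sidestep the question is to check the target directory before saving. The sketch below is an assumption-laden illustration rather than part of train.py: choose_model_dir and its allow_overwrite parameter are hypothetical, and the model save itself is left out so the example runs without spaCy installed. The returned path is what would be passed to prdnlp.to_disk, which the spaCy documentation describes as saving to a directory that is created if it does not exist.

```python
import tempfile
from pathlib import Path


def choose_model_dir(name, allow_overwrite=False):
    """Return the output path, refusing a non-empty directory by default."""
    path = Path(name)
    if path.is_file():
        raise FileExistsError(f"{path} is a file, not a directory")
    if path.is_dir() and any(path.iterdir()) and not allow_overwrite:
        raise FileExistsError(f"{path} already contains files; pick another name")
    return path


# Demonstration inside a throwaway directory.
with tempfile.TemporaryDirectory() as tmp:
    fresh = choose_model_dir(Path(tmp) / "new_model")  # does not exist yet: fine

    existing = Path(tmp) / "old_model"
    existing.mkdir()
    (existing / "meta.json").write_text("{}")  # stand-in for a saved model

    try:
        choose_model_dir(existing)
        refused = False
    except FileExistsError as exc:
        refused = True
        print("refused:", exc)

    # Explicit opt-in when reusing the directory is intended.
    reused = choose_model_dir(existing, allow_overwrite=True)
```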

TypeError: 'tuple' object is not callable

Thank you for your annotator.

But when I executed train.py after copying the text into the TRAIN_DATA variable, I got this error:

File ".\train.py", TypeError: 'tuple' object is not callable.

I am working with Python 3.7.

Could you please help me?
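
The traceback in the issue is truncated, so the cause cannot be confirmed. One common way to get this exact message when pasting annotated examples into a list is a missing comma between two tuples: Python then reads the second parenthesised group as a call on the first tuple. The sketch below reproduces that with made-up sentences; try_build is a hypothetical helper, and eval is used only so the broken literal can be shown without stopping the script.

```python
import warnings

# Correct: a comma separates the two (text, annotations) tuples.
good = [
    ("First sentence.", {"entities": []}),
    ("Second sentence.", {"entities": []}),
]

# Same list with the comma between the tuples left out, kept as source text.
broken_source = (
    '[("First sentence.", {"entities": []}) '
    '("Second sentence.", {"entities": []})]'
)


def try_build(source):
    """Evaluate a list literal and return (value, error_message)."""
    try:
        with warnings.catch_warnings():
            # Newer Python versions also warn about this pattern at compile time.
            warnings.simplefilter("ignore", SyntaxWarning)
            return eval(source), None
    except TypeError as exc:
        return None, str(exc)


value, message = try_build(broken_source)
print(message)
```

If the message matches, checking that every example in TRAIN_DATA is followed by a comma is a quick first step.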

Labels disappear when clicking the "Edit" button

Hi,
suppose I have labeled some text.
At some point I want to edit it for some reason.
If I click on the Edit button and then Save, the labels disappear.
Is this a problem or expected behaviour?
In addition, when I delete some text, it doesn't disappear until I click the "Save" button.
Many thanks in advance
