Hi, Thank you for your work , Can you tell me The steps of model training more in det

There are 2 way you can use: Use a link list of pixels for eac

The steps of model training about scene-text-recognition HOT 13 CLOSED

hsiehyichia commented on July 22, 2024

The steps of model training

from scene-text-recognition.

Comments (13)

09876qwrte commented on July 22, 2024

I only want to train the part of detection , without the OCR recognition

from scene-text-recognition.

HsiehYiChia commented on July 22, 2024

You need to transform your data to LBP by get_lbp_data(). After that, feed the LBP data to train_cascade().
I'll fix this to make it easier in someday :P

from scene-text-recognition.

09876qwrte commented on July 22, 2024

So，there only use get_lbp_data() and train_cascade() in the training? I have train the model by your data yesterday for 16 hours, but there have nothing result and log, Is this normal?

from scene-text-recognition.

HsiehYiChia commented on July 22, 2024

Yes, for the training of detection classifier, it only involves

get_lbp_data (get lbp data of training data)
train_cascade (train weak and strong text classifiers)

I can't recall the detail of them right now, but both functions should finish within few minutes.
The code isn't too long to trace, so you can trace it and see how it works

from scene-text-recognition.

09876qwrte commented on July 22, 2024

I found opencv_train() in your code , could this function take place of train_cascade(). and have the same result?

from scene-text-recognition.

HsiehYiChia commented on July 22, 2024

This function will train Adaboost classifier for OpenCV's build-in Machine learning module, and that won't be compatible with my Adaboost. I add this function just for comparison between my Adaboost and OpenCV's Adaboost.

from scene-text-recognition.

09876qwrte commented on July 22, 2024

Hi , I want to get the Binary MASK for every ER*, Can this be implemented in your code?

from scene-text-recognition.

HsiehYiChia commented on July 22, 2024

That is feasible, I had already try that. However, both time and space complexity will increase, especially space complexity because there are enormous amount of ER in an image.

from scene-text-recognition.

09876qwrte commented on July 22, 2024

I want to put the Binary MASK to CNN, so could you tell where you have realize it in your code,Thank you

from scene-text-recognition.

HsiehYiChia commented on July 22, 2024

There are 2 way you can use:

Use a link list of pixels for each ER. Whenever a pixel is accumulate to this ER, append the pixel to the link list. It is implemented in er_accumulate or er_tree_extract. You can utilize int pixel and **struct plist ** of struct ER. A brief intro. of int pixel is it stands for the position of a pixel:
pixel%image_width = x ; pixel/image_width = y
Another work around is to binarize the ER since we keep tracking the level of every ER. You can utilize int level of struct ER. Keep in mind that this method may not give you exact pixel mask because the bounding box of ER could contain something which is not a text.

from scene-text-recognition.

09876qwrte commented on July 22, 2024

Thank you very much, And I want to continue training on the basis of the last training result, so I would like to ask if I can use the last training result to initialize the current training model, If this way is ok, what should to change

from scene-text-recognition.

HsiehYiChia commented on July 22, 2024

I am sorry, I don't get it. What do you mean "training on the basis of the last training result"?

from scene-text-recognition.

09876qwrte commented on July 22, 2024

the last training result is the "strong.classifier" and "weak.classifier", The "training on the basis of the last training result" is like the finetune in the deep learning

from scene-text-recognition.

The steps of model training about scene-text-recognition HOT 13 CLOSED

Comments (13)

Related Issues (13)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent