Coder Social home page Coder Social logo

Comments (13)

09876qwrte avatar 09876qwrte commented on July 22, 2024

I only want to train the part of detection , without the OCR recognition

from scene-text-recognition.

HsiehYiChia avatar HsiehYiChia commented on July 22, 2024

You need to transform your data to LBP by get_lbp_data(). After that, feed the LBP data to train_cascade().
I'll fix this to make it easier in someday :P

from scene-text-recognition.

09876qwrte avatar 09876qwrte commented on July 22, 2024

So,there only use get_lbp_data() and train_cascade() in the training? I have train the model by your data yesterday for 16 hours, but there have nothing result and log, Is this normal?

from scene-text-recognition.

HsiehYiChia avatar HsiehYiChia commented on July 22, 2024

Yes, for the training of detection classifier, it only involves

  • get_lbp_data (get lbp data of training data)
  • train_cascade (train weak and strong text classifiers)

I can't recall the detail of them right now, but both functions should finish within few minutes.
The code isn't too long to trace, so you can trace it and see how it works

from scene-text-recognition.

09876qwrte avatar 09876qwrte commented on July 22, 2024

I found opencv_train() in your code , could this function take place of train_cascade(). and have the same result?

from scene-text-recognition.

HsiehYiChia avatar HsiehYiChia commented on July 22, 2024

This function will train Adaboost classifier for OpenCV's build-in Machine learning module, and that won't be compatible with my Adaboost. I add this function just for comparison between my Adaboost and OpenCV's Adaboost.

from scene-text-recognition.

09876qwrte avatar 09876qwrte commented on July 22, 2024

Hi , I want to get the Binary MASK for every ER*, Can this be implemented in your code?

from scene-text-recognition.

HsiehYiChia avatar HsiehYiChia commented on July 22, 2024

That is feasible, I had already try that. However, both time and space complexity will increase, especially space complexity because there are enormous amount of ER in an image.

from scene-text-recognition.

09876qwrte avatar 09876qwrte commented on July 22, 2024

I want to put the Binary MASK to CNN, so could you tell where you have realize it in your code,Thank you

from scene-text-recognition.

HsiehYiChia avatar HsiehYiChia commented on July 22, 2024

There are 2 way you can use:

  1. Use a link list of pixels for each ER. Whenever a pixel is accumulate to this ER, append the pixel to the link list. It is implemented in er_accumulate or er_tree_extract. You can utilize int pixel and **struct plist ** of struct ER. A brief intro. of int pixel is it stands for the position of a pixel:
    pixel%image_width = x ; pixel/image_width = y
  2. Another work around is to binarize the ER since we keep tracking the level of every ER. You can utilize int level of struct ER. Keep in mind that this method may not give you exact pixel mask because the bounding box of ER could contain something which is not a text.

from scene-text-recognition.

09876qwrte avatar 09876qwrte commented on July 22, 2024

Thank you very much, And I want to continue training on the basis of the last training result, so I would like to ask if I can use the last training result to initialize the current training model, If this way is ok, what should to change

from scene-text-recognition.

HsiehYiChia avatar HsiehYiChia commented on July 22, 2024

I am sorry, I don't get it. What do you mean "training on the basis of the last training result"?

from scene-text-recognition.

09876qwrte avatar 09876qwrte commented on July 22, 2024

the last training result is the "strong.classifier" and "weak.classifier", The "training on the basis of the last training result" is like the finetune in the deep learning

from scene-text-recognition.

Related Issues (13)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.