Comments (13)
I only want to train the part of detection , without the OCR recognition
from scene-text-recognition.
You need to transform your data to LBP by get_lbp_data(). After that, feed the LBP data to train_cascade().
I'll fix this to make it easier in someday :P
from scene-text-recognition.
So,there only use get_lbp_data() and train_cascade() in the training? I have train the model by your data yesterday for 16 hours, but there have nothing result and log, Is this normal?
from scene-text-recognition.
Yes, for the training of detection classifier, it only involves
- get_lbp_data (get lbp data of training data)
- train_cascade (train weak and strong text classifiers)
I can't recall the detail of them right now, but both functions should finish within few minutes.
The code isn't too long to trace, so you can trace it and see how it works
from scene-text-recognition.
I found opencv_train() in your code , could this function take place of train_cascade(). and have the same result?
from scene-text-recognition.
This function will train Adaboost classifier for OpenCV's build-in Machine learning module, and that won't be compatible with my Adaboost. I add this function just for comparison between my Adaboost and OpenCV's Adaboost.
from scene-text-recognition.
Hi , I want to get the Binary MASK for every ER*, Can this be implemented in your code?
from scene-text-recognition.
That is feasible, I had already try that. However, both time and space complexity will increase, especially space complexity because there are enormous amount of ER in an image.
from scene-text-recognition.
I want to put the Binary MASK to CNN, so could you tell where you have realize it in your code,Thank you
from scene-text-recognition.
There are 2 way you can use:
- Use a link list of pixels for each ER. Whenever a pixel is accumulate to this ER, append the pixel to the link list. It is implemented in er_accumulate or er_tree_extract. You can utilize int pixel and **struct plist ** of struct ER. A brief intro. of int pixel is it stands for the position of a pixel:
pixel%image_width = x ; pixel/image_width = y - Another work around is to binarize the ER since we keep tracking the level of every ER. You can utilize int level of struct ER. Keep in mind that this method may not give you exact pixel mask because the bounding box of ER could contain something which is not a text.
from scene-text-recognition.
Thank you very much, And I want to continue training on the basis of the last training result, so I would like to ask if I can use the last training result to initialize the current training model, If this way is ok, what should to change
from scene-text-recognition.
I am sorry, I don't get it. What do you mean "training on the basis of the last training result"?
from scene-text-recognition.
the last training result is the "strong.classifier" and "weak.classifier", The "training on the basis of the last training result" is like the finetune in the deep learning
from scene-text-recognition.
Related Issues (13)
- I am using visual studio 2015 enterprice and getting error (Errpr : "node" is ambigious")? HOT 4
- error while making the scene text recognition
- Should the text data and non-text data in the root directory of /res/neg and /res/pos?
- What are these result output windows 'pool, weak, strong, tracked, result, all ' means respectively? HOT 2
- Poor Results
- Now i am getting this error after running the .exe file HOT 5
- What does it mean to be “strong” and “weak”? HOT 2
- I get this error when executing it via VisualStudio2015 HOT 4
- how can I train with my own data HOT 1
- Documentation on Training HOT 1
- Error executing with a video argument HOT 2
- The positive and negative samples cannot be uncompressed after downloading HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scene-text-recognition.