Comments (5)
@cjnolet I think just the proposals without the postprocessing and tinkering with box dimensions leads to poor recall (see image below). @ankush-me can you confirm?
from synthtext.
@biggerlambda, you are correct and I have trained both the filtering classifier and the bounding box regressor model. 25% is extremely low recall for the FCRN output which is making me thinking I'm doing something wrong, my network is overfitting, or I'm not training it for long enough.
from synthtext.
From the gray curve above, multi-scale FCRN without any other post-processing gets 85% maximum recall on ICDAR. Precision can be improved through non-maximal suppression.
from synthtext.
@ankush-me, I understand tyour answer to @biggerlambda's question/suggestion but my original question in this ticket still remains unanswered. Have you ever tried using stochastic pooling? Did you find that it did not help your convolutional layers?
Also, how many epochs did you need to train your SynthText dataset on?
from synthtext.
I haven't tried any stochastic pooling.
I think it converged within 4-5 epochs.
(Closed in error -- feel free to comment if not resolved).
from synthtext.
Related Issues (20)
- Generator gives bad result HOT 4
- Ground truth file HOT 3
- results/SynthText.h5
- Text database
- Generating just one word HOT 1
- Saving masks in folder HOT 1
- Pizda shaatsan how to get generated picture with text but without border ? HOT 1
- AssertionError and text placement parameters misunderstanding
- Incorrect visualization of bboxes HOT 5
- Negative value of word-level bounding-boxes in gt.mat
- Mask and bounding boxes HOT 4
- Downloading of SynthText Pre-generated Dataset HOT 8
- Undefined functions in predict_depth.m in prep_scripts HOT 3
- Anyone know how to disable mirrored/backwards text? HOT 1
- zero-size array
- Some special characters are not generated HOT 3
- Can you generate a composite image consisting entirely of '+', '-', numbers, and decimal points?
- Do you have a non torrent download address? I would like to obtain the depth and seg of SynthText, as well as the original image file HOT 1
- Incorrect BBoxes
- downloading dataset
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from synthtext.