Coder Social home page Coder Social logo

Comments (14)

HelloTobe avatar HelloTobe commented on June 22, 2024 1

@MhLiao Hi, thanks for your reply!

Another problem is that some bounding boxes exceed image boundary(some coordinates are too large or negative), could you give me some adviece to solve this problem?

By the way, i'm wondering how long you finish your training.

from textboxes.

MhLiao avatar MhLiao commented on June 22, 2024

@lufo816 use the maximum rectangle(xmin,ymin,xmax,ymax) as the bounding box

from textboxes.

lufo816 avatar lufo816 commented on June 22, 2024

@MhLiao Understood, thanks a lot!

from textboxes.

vrbala avatar vrbala commented on June 22, 2024

@MhLiao I am trying to use SynthText to fine tune the model. However, I am not sure how to preprocess the data such that it is suitable to use with TextBoxes. Specifically, I couldn't figure out the schema of data in lmdb files. Could you kindly point me at some relevant references? Thanks in advance.

from textboxes.

MhLiao avatar MhLiao commented on June 22, 2024

@vrbala The schema of data in lmdb files is similar to the SSD. You can use the scripts of SSD to generate the lmdb data.

from textboxes.

vrbala avatar vrbala commented on June 22, 2024

@MhLiao Thanks so much.

from textboxes.

HelloTobe avatar HelloTobe commented on June 22, 2024

@MhLiao Hi, the SynthText datas are in decimal form, should i use the round function to make them into integers?
What pictures in SynthText did you use for training, validation or testing respectively? (I mean i don't know what pictures should be categorized into training, also is validation and testing.)
Thanks in advance.

from textboxes.

MhLiao avatar MhLiao commented on June 22, 2024

@HelloTobe yes. I used 800k images for training, 58750 images for validation.

from textboxes.

HelloTobe avatar HelloTobe commented on June 22, 2024

Thanks for your reply!
By the way, i'm wondering how long you finish your training.

from textboxes.

lomograb avatar lomograb commented on June 22, 2024

@HelloTobe You can enable CUDA or cuDNN (GPU MODE) for quick result of training time

from textboxes.

offlaner avatar offlaner commented on June 22, 2024

@MhLiao @lufo816 For training on synthText are we supposed to generate txt files in the form of :
xmin ymin xmax ymax "text"
instead of :
Xcentre Ycentre w h "text"

from textboxes.

github14343 avatar github14343 commented on June 22, 2024

@HelloTobe Have you solved the problem " bounding box exceeds image boundary." Thanks for help

from textboxes.

HelloTobe avatar HelloTobe commented on June 22, 2024

@github14343 I probably just clip the coordinate x such that 0 <= x <= H (or W).

from textboxes.

github14343 avatar github14343 commented on June 22, 2024

@HelloTobe Thanks!

from textboxes.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.