Comments (8)
Hi @ahmedmazari-dhatim ,
Following steps helped me run it on my own dataset:
-
Create list of image names using create_list.py. There are two versions of it with this code base.
https://github.com/MhLiao/TextBoxes/blob/TextBoxes/data/coco/create_list.py
https://github.com/MhLiao/TextBoxes/blob/TextBoxes/data/ILSVRC2016/create_list.py
You can modify any of these for your use case. -
Make sure to write file sizes of images in your dataset in a text file.
-
Add mapping for your custom classes in labelmap_voc.prototxt
-
Use create_data.sh file to create lmdb. Again there are three versions of it but you can modify any of them for your use case. Their core functionality is same, you just need to change some paths and configuration params. This file calls another file which eventually refers to the files created in above steps.
If you haven't already then my suggestion would be to read these create_list.py and create_data.sh (and also create_annoset.py which is called by create_data.sh) files and understand what they are doing.
If you want then I can share my files (I shall need sometime to dig them though).
Hope this helps :)
from textboxes.
@ravikantb Did you try the Synthetic dataset or ICDAR 2013 dataset? I suggest you try them first.
from textboxes.
@MhLiao : Thanks for your suggestion. We (me and my friends here) tried ICDAR 2013 dataset and found what was wrong with our way of training. Apparently we were trying to train without using test_size.txt files which did not generate correct lmdb database files. Now we are able to train TextBoxes on our own dataset also :)
from textboxes.
Hi @ravikantb ,
How did you create lmdb file for your own dataset ?
from textboxes.
@ravikantb How do I convert the dataset annotations into Pascal_VOC format (XML format) such that the codes you mentioned above can create an appropriate lmdb file, on which Textboxes could be trained successfully. Please help.
from textboxes.
@MhLiao I am experiencing the same issue. I have used create_list.py and create_data.sh to create the training LMDB for ICDAR13 and my own custom dataset without any error. However both multibox loss and detection_eval is turning out to be exactly 0 for both. Could you please help figure out why this could be happening? I will share xml files of annotation if need be.
@ravikantb Could you please elaborate on step 2 in your previous comment? I am not able to see documentation to help create this text file of image sizes (is it dimensions (e.g: 1920x1080) or size on disk). I know it is more than a year ago but any hint would be helpful.
Thanks,
Bhargava
from textboxes.
@ravikantb Hi, I have the same problem. I am trying to train SSD on COCO2017 dataset, but i generate lmdb files without train_size.txt and test_size.txt. So, is that the problem?
By the way, the value of loss starts off at 0 and stays at 0.
I am looking forward to your reply. Thank you!
from textboxes.
I'm facing same issue.. is there someway of debbugging the issue? The logs are not useful and the error messages are not informative. In my case, increasing the size of the input images by 10% creates out-of-memory exception and without increase in size (336 x 204 is the size of the original images) I get mbox_loss = 0 (* 1 = 0 loss) messages. Don't have a clue where to look.
==================================================================
Sorry.. Again, a minor issue with annotation data completely missed by me. With the data fixed the network is training. Wish I had found this much earlier.
from textboxes.
Related Issues (20)
- Running TextBoxes on Caffe installed in Anaconda3 env HOT 2
- can not compile the CRNN HOT 5
- cannot find -lopencv_imgcodes HOT 1
- 关于模型的参数设置问题 HOT 3
- demo.py takes about 0.4s per image, when the model load only once and single scale is 700*700
- 关于 multi-scale的问题
- 关于Test的一些问题 HOT 8
- when i run "python examples/TextBoxes/train_icdar13.py",the error is occured when i train on my dataset.
- importError: libhdf5.so.101 HOT 1
- Where to place the downloaded model?
- Failed to run make -j8 HOT 1
- 请问如何您有synthtext数据集格式转换为icdar格式的脚本嘛,谢谢您分享一下鸭
- 请问如何能分享一下synthtext格式转换为icdar格式的脚本吗,谢谢鸭
- testing results on Total-Text HOT 1
- call for loss info/curve.
- 关于TextBoxes_icdar13.caffemodel模型
- Question about mismatch between the code with original paper
- icdar13 dataset consists of 229 training images and 233 testing images,
- will this work as an OCR solution? HOT 1
- Fine tuning on custom dataset: converting to LMDB HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from textboxes.