amirmgh1375 / textrecognitiondatagenerator Goto Github PK
View Code? Open in Web Editor NEWA synthetic data generator for text recognition with arabic and persian strings support
A synthetic data generator for text recognition with arabic and persian strings support
Doesnt work also proper requirements not mentioned
[mohsen@localhost TextRecognitionDataGenerator]$ python3.6 run.py -w 5 -f 64
Missing modules for handwritten text generation.
2%|████▏ | 24/1000 [00:00<02:16, 7.16it/s]multiprocessing.pool.RemoteTraceback:
"""
Traceback (most recent call last):
File "/usr/lib64/python3.6/multiprocessing/pool.py", line 119, in worker
result = (True, func(*args, **kwds))
File "/home/mohsen/Desktop/python/TextRecognitionDataGenerator/data_generator.py", line 29, in generate_from_tuple
cls.generate(*t)
File "/home/mohsen/Desktop/python/TextRecognitionDataGenerator/data_generator.py", line 102, in generate
background = BackgroundGenerator.picture(background_height, background_width)
File "/home/mohsen/Desktop/python/TextRecognitionDataGenerator/background_generator.py", line 81, in picture
y = random.randint(0, picture.size[1] - height)
File "/usr/lib64/python3.6/random.py", line 221, in randint
return self.randrange(a, b+1)
File "/usr/lib64/python3.6/random.py", line 199, in randrange
raise ValueError("empty range for randrange() (%d,%d, %d)" % (istart, istop, width))
ValueError: empty range for randrange() (0,0, 0)
"""
The above exception was the direct cause of the following exception:
Traceback (most recent call last):
File "run.py", line 363, in <module>
main()
File "run.py", line 351, in main
), total=args.count):
File "/usr/lib/python3.6/site-packages/tqdm/_tqdm.py", line 1005, in __iter__
for obj in iterable:
File "/usr/lib64/python3.6/multiprocessing/pool.py", line 735, in next
raise value
ValueError: empty range for randrange() (0,0, 0)
Hello!
Your generators are mentioned in EasyOCR repository as an advised method for generating data for training custom text recognition models.
Is there any opportunity to generate dataset of Cyrillic symbols with your generators ?
@amirmgh1375
Thanks a lot for this great Generator!
Please What is the specific caracteristics of these synthetic images generated by these tools?
How can it proceeds when the input is an image containing a text(how it takes the text and generate images)
What differences between these images and the images generated by a data augmentation?
Thanks a lot for your help!
localhost TextRecognitionDataGenerator]$ sudo pip install -r requirements.txt
ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements.txt'
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.