Coder Social home page Coder Social logo

ketuikiri / poricom Goto Github PK

View Code? Open in Web Editor NEW

This project forked from blueaxis/poricom

0.0 0.0 0.0 79.49 MB

Optical character recognition in manga images. Manga OCR desktop application

License: GNU General Public License v3.0

Shell 0.63% Python 99.37%

poricom's Introduction

Project logo

Poricom

Optical character recognition in manga images. Manga OCR desktop application

Contents


About

Poricom is a desktop program for optical character recognition in manga images. Although it is a manga OCR application, it can recognize text on other type of images as well. The project is a GUI implementation of the Manga OCR library (supports Japanese only) and the Tesseract-API python wrapper tesserocr (supports other languages). See demo below to see how it works.

Detect text on locally stored manga images:

sample_usage.mp4

Perform OCR on the current screen by pressing Alt+Q:

external_capture.mp4

Alternatives

  • Cloe - The app is based on Poricom's global snipping functionality. If you downloaded Poricom to use only the global shortcut, it might be better if you use Cloe instead.
  • mokuro - Converts manga images to web pages with selectable text. This saves you time and manual effort since textboxes are automatically detected with an almost 100% accuracy.

User Guide

Follow the installation instructions here. Load a directory with manga images and select text boxes with Japanese text. If you are not getting good results using the default settings, use the MangaOcr model to improve text detection.

Features

Listed below are some of the features of Poricom. Smaller features that are not covered in this section are mentioned in the changelog. Click the arrow to see how each implemented feature works.

Open a directory with manga images or a supported manga file (cbz, cbr, pdf) and start scanning text bubbles.
manga-text-detection.mp4
Capture images outside the application using the shortcut `Alt+Q`.
external_capture.mp4
Load MangaOcr model to improve Japanese text recognition.
load-mangaocr-model.mp4
Change language and/or orientation (limited to the Tesseract API).
language-orientation-option.mp4
Detect text on non-manga images.
non-manga-detection.mp4

Installation

Download the latest zip file here. Decompress the file in the desired directory. Make sure that the app folder is in the same folder as the shortcut Poricom.

System Requirements

Recommended:

  • Hard drive: at least 800 MB HD space
  • RAM: at least 2 GB

Approximately 250 MB of free space and 200 MB of memory is needed to run the application using the Tesseract API. If using the Manga OCR model, an additional 450 MB of free space and 800 MB of memory is required.

Development Setup

  • Clone this repo and install conda.
  • Install dependencies by running conda env create -f environment/base.yaml.
  • Activate the environment with conda activate poricom-py39 and run the app using python main.py.
  • If you want to build the app locally, install build dependencies by running conda env update -f environment/build.yaml. Then run pyinstaller main.spec in the build directory.

Acknowledgements

This project will not be possible without the MangaOcr model by Maciej Budyś and the Tesseract python wrapper by sirfz and the tesserocr contributors.

The software is licensed under GPLv3 (see LICENSE) and uses third party libraries that are distributed under their own terms (see LICENSE-3RD-PARTY).

The icons used in this project are from Icons8.

poricom's People

Contributors

blueaxis avatar yupm avatar awsr avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.