OCR Image Processing

This project processes images to extract text using OCR and converts the text into structured data in a CSV format.

Setup

Prerequisites

Python 3.12.3+
Tesseract OCR
GitHub Account
Git

Creating a GitHub Account

Visit the GitHub Signup Page
Fill in the required details (username, email, password) and complete the sign-up process.
Verify your email address by clicking on the verification link sent to your email.

Installing Git

Windows

Download Git for Windows from the official website: git-scm.com
Run the installer and follow the instructions.
Verify the installation:
```
git --version
```

macOS

Install Homebrew if you haven't already:

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

Install Git:
```
brew install git
```
Verify the installation:
```
git --version
```

Linux (Ubuntu)

Update the package list:
```
sudo apt update
```
Install Git:
```
sudo apt install git
```
Verify the installation:
```
git --version
```

Installation

Clone the repository:

git clone https://github.com/your-username/ocr-image-processing.git
cd ocr-image-processing

Install the required Python packages:
```
pip install -r requirements.txt
```
Ensure Tesseract is installed and the TESSDATA_PREFIX environment variable is set:
```
export TESSDATA_PREFIX=/usr/local/share/tessdata/
```

Installing Python and Pip

Windows

Download Python from the official website: python.org
Run the installer and follow the instructions, ensuring to check the box to add Python to your PATH.
Verify the installation:
```
python --version
pip --version
```

macOS

Install Homebrew if you haven't already:

/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"

Install Python:
```
brew install python
```
Verify the installation:
```
python3 --version
pip3 --version
```

Linux (Ubuntu)

Update the package list:
```
sudo apt update
```
Install Python:
```
sudo apt install python3 python3-pip
```
Verify the installation:
```
python3 --version
pip3 --version
```

Usage

Place the images to be processed in the data/images directory.
Run the main script with the images directory and output directory as arguments:
```
python src/main.py data/images data/output
```
The processed CSV files will be saved in the specified output directory.

Example CLI Usage

python src/main.py data/images data/output

Using CI/CD with GitHub Actions

This project uses GitHub Actions for CI/CD. When you push images to the master branch, the GitHub Actions workflow will automatically process the images and upload the CSV outputs as artifacts.

Push Images to GitHub:

git add data/images/
git commit -m "Add new images"
git push origin master

Download CSV Outputs:
- Go to the Actions tab in your GitHub repository.
- Select the latest workflow run.
- Scroll down to the Artifacts section.
- Download the csv-files artifact.

Example Screenshots

GitHub Actions Workflow Run

Downloading Artifacts

Contributing

Contributions are welcome! Please open an issue or submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

rotemlevi / image2table Goto Github PK

image2table's Introduction

OCR Image Processing

Setup

Prerequisites

Creating a GitHub Account

Installing Git

Windows

macOS

Linux (Ubuntu)

Installation

Installing Python and Pip

Windows

macOS

Linux (Ubuntu)

Usage

Example CLI Usage

Using CI/CD with GitHub Actions

Example Screenshots

GitHub Actions Workflow Run

Downloading Artifacts

Contributing

License

image2table's People

Contributors

Watchers

Recommend Projects

Recommend Topics

Recommend Org