A Toolbox in DOCSAID

Home Page: https://docsaid.org/en/docs/category/docsaidkit

License: Apache License 2.0

DocsaidKit

Introduction

This project is a toolbox for image processing and deep learning, primarily consisting of the following components:

  • Vision: Functions related to computer vision, such as image and video processing.
  • Structures: Modules for handling structured data, such as BoundingBox and Polygon.
  • ONNXEngine: Provides ONNX inference capabilities, supporting ONNX format models.
  • Torch: PyTorch-related modules, including neural network architectures, optimizers, etc.
  • Utils: Miscellaneous utilities that do not fit into other categories.
  • Tests: Unit tests that verify the behavior of the components above.

Documentation

For installation and usage instructions, refer to the DocsaidKit documentation at https://docsaid.org/en/docs/category/docsaidkit, which covers the project in detail.

Installation

Before installing DocsaidKit, ensure your system meets the following requirements:

Python Version

  • Ensure Python 3.8 or higher is installed on your system.
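One way to confirm the interpreter requirement before going further is a small check like the sketch below. The helper name `python_is_supported` is made up for this illustration and is not part of DocsaidKit.

```python
import sys

# Minimum interpreter version stated in the requirement above.
MIN_PYTHON = (3, 8)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True when the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    current = ".".join(str(part) for part in sys.version_info[:3])
    status = "OK" if python_is_supported() else "too old"
    print(f"Python {current}: {status}")
```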

Dependencies

Install the required dependencies based on your operating system.

  • Ubuntu

    Open a terminal and run the following command to install the dependencies:

    sudo apt install libturbojpeg exiftool ffmpeg libheif-dev
  • macOS

    Use Homebrew to install the dependencies:

    brew install jpeg-turbo exiftool ffmpeg libheif
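After installing the packages above, a quick check such as the following sketch can report what is still missing. The executable names (`exiftool`, `ffmpeg`) and library names (`turbojpeg`, `heif`) are assumptions about what those packages install, and `missing_dependencies` is a hypothetical helper, not a DocsaidKit function. `ctypes.util.find_library` only searches the loader's standard locations, so a library installed in a non-default prefix may be reported as missing even when it is present.

```python
import shutil
from ctypes.util import find_library

# Assumed names: executables looked up on PATH, libraries looked up by the loader.
TOOLS = ("exiftool", "ffmpeg")
LIBRARIES = ("turbojpeg", "heif")


def missing_dependencies(tools=TOOLS, libraries=LIBRARIES):
    """Return names of tools not on PATH and libraries the loader cannot find."""
    missing = [name for name in tools if shutil.which(name) is None]
    missing += [name for name in libraries if find_library(name) is None]
    return missing


if __name__ == "__main__":
    print(missing_dependencies() or "all dependencies found")
```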

pdf2image Dependencies

pdf2image is a Python module for converting PDF documents into images. It relies on the poppler command-line tools, which you install according to your operating system:

  • For detailed installation instructions, refer to the pdf2image project page.

  • macOS: Install poppler via Homebrew:

    brew install poppler
  • Linux: Most Linux distributions come with pdftoppm and pdftocairo pre-installed.

    If not, install poppler-utils via your package manager:

    sudo apt install poppler-utils
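Once poppler is installed, a conversion can be sketched as below. This is a minimal illustration that calls pdf2image directly rather than through DocsaidKit; the wrapper `pdf_to_images` and its `which` parameter are hypothetical and exist only so the poppler check can be exercised on its own.

```python
import shutil


def pdf_to_images(pdf_path, dpi=200, which=shutil.which):
    """Convert each page of a PDF into a PIL image using pdf2image."""
    # pdf2image shells out to poppler's pdftoppm by default, so fail early
    # with a clear message when it is not on PATH.
    if which("pdftoppm") is None:
        raise RuntimeError("poppler not found: install poppler or poppler-utils")
    # Imported lazily so the check above runs even if pdf2image is absent.
    from pdf2image import convert_from_path

    return convert_from_path(pdf_path, dpi=dpi)
```

Calling `pdf_to_images("document.pdf")` returns a list with one image per page; the file name here is only a placeholder.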

Installation via git clone

  1. Clone the repository:

    git clone https://github.com/DocsaidLab/DocsaidKit.git
  2. Install the wheel package:

    pip install wheel
  3. Build the wheel file:

    cd DocsaidKit
    python setup.py bdist_wheel
  4. Install the built wheel package:

    pip install dist/docsaidkit-*-py3-none-any.whl

    To install the version that supports PyTorch, replace ${version} with the version of the wheel you built:

    pip install "dist/docsaidkit-${version}-py3-none-any.whl[torch]"
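The last step can also be scripted. The sketch below finds the built wheel in dist/ and assembles the matching pip command, optionally with the torch extra. Both helper names are hypothetical, and the file pattern is taken from the install command in step 4.

```python
import sys
from pathlib import Path


def find_built_wheel(dist_dir="dist", pattern="docsaidkit-*-py3-none-any.whl"):
    """Return the most recently modified matching wheel, or None if absent."""
    wheels = sorted(Path(dist_dir).glob(pattern), key=lambda p: p.stat().st_mtime)
    return wheels[-1] if wheels else None


def pip_install_command(wheel, extras=()):
    """Build the pip argument list; extras are appended in square brackets."""
    target = str(wheel)
    if extras:
        target += "[" + ",".join(extras) + "]"
    return [sys.executable, "-m", "pip", "install", target]
```

Passing the result to `subprocess.run(pip_install_command(wheel, ("torch",)), check=True)` runs the install with the same interpreter that executes the script, which avoids installing into a different environment by accident.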

Installation via Docker (Recommended)

Installing via Docker gives every user the same environment.

Build the image from the repository root:

cd DocsaidKit
bash docker/build.bash

Once the build completes, run your script inside the container, with the current directory mounted at /code:

docker run -v ${PWD}:/code -it docsaid_training_base_image your_scripts.py

For the specifics of the build file, refer to: Dockerfile

Testing

DocsaidKit uses pytest for unit testing to keep its functions stable and correct.

You can run the tests yourself to verify the functions you rely on.

To run the tests:

python -m pytest tests
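For readers new to pytest, the sketch below shows the shape of a test module it would collect: functions named `test_*` containing plain `assert` statements. The `box_area` helper is a stand-in written for this example and is not a DocsaidKit function.

```python
# test_example.py: a stand-in test module for illustration only.


def box_area(x1, y1, x2, y2):
    """Area of an axis-aligned box; inverted corners give zero area."""
    return max(0, x2 - x1) * max(0, y2 - y1)


def test_box_area_regular():
    # A 4 x 3 box has area 12.
    assert box_area(0, 0, 4, 3) == 12


def test_box_area_degenerate():
    # Inverted corners clamp to a zero-sized box.
    assert box_area(5, 5, 2, 2) == 0
```

Saved under the tests directory, a file like this is picked up by the pytest command above because its name and its function names start with `test_`.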

Contributors

  • zephyr-sh
  • github-actions[bot]
  • jack-lin-ntu
