Coder Social home page Coder Social logo

optical-character-recognition-with-plagiarism-checker's Introduction

Optical Character Recognition with Plagiarism Checker

  • This project is an implementation of Optical Character Recognition (OCR) with a built-in plagiarism checker. It was developed collaboratively by me and my project mates.

Plagiarism Checker

  • To begin with, we created a plagiarism checker using the Python programming language. We utilized the scikit-learn library, which provides powerful tools for text analysis and comparison. The plagiarism checker compares each pair of text files within a specified folder and generates a percentage indicating the similarity between them.

Optical Character Recognition (OCR)

  • As an additional feature, we incorporated an open-source OCR engine called Tesseract into our project. This enables us to convert text from image files into text files with the .txt extension. By leveraging the OCR functionality, we can now apply the plagiarism checker code to determine the similarity percentage between text extracted from images.

  • To integrate the OCR capabilities into our project, we utilized the pytesseract module in Python. This module allows seamless integration with Tesseract, making it easier for us to extract text from images and include them in our plagiarism analysis.

  • Overall, this project combines OCR and a plagiarism checker, offering a comprehensive solution for detecting similarities in both text and image files.

optical-character-recognition-with-plagiarism-checker's People

Contributors

pravin-jalodiya avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.