prathami1 / slide-scan

A notes application with the ability to capture, record, and transcribe text, easily and intuitively.

Java 99.23% Kotlin 0.77%
android-application android-studio image-to-text java kotlin-android notes-app voice-to-text

Slide Scan

An Android application designed for students to easily record and take notes on lectures, whether they are spoken or presented as PowerPoint slides. It uses Google's speech-to-text recognition and ML Kit for text recognition.

Summary

Designed as a user-friendly note-storing platform, this application offers the standard note-taking features: creating, editing, searching, deleting, pinning, and viewing notes. In addition, it includes two NLP (Natural Language Processing) based tools that help convert ideas into notes: speech-to-text and image-to-text recognition.
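As a rough illustration of how pinning might affect the order in which notes are shown, here is a minimal plain-Java sketch. The class `PinnedOrderSketch`, the `Note` fields, and the "pinned first, then newest first" rule are all assumptions made for this example; they are not taken from the app's source.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical sketch: names and the ordering rule are assumptions, not the app's code.
final class PinnedOrderSketch {

    // Minimal note model: title, contents, creation time, and a pinned flag.
    static final class Note {
        final String title;
        final String contents;
        final long createdAtMillis;
        boolean pinned;

        Note(String title, String contents, long createdAtMillis) {
            this.title = title;
            this.contents = contents;
            this.createdAtMillis = createdAtMillis;
        }
    }

    // Returns a new list with pinned notes first, then newest notes first.
    static List<Note> forDisplay(List<Note> notes) {
        List<Note> sorted = new ArrayList<>(notes);
        sorted.sort(
            // false sorts before true, so negating the flag puts pinned notes first
            Comparator.comparing((Note n) -> !n.pinned)
                .thenComparing(
                    Comparator.comparingLong((Note n) -> n.createdAtMillis).reversed()));
        return sorted;
    }
}
```

The input list is copied rather than sorted in place, so the stored notes keep their original order and only the displayed list changes.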

How it works

Built primarily in Java, this application consists of three main UIs: the Homepage, Create a New Note, and Crop an Image.

The Homepage consists of a search view followed by a recycler view filled with card views. The search view serves as a search bar that filters the notes, so note-takers can quickly find what they are looking for. Each card view shows the title, contents, and date of a note.

Tapping a card view opens the Create a New Note screen, where the user can set a title, edit the contents of the note, and choose to record audio or capture a picture to transcribe into text.

If the user chooses to capture an image, they are redirected to a camera UI to take a picture. After snapping the picture, they can edit it (crop, rotate, flip, or otherwise modify it) before transcribing it. If the user chooses to record a memo instead, they are redirected to a Voice UI, which records their speech and transcribes it to text.
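The search bar's filtering step can be sketched in plain Java as follows. This is a minimal sketch under stated assumptions: the class `NoteSearchSketch`, its `Note` model, and the case-insensitive substring match on title and contents are illustrative choices, not the app's confirmed behavior.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical sketch: names and matching rule are assumptions, not the app's code.
final class NoteSearchSketch {

    // Minimal note model with only the fields the filter needs.
    static final class Note {
        final String title;
        final String contents;

        Note(String title, String contents) {
            this.title = title;
            this.contents = contents;
        }
    }

    // Returns the notes whose title or contents contain the query, ignoring case.
    // A null or blank query returns a copy of every note.
    static List<Note> filter(List<Note> notes, String query) {
        String needle = query == null ? "" : query.trim().toLowerCase(Locale.ROOT);
        if (needle.isEmpty()) {
            return new ArrayList<>(notes);
        }
        List<Note> matches = new ArrayList<>();
        for (Note note : notes) {
            boolean inTitle = note.title.toLowerCase(Locale.ROOT).contains(needle);
            boolean inContents = note.contents.toLowerCase(Locale.ROOT).contains(needle);
            if (inTitle || inContents) {
                matches.add(note);
            }
        }
        return matches;
    }
}
```

In an Android app, a method like this could be called from the search view's `onQueryTextChange` callback, with the result handed to the recycler view's adapter; that wiring is omitted here because it depends on the app's own adapter class.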

Quick Walkthrough

SlideScan.mp4

Application in Action

From cycling through, pinning, and managing your notes to creating a new one, the Home Page is the hub for everything you've saved in the app.

Screenshots: Home Page, Note Features, Pinning Notes, Create A New Note.

The app's image transcription lets you capture or upload an image, then transcribes all the text in the image and pastes it into a note.

Screenshots: Select Image Source, Snap a Picture, Confirm Picture, Crop/Edit Frame, Transcription.

The app can also transcribe speech into text and search through your notes to find the one you are looking for.

Screenshots: Live Transcription of Speech, Search Functionality (two screenshots).

Dependencies Utilized

  • Java
  • Kotlin
  • ML Kit
  • Speech-To-Text recognition model
  • Android Image Cropper
