Follow My Reading provides a API service for our users to upload an image and audio of their reading session, and our service checks whether there are any mistakes in pronunciation in the audio.
๐ users can take a photo of the page
๐ read the page aloud
๐ค record their audio
โฌ๏ธ upload image and audio files to sever
:park: server can process image
๐ต server can process audio
๐ server can compare audio against image
๐ server can compare audio against text
โฌ server can extract audio segments with requested phrases
FastAPI is a modern, fast (high-performance) web framework for building APIs with Python 3.6+ based on standard Python type hints, making it easy to write and maintain complex APIs in a fast and scalable way.
Redis is an open source, in-memory data structure store, used as a database, cache, and message broker, with support for a wide range of data structures and features that make it highly versatile and efficient.
Huey is a lightweight task queue for Python that allows for easy integration with Redis and asynchronous processing of tasks in distributed systems.
โ Image and audio upload
โ Audio Processing
โ Splitting audio by words or by phrases
โ Image Processing
โ Reporting text coordinates on the image
โ Comparing audio and image
โ Comparing audio and text
โ Extracting audio by given phrases
โ Plugin Support
โ Multi-language support
โ Distributed computing using Task System
โ Authentication
We host the documentation for our API here: ๐ Gitlab Pages, where you will find all the information you need to use our API effectively.
In addition to the API documentation, we also have a detailed wiki that explains everything in detail: ๐ Project Wiki. This resource provides more comprehensive information, so feel free to take a deep dive and explore the different sections.
- @a.kudryavtsev
- @a.soldatov
- @i.sannikov
- @f.smirnov
- @l.novikov
This project is licensed under the terms of the MIT License.