Coder Social home page Coder Social logo

screengpt-vision's Introduction

ScreenGPT-Vision – A Friendly Desktop Companion

Hey there, fellow explorers of GPT! πŸš€

I've put together a little side project called ScreenGPT-Vision. Think of it as a casual chat with GPT-4, but from the comfort of your desktop and with a cool twist: the ability to throw in images! Yep, now asking questions about those pesky screenshots or any visual puzzles is just a click or a hotkey away.

What's This All About? πŸ€”

Ever been knee-deep in work and stumbled upon an error message as cryptic as an ancient manuscript? Or maybe you've had a math equation staring at you from a photo, begging to be solved. Instead of tab-hopping and website navigating, why not just press a key, snag a screenshot, and get querying? That's the convenience ScreenGPT-Vision brings to your desktop!

Features (The Cool Bits) πŸŽ‰

  • Chat with GPT-4: Just type in your question and voilΓ  – wisdom from GPT-4.
  • Screenshot Magic: Capture your screen's content and let GPT-4 do the heavy lifting.
  • Lazy Loading: Queue up those screenshots and fire away all your queries at once.
  • Shortcut Simplicity: Hit Ctrl+Shift + P and your screenshot is ready to be analyzed. Ctrl + W to quickly send your messages.
  • Feather-Light: A minimalist app that's easy on your machine.

The Backstory πŸ“š

Cobbled together in just 3 days, this app is my humble foray into desktop development – no prior experience, just a lot of googling and some help from my code mentor, Code Mentor GPT (he even helped with the editing this README). The code might be rough around the edges, and the UI won't win beauty contests, but it gets the job done. If this little tool piques your interest, I'm all in for round two of development. Think object localization, automated actions... the sky's the limit!

Join the Fun! 🎈

Stumble upon this repo and find it neat? Star it, fork it, send pull requests, or just spread the good word – every bit of support counts. Your interest is the fuel for this project's growth. Let's make desktop AI chat a thing!

Compatibility Note 🍏

Doesn't support MacOs

Watch It in Action! πŸ“Ή

Watch It in Action! πŸ“Ή

Curious to see ScreenGPT-Vision in action? I've put together some screenshots to show you just how easy and fun it is to use:

Locate Object on Screen Demo

Locate Object on Screen Demo

Ask Question About Graphs

Ask Question About Graphs

Get LaTeX of Math Equations

Get LaTeX of Math Equations

Get Guided to Build Your Landing Page

Get Guided to Build Your Landing Page

Each image is a snapshot of the app in use, showcasing the various features in action.

How to Get It Rolling πŸ› οΈ

Prerequisites

  • Python 3.8+
  • OpenAI API key (Don't have one? No worries! Head over to OpenAI's API keys page to get your key.)
  • Docker (optional, for Docker setup)

Installation

Clone the Repository

git clone https://github.com/AmT42/ScreenGPT-Vision.git
cd ScreenGPT-Vision

Create a .env File

Inside the ScreenGPT-Vision directory, create a file named .env. Unix-based systems (Linux, macOS, etc.)

touch .env

Windows

 type nul > .env

Add Your API Key to the .env File

Open your new .env file with your favorite text editor and add the following line: OPEN_API_KEY=your-openai-key

With Docker

  1. Run docker compose
    docker-compose up --build

Run the Frontend

  1. Install PyQT5 to run the front
    pip install -r requirements_GUI.txt
  2. In a separate terminal, execute the following command:
    python GUI/app.py

Without Docker

  1. Install Requirements
    pip install -r requirements.txt
  2. Start the Backend
    uvicorn app.main:app --reload

Run the Frontend

In a separate terminal, execute the following command:

 python GUI/app.py

Usage

[Include instructions on how to use the application, along with any screenshots or videos if available.]

Want to Contribute? 🀝

If you've got ideas or code to improve this app, I'm all ears! Contributing is simple:

  1. Fork this repo.
  2. Create a branch for your awesome new feature (git checkout -b amazing-feature).
  3. Commit your changes (git commit -m 'Add some awesomeness').
  4. Push to the branch (git push origin amazing-feature).
  5. Create a new Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation πŸ“„

If you end up using ScreenGPT-Vision in your work, a shoutout would be super cool:

  @misc{ScreenGPT-Vision,
    author = "AmT42",
    title = "ScreenGPT-Vision for Desktop",
    year = "2023",
    url = "https://github.com/AmT42/ScreenGPT-Vision"
  }

Contact

AmT42 - [email protected] / https://www.linkedin.com/in/ahmet-celebi-973b63197/

Project Link: https://github.com/AmT42/ScreenGPT-Vision

=======

screengpt-vision's People

Contributors

amt42 avatar

Stargazers

Yurui Zhu avatar Aditya Shankar Khorne avatar  avatar ClΓ©ment Castellon avatar Hemesky avatar Elias Tazartes avatar

Watchers

Kostas Georgiou avatar Hemesky avatar  avatar

Forkers

adityaskhorne

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.