A Windows-based, AI-powered desktop pet written in Python.

License: MIT License

Python 100.00%
desktop-pet gemini-vision python tkinter virtual-pet virtual-pets win32api

pycatai-pet's Introduction

pyCatAI-pet (v0.9)

A Windows-based, AI-powered desktop pet written in Python. It moves around your screen and jumps on top of active program windows. With the help of Google's Gemini Vision model, it also generates funny comments about the current on-screen activity by taking a screenshot of your screen every minute.
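The screenshot-and-comment cycle described above can be pictured roughly as the loop below. This is only a sketch, not the project's actual code: `run_comment_loop` and its `capture`, `describe`, and `show` parameters are hypothetical names, and the real script may well schedule this work differently (for example through tkinter's event loop).

```python
import time
from typing import Callable


def run_comment_loop(
    capture: Callable[[], bytes],
    describe: Callable[[bytes], str],
    show: Callable[[str], None],
    interval_seconds: float = 60.0,
    rounds: int = 1,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Capture the screen, get a comment for it, and show it, once per interval.

    All three callables are placeholders (assumptions for illustration):
    capture  -> returns the current screen as image bytes
    describe -> turns image bytes into a comment, e.g. via a vision model
    show     -> displays or speaks the comment
    """
    for round_index in range(rounds):
        screenshot = capture()
        comment = describe(screenshot)
        show(comment)
        # Wait between rounds, but not after the last one.
        if round_index < rounds - 1:
            sleep(interval_seconds)
```

Passing the three steps in as functions keeps the timing logic separate from the screenshot, model, and display code, so each part can be swapped or tested on its own.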

Watch video demo here

Note

None of the cat sprites used in this project are mine. They were taken from here.

Current Features:

  • Uses tkinter to display sprite images and text on screen.
  • Uses the win32gui library to access the Windows API and get the active program windows and their X, Y positions.
  • Uses the pyttsx3 library for text-to-speech.
  • Uses Google's Gemini Vision model for generating comments.
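To illustrate how window positions could be used to make the pet sit on top of a window, here is a minimal sketch. `perch_position` and `foreground_window_rect` are hypothetical helpers, not functions from this project. The rectangle layout `(left, top, right, bottom)` matches what `win32gui.GetWindowRect` returns. Centring the sprite and clamping it to the top of the screen are assumptions made for the example.

```python
from typing import Tuple

# (left, top, right, bottom) in screen pixels, the layout win32gui.GetWindowRect uses.
Rect = Tuple[int, int, int, int]


def perch_position(window_rect: Rect, sprite_width: int, sprite_height: int) -> Tuple[int, int]:
    """Return the top-left (x, y) that places a sprite on a window's top edge."""
    left, top, right, _bottom = window_rect
    # Centre the sprite horizontally over the window.
    x = left + (right - left - sprite_width) // 2
    # Put the sprite's feet on the window's top edge, but never above the screen.
    y = max(top - sprite_height, 0)
    return x, y


def foreground_window_rect() -> Rect:
    """Windows only: rectangle of the window that currently has focus."""
    import win32gui  # part of pywin32; imported lazily so the rest runs anywhere

    hwnd = win32gui.GetForegroundWindow()
    return win32gui.GetWindowRect(hwnd)
```

On Windows, `perch_position(foreground_window_rect(), 64, 64)` would give a spot above the focused window for a 64x64 sprite. The result could then be applied to a tkinter window through its geometry string.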

To do:

  • Add support for Linux.
  • Add better TTS.
  • Add & use more idle animations.
  • Make the sprite draggable with the mouse.
  • Make the sprite stick to the borders of other program windows.

Getting started

Install Python on your machine if you haven't already.

Download this repository manually, or clone it if you have git installed:

git clone https://github.com/R37r0-Gh057/pyCatAI-pet

Once inside the directory, open your terminal and enter the following command to install the necessary libraries:

pip install -r requirements.txt
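After installing, a quick check like the one below can confirm that the libraries named in the feature list are importable. This is an optional sketch: `missing_modules` is a hypothetical helper. The module names are taken from the feature list above, not from `requirements.txt`, whose exact contents are not shown here.

```python
import importlib.util
from typing import Iterable, List


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Module names assumed from the README's feature list.
EXPECTED = ("tkinter", "win32gui", "pyttsx3")
```

Calling `missing_modules(EXPECTED)` returns an empty list when everything is available. Note that `win32gui` is expected to be missing on non-Windows systems.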

Usage

Important

Make sure that you have generated your own Gemini API key and placed it in the script:

self.API_KEY = ""
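If you would rather not paste the key into the source file, one possible alternative is to read it from an environment variable. This is a sketch under assumptions: the script as published expects the key in `self.API_KEY`, so it would need a small change such as `self.API_KEY = load_api_key()`. Both `load_api_key` and the variable name `GEMINI_API_KEY` are hypothetical choices for this example.

```python
import os


def load_api_key(env_var: str = "GEMINI_API_KEY") -> str:
    """Read the Gemini API key from an environment variable.

    The variable name is an assumption for this example, not something
    the project defines.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable to your Gemini API key.")
    return key
```

This keeps the key out of version control if you later commit changes to the script.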

Run the main.py file from the terminal:

python main.py

Contributing

If you want to suggest a new feature or functionality, you can open a new issue with the "enhancement" title.

If you want to add features or improve the existing code yourself, feel free to open a pull request:

  1. Fork this repository
  2. Create a separate branch
  3. Make your changes
  4. Open a pull request

You can get started by checking the currently open issues, or by creating new ones.

Contact

Feel free to reach out to me on Discord: @retr0_gh0st

pycatai-pet's People

Contributors

r37r0-gh057

pycatai-pet's Issues

Add better TTS

Currently using the Windows SAPI for text-to-speech conversions.
Need to implement a non-blocking TTS and custom voice model if possible.
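One way the non-blocking part could be approached is to hand speech off to a worker thread through a queue, so the animation loop never waits for audio to finish. The sketch below is an illustration, not the project's implementation: `BackgroundSpeaker` and `_speak_with_pyttsx3` are hypothetical names. How well pyttsx3 behaves when driven from a worker thread depends on the driver and has not been verified here.

```python
import queue
import threading
from typing import Callable, Optional


def _speak_with_pyttsx3(text: str) -> None:
    """Default backend: blocking speech through pyttsx3 (imported lazily)."""
    import pyttsx3

    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()


class BackgroundSpeaker:
    """Queue texts and speak them one by one on a worker thread."""

    def __init__(self, speak: Callable[[str], None] = _speak_with_pyttsx3) -> None:
        self._speak = speak
        self._queue: "queue.Queue[Optional[str]]" = queue.Queue()
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def say(self, text: str) -> None:
        """Return immediately; the worker thread speaks the text later."""
        self._queue.put(text)

    def close(self) -> None:
        """Speak everything still queued, then stop the worker."""
        self._queue.put(None)  # sentinel that ends the worker loop
        self._worker.join()

    def _run(self) -> None:
        while True:
            text = self._queue.get()
            if text is None:
                break
            self._speak(text)
```

Because the backend is passed in as a function, a different voice model could later replace pyttsx3 without changing the queueing code.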

Add support for Linux

Known issues:

  • Unlike on Windows, tkinter on Linux cannot display images on the screen without a window.
  • A Win32 API equivalent or some other workaround is needed to interact with other application windows on the screen.
