ComfyUI-AutoLabel

ComfyUI-AutoLabel is a custom node for ComfyUI that uses BLIP (Bootstrapping Language-Image Pre-training) to generate detailed descriptions of the main object in an image. The captions come from a BLIP image-captioning model loaded from Hugging Face (by default Salesforce/blip-image-captioning-base), and an optional prompt can guide the output.

Features

  • Image to Text Description: Generate detailed descriptions of the main object in an image.
  • Customizable Prompts: Provide your own prompt to guide the description generation.
  • Flexible Inference Modes: Supports GPU, GPU with float16, and CPU inference modes.
  • Offline Mode: Option to use previously downloaded models without network access.
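
The inference modes and the offline option listed above could be wired together roughly as follows. This is a minimal sketch, not the node's actual implementation: `InferenceConfig`, `resolve_inference_mode`, and `caption` are hypothetical names, and the CPU fallback when no GPU is present is an assumption. The calls to `BlipProcessor`, `BlipForConditionalGeneration`, and the `local_files_only` argument come from the Hugging Face transformers library.

```python
from typing import NamedTuple


class InferenceConfig(NamedTuple):
    device: str      # "cuda" or "cpu"
    dtype_name: str  # name of a torch dtype, e.g. "float16"


def resolve_inference_mode(mode: str, cuda_available: bool) -> InferenceConfig:
    """Map an inference_mode string to a device and dtype (hypothetical helper)."""
    if mode not in ("gpu_float16", "gpu", "cpu"):
        raise ValueError(f"unknown inference_mode: {mode!r}")
    if mode == "cpu" or not cuda_available:
        # Assumption: fall back to full-precision CPU when no GPU is available.
        return InferenceConfig("cpu", "float32")
    if mode == "gpu_float16":
        return InferenceConfig("cuda", "float16")
    return InferenceConfig("cuda", "float32")


def caption(image, prompt="a photography of",
            repo_id="Salesforce/blip-image-captioning-base",
            inference_mode="gpu", get_model_online=True) -> str:
    """Caption a PIL image with BLIP (sketch; requires torch and transformers)."""
    # Imported lazily so resolve_inference_mode works without these packages.
    import torch
    from transformers import BlipForConditionalGeneration, BlipProcessor

    cfg = resolve_inference_mode(inference_mode, torch.cuda.is_available())
    dtype = getattr(torch, cfg.dtype_name)
    # local_files_only=True restricts transformers to the local cache (offline use).
    offline = not get_model_online
    processor = BlipProcessor.from_pretrained(repo_id, local_files_only=offline)
    model = BlipForConditionalGeneration.from_pretrained(
        repo_id, torch_dtype=dtype, local_files_only=offline
    ).to(cfg.device)

    # The prompt is passed as a text prefix that the model continues.
    inputs = processor(image, prompt, return_tensors="pt").to(cfg.device, dtype)
    output_ids = model.generate(**inputs)
    return processor.decode(output_ids[0], skip_special_tokens=True)
```

Keeping the mode resolution separate from model loading makes the device and precision choice easy to check without downloading a model.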

Installation

  1. Clone the Repository: From the ComfyUI root directory, clone this repository into the custom_nodes folder.

    git clone https://github.com/fexploit/ComfyUI-AutoLabel custom_nodes/ComfyUI-AutoLabel

  2. Install Dependencies: Navigate to the cloned folder and install the required dependencies.

    cd custom_nodes/ComfyUI-AutoLabel
    pip install -r requirements.txt

Usage

Adding the Node

  1. Start ComfyUI.
  2. Add the AutoLabel node from the custom nodes list.
  3. Connect an image input and configure the parameters as needed.

Parameters

  • image (required): The input image tensor.
  • prompt (optional): A string to guide the description generation (default: "a photography of").
  • repo_id (optional): The Hugging Face model repository ID (default: "Salesforce/blip-image-captioning-base").
  • inference_mode (optional): The inference mode, one of "gpu_float16", "gpu", or "cpu" (default: "gpu").
  • get_model_online (optional): Boolean flag that allows the model to be downloaded when it is not already present locally (default: True).
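
For orientation, the parameters above map onto ComfyUI's custom-node interface roughly as in the sketch below. The class name `AutoLabelSketch`, the `generate` method, the `CATEGORY` value, the `STRING` return type, and the `default_inputs` helper are all assumptions made for illustration; the actual node in this repository may be declared differently. Only the parameter names and defaults are taken from the list above.

```python
class AutoLabelSketch:
    """Hypothetical node skeleton; the real class name and layout may differ."""

    @classmethod
    def INPUT_TYPES(cls):
        # ComfyUI reads this dict to build the node's input sockets and widgets.
        return {
            "required": {"image": ("IMAGE",)},
            "optional": {
                "prompt": ("STRING", {"default": "a photography of"}),
                "repo_id": ("STRING", {"default": "Salesforce/blip-image-captioning-base"}),
                # A list of strings as the type renders as a dropdown.
                "inference_mode": (["gpu_float16", "gpu", "cpu"], {"default": "gpu"}),
                "get_model_online": ("BOOLEAN", {"default": True}),
            },
        }

    RETURN_TYPES = ("STRING",)  # assumed: the node outputs the caption text
    FUNCTION = "generate"       # name of the method ComfyUI calls
    CATEGORY = "AutoLabel"      # assumed menu category

    def generate(self, image, prompt="a photography of",
                 repo_id="Salesforce/blip-image-captioning-base",
                 inference_mode="gpu", get_model_online=True):
        # Placeholder: a real implementation would run BLIP on `image` here.
        # ComfyUI expects a tuple matching RETURN_TYPES.
        return (prompt,)


def default_inputs(node_cls):
    """Collect the default value of every optional input (illustrative helper)."""
    optional = node_cls.INPUT_TYPES().get("optional", {})
    return {name: spec[1]["default"] for name, spec in optional.items()}


# ComfyUI discovers nodes through this mapping in the package's __init__.py.
NODE_CLASS_MAPPINGS = {"AutoLabelSketch": AutoLabelSketch}
```

Calling `default_inputs(AutoLabelSketch)` returns the four optional parameters with the defaults documented above, which is a quick way to confirm that a declaration matches the README.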

Contributing

Contributions are welcome! Please open an issue or submit a pull request with your changes.

License

This project is licensed under the MIT License.

Acknowledgements

Contact

For any inquiries, please open an issue on the GitHub repository.
