islinxu / vision-process-webui

💡💡💡An awesome computer vision app built with Gradio

License: Apache License 2.0

Python 99.31% Shell 0.69%
detectron2 detrex mmaction2 mmdetection mmocr mmpose mmsegmentation mmyolo openmmlab timm

vision-process-webui's Introduction

vision-process-webui

Language: [English | Chinese]


🎤Introduction

Computer vision theory and models keep growing more complex. To make them easier to understand and reproduce, lower the barrier to use, and let users quickly check image-processing results, this project deploys and demonstrates models for tasks such as object detection, image segmentation, and image classification as Gradio apps for inference. It is inspired by the role stable-diffusion-webui played in promoting applications of the Stable Diffusion model. Contributions and new users are welcome!

🛜Online running

OpenMMLab

detectron2

detrex

modelscope&AdaDet

detection

classification

segmentation

🧙performance&demo

🔨OpenMMLab

MMPreTrain MMYOLO MMDetection MMPose
MMSegmentation MMRotate MMOCR MMAction2

🔨detectron2

🔨classification


🔨detection

YOLOv8-det YOLOv8-seg YOLOv8-seg YOLOv8-seg
YOLOv3 YOLOv5 YOLOX
YOLO-NAS PP-YOLOE RT-Detr

🔨segmentation

mobile-sam[point] mobile-sam[bbox]

🆕News

  • (2024-05-06): add yolo_world_with_efficientvit_sam
  • (2024-05-04): support uploading a model for inference.
  • (2023-09-29): update README.md
  • (2023-09-20): detrex, damo-yolo, easy-face
  • (2023-09-18): detectron2
  • (2023-09-16): mmagic
  • (2023-09-14): mmtracking
  • (2023-09-12): mmaction2
  • (2023-09-10): mmocr, mmrotate, mmsegmentation
  • (2023-09-08): mmyolo, mmpretrain, mmdetection, mmpose
  • (2023-09-07): yolov3, yolov5, yolov8, yolo_nas, yolox, torchvision-detection, mobile-sam, timm-classification
  • (2023-09-02): repo init.

🗓support list

Model Nums list
yolov3 3 model_list
yolov5 4 model_list
yolox 5 model_list
yolonas 3 model_list
yolov8 4 model_list
timm 20 model_list
torchvision_cls 14 model_list
torchvision_det 6 model_list
detectron2 36 model_list
detrex 61 model_list
mmpretrain 545 model_list
mmyolo 74 model_list
mmdetection 559 model_list
mmsegmentation 622 model_list
mmocr 17 model_list
mmaction2 180 model_list
mmrotate 50 model_list
mmpose 10 model_list
mmagic 14 model_list
damo_face 4 model_list
damo_yolo 8 model_list

📖Usage

1. install

git clone https://github.com/isLinXu/vision-process-webui.git
cd vision-process-webui
pip install -r requirements.txt

2. download weights

cd weights
cd [model_name]
sh download_weights.sh

Replace [model_name] with the name of the model whose weights you want to download.
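
The download step can also be scripted. The following is a minimal sketch, assuming each model directory under weights/ contains a download_weights.sh script as step 2 describes. The function name download_weights and its runner parameter are hypothetical and not part of the repository.

```python
import subprocess
from pathlib import Path


def download_weights(model_name, weights_root="weights", runner=subprocess.run):
    """Run weights/<model_name>/download_weights.sh from inside that directory.

    Hypothetical helper: the directory layout is taken from the README,
    the function itself is not part of the repository.
    """
    model_dir = Path(weights_root) / model_name
    script = model_dir / "download_weights.sh"
    if not script.is_file():
        raise FileNotFoundError(f"no download script at {script}")
    # Same effect as: cd weights/<model_name> && sh download_weights.sh
    return runner(["sh", script.name], cwd=model_dir, check=True)
```

Passing check=True makes a failed download raise an exception instead of being silently ignored. The runner parameter exists only so the helper can be exercised without actually executing a shell script.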

3. run

python webui/[model_app].py

Replace [model_app] with one of classification, detection, or segmentation. Alternatively, run a single-model app:

cd webui/app
python [model_app].py

Replace [model_app] with one of: yolov3, yolov5, yolov8, yolonas, ppyoloe, torchvision-detection, torchvision-classification, torchvision-segmentation, mobile-sam, fast-sam.
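
To catch a mistyped app name before launching, the list above can be checked in a few lines of Python. This is a sketch only: build_run_command and its pattern parameter are hypothetical, and the default file name follows step 3 as written. The traceback quoted in the issue further down this page names the file yolov8_app.py, so pattern="{name}_app.py" may be closer to the actual repository layout.

```python
import sys
from pathlib import Path

# App names copied from step 3 of the README.
MODEL_APPS = (
    "yolov3", "yolov5", "yolov8", "yolonas", "ppyoloe",
    "torchvision-detection", "torchvision-classification",
    "torchvision-segmentation", "mobile-sam", "fast-sam",
)


def build_run_command(model_app, app_dir="webui/app", pattern="{name}.py"):
    """Return the argv list that would launch one single-model app.

    Hypothetical helper: the file-name pattern is an assumption.
    """
    if model_app not in MODEL_APPS:
        raise ValueError(f"unknown model_app {model_app!r}; expected one of {MODEL_APPS}")
    script = Path(app_dir) / pattern.format(name=model_app)
    # sys.executable keeps the app in the same interpreter/virtualenv.
    return [sys.executable, str(script)]
```

The returned list can be passed to subprocess.run from the repository root, for example subprocess.run(build_run_command("yolov8"), check=True).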

🧾TODO

support more models and libraries

OpenMMLab

detectron2 series

EasyCV

AdaDet

gluon-cv

  • building...

PaddleDetection

  • building...

docker image build

  • building...

merge all ui.py in one

  • building...

🌸Reference

  • stable-diffusion-webui: Stable Diffusion web UI
  • torchvision: Datasets, Transforms and Models specific to Computer Vision
  • timm: PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
  • yolov3: YOLOv3 in PyTorch > ONNX > CoreML > TFLite
  • yolov5: YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
  • ultralytics: NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
  • super-gradients: Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
  • MMEngine: OpenMMLab foundational library for training deep learning models.
  • MMCV: OpenMMLab foundational library for computer vision.
  • MMPreTrain: OpenMMLab pre-training toolbox and benchmark.
  • MMagic: OpenMMLab Advanced, Generative and Intelligent Creation toolbox.
  • MMDetection: OpenMMLab detection toolbox and benchmark.
  • MMDetection3D: OpenMMLab's next-generation platform for general 3D object detection.
  • MMRotate: OpenMMLab rotated object detection toolbox and benchmark.
  • MMYOLO: OpenMMLab YOLO series toolbox and benchmark.
  • MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark.
  • MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
  • MMPose: OpenMMLab pose estimation toolbox and benchmark.
  • MMHuman3D: OpenMMLab 3D human parametric model toolbox and benchmark.
  • MMSelfSup: OpenMMLab self-supervised learning toolbox and benchmark.
  • MMRazor: OpenMMLab model compression toolbox and benchmark.
  • MMFewShot: OpenMMLab fewshot learning toolbox and benchmark.
  • MMAction2: OpenMMLab's next-generation action understanding toolbox and benchmark.
  • MMTracking: OpenMMLab video perception toolbox and benchmark.
  • MMFlow: OpenMMLab optical flow toolbox and benchmark.
  • MMEditing: OpenMMLab image and video editing toolbox.
  • MMGeneration: OpenMMLab image and video generative models toolbox.
  • MMDeploy: OpenMMLab model deployment framework.
  • MIM: MIM installs OpenMMLab packages.
  • MMEval: OpenMMLab machine learning evaluation library.
  • Playground: A central hub for gathering and showcasing amazing projects built upon OpenMMLab.
  • detectron2: Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
  • detrex: detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
  • gluon-cv: Gluon CV Toolkit
  • autogluon: AutoGluon: AutoML for Image, Text, Time Series, and Tabular Data
  • EasyCV: An all-in-one toolkit for computer vision
  • AdaDet: AdaDet: A Development Toolkit for Object Detection based on ModelScope
  • mediapipe: MediaPipe Solutions provides a suite of libraries and tools for you to quickly apply artificial intelligence (AI) and machine learning (ML) techniques in your applications.
  • dlib: A toolkit for making real world machine learning and data analysis applications in C++

vision-process-webui's Issues

To create a public link, set `share=True` in `launch()`.

Looking in yolov8_app.py I don't see launch() anywhere

I tried adding it like so

if __name__ == '__main__':
    web_ui = YOLOv8WebUI()
    web_ui.launch(share=True)

But that fails with

Traceback (most recent call last):
  File "/home/vision-process-webui/webui/app/yolov8_app.py", line 102, in <module>
    web_ui.launch(share=True)
TypeError: YOLOv8WebUI.launch() got an unexpected keyword argument 'share'

What file should I be adding `share=True` to, please?

I read this for stable-diffusion-webui but nothing there helped
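
The TypeError above suggests that YOLOv8WebUI.launch() is the project's own method rather than Gradio's, and that it does not accept Gradio's launch options. Gradio's own launch() on gr.Interface and gr.Blocks does accept share=True. One possible fix, sketched here under the assumption that the class holds a Gradio interface object internally (its real source is not shown on this page), is to have the wrapper forward keyword arguments. WebUI and RecordingInterface below are hypothetical stand-ins, not code from the repository.

```python
class WebUI:
    """Hypothetical stand-in for a wrapper class such as YOLOv8WebUI."""

    def __init__(self, interface):
        # `interface` is any object with a Gradio-style launch(**kwargs),
        # for example a gr.Interface or gr.Blocks instance.
        self.interface = interface

    def launch(self, **launch_kwargs):
        # Forward share=True (and any other option) instead of rejecting it.
        return self.interface.launch(**launch_kwargs)


class RecordingInterface:
    """Test double that records the keyword arguments passed to launch()."""

    def __init__(self):
        self.calls = []

    def launch(self, **kwargs):
        self.calls.append(kwargs)
        return "launched"


if __name__ == "__main__":
    fake = RecordingInterface()
    WebUI(fake).launch(share=True)
    print(fake.calls)  # [{'share': True}]
```

If the real class builds its interface inside launch(), the same idea applies: accept **launch_kwargs in the method signature and pass them to the inner Gradio launch() call.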
