Coder Social home page Coder Social logo

awesome-anything's Introduction

Awesome-Anything

Awesome Anything

A curated list of general AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, etc.

Contributions are welcome!

  • Awesome-Anything
    • AnyObject - Segmentation, Detection, Classification, Medical Image, OCR, etc.
    • AnyGeneration - Text-to-Image Generation, Editing, Inpainting.
    • AnyTask - LLM Controller + ModelZoo, General Decoding, Multi-Task Learning.
    • AnyModel - Network Pruning, Network Quantization, Model Reuse.
    • AnyX - Other Topics: Captioning, etc.
    • Paper List

AnyObject

Title & Authors Intro Useful Links
Star
Segment Anything
Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alex Berg, Wan-Yen Lo, Piotr Dollar, Ross Girshick
Preprint'23

[Segment Anything (Project)]
intro [Github]
[Page]
[Demo]
Star
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Shilong Liu and Zhaoyang Zeng and Tianhe Ren and Feng Li and Hao Zhang and Jie Yang and Chunyuan Li and Jianwei Yang and Hang Su and Jun Zhu and Lei Zhang
Preprint'23

[Grounded-SAM, GroundingDINO (Project)]
intro [Github]
[Demo]
Star
SegGPT: Segmenting Everything In Context
Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang
Preprint'23

[SegGPT (Project)]
image [Github]
V3Det: Vast Vocabulary Visual Detection Dataset
Jiaqi Wang, Pan Zhang, Tao Chu, Yuhang Cao, Yujie Zhou, Tong Wu, Bin Wang, Conghui He, Dahua Lin
Preprint'23
image --
Star
segment-anything-video (Project)
Kadir Nar
intro [Github]
Star
Towards Segmenting Anything That Moves
Achal Dave, Pavel Tokmakov, Deva Ramanan
ICCV'19 Workshop

[segment-any-moving (Project)]
[Github]
Star
Semantic Segment Anything
Jiaqi Chen, Zeyu Yang, Li Zhang

[Semantic-Segment-Anything (Project)]
image [Github]
Star
Grounded Segment Anything: From Objects to Parts (Project)
Peize Sun and Shoufa Chen
intro [Github]
Star
GroundedSAM-zero-shot-anomaly-detection (Project)
Yunkang Cao
image [Github]
Star
Segment Anything Labelling Tool (SALT) (Project)
Anurag Ghosh
intro [Github]
Star
Prompt-Segment-Anything (Project)
Rockey
intro [Github]
Star
SAM-RBox (Project)
Qingyun Li
intro [Github]
Star
VISAM (Project)
Feng Yan, Weixin Luo, Yujie Zhong, Yiyang Gan, Lin Ma
intro [Github]
Star
Segment Anything Prompt (Project)
MagicSource
intro [Github]
Star
Segment Anything EO tools: Earth observation tools for Meta AI Segment Anything (Project)
Aliaksandr Hancharenka, Alexander Chichigin
intro [Github]
Star
napari-segment-anything: Segment Anything Model (SAM) native Qt UI (Project)
Jordão Bragantini, Kyle I S Harrington, Ajinkya Kulkarni
image [Github]
Star
SAM-Medical-Imaging: Segment Anything Model (SAM) native Qt UI (Project)
Jordão Bragantini, Kyle I S Harrington, Ajinkya Kulkarni
image [Github]
Star
OCR-SAM: Combining MMOCR with Segment Anything & Stable Diffusion. (Project)
Zhenhua Yang, Qing Jiang
image [Github]



AnyGeneration

Title & Authors Intro Useful Links
Star
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach and Andreas Blattmann and Dominik Lorenz and Patrick Esser and Björn Ommer
CVPR'22

[Stable-Diffusion (Project)]
intro [Github]
[Page]
[Demo]
Star
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang, Maneesh Agrawala
Preprint'23

[ControlNet (Project)]
intro [Github]
[Demo]
GigaGAN: Large-scale GAN for Text-to-Image Synthesis
Minguk Kang, Jun-Yan Zhu, Richard Zhang, Jaesik Park, Eli Shechtman, Sylvain Paris, Taesung Park
CVPR'23
image [Page]
Star
Inpaint-Anything: Segment Anything Meets Image Inpainting (Project)
Tao Yu
intro [Github]
Star
IEA: Image Editing Anything (Project)
Zhengcong Fei
intro [Github]
Star
EditAnything (Project)
Shanghua Gao, Pan Zhou
intro [Github]
Star
Segment Anything for Stable Diffusion Webui (Project)
Chengsong Zhang
image [Github]
Star
Segment Anything with Clip (Project)
Jinwoo Park
intro [Github]
Star
ShowAnything: Edit and Generate Anything In Image and Video (Project)
Showlab, NUS
intro Github



AnyTask

Title & Authors Intro Useful Links
Star
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace
Yongliang Shen, Kaitao Song, Xu Tan, Dongsheng Li, Weiming Lu, Yueting Zhuang
Preprint'23

[Jarvis (Project)]
[Github]
[Demo]
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs
Yaobo Liang, Chenfei Wu, Ting Song, Wenshan Wu, Yan Xia, Yu Liu, Yang Ou, Shuai Lu, Lei Ji, Shaoguang Mao, Yun Wang, Linjun Shou, Ming Gong, Nan Duan Preprint'23
intro [Github]
Star
Generalized Decoding for Pixel, Image and Language
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao
CVPR'23

[X-Decoder (Project)]
intro [Github]
[Page]
[Demo]
Star
Pre-Trained Image Processing Transformer
Chen, Hanting and Wang, Yunhe and Guo, Tianyu and Xu, Chang and Deng, Yiping and Liu, Zhenhua and Ma, Siwei and Xu, Chunjing and Xu, Chao and Gao, Wen
CVPR'21

[Pretrained-IPT (Project)]
intro [Github]
Star
OpenAGI: When LLM Meets Domain Experts
Yingqiang Ge, Wenyue Hua, Jianchao Ji, Juntao Tan, Shuyuan Xu, Yongfeng Zhang

[OpenAGI (Project)]
intro Github



AnyModel

Title & Authors Intro Useful Links
Star
DepGraph: Towards Any Structural Pruning
Gongfan Fang, Xinyin Ma, Mingli Song, Michael Bi Mi, Xinchao Wang
CVPR'23

[Torch-Pruning (Project)]
intro [Github]
[Demo]
Star
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Yuhang Li and Mingzhu Shen and Jian Ma and Yan Ren and Mingxin Zhao and Qi Zhang and Ruihao Gong and Fengwei Yu and Junjie Yan
NeurIPS'21

[MQBench (Project)]
intro [Github]
[Page]
Star
OTOv2: Automatic, Generic, User-Friendly
Tianyi Chen, Luming Liang, Tianyu Ding, Ilya Zharkov
ICLR'23

[Only Train Once (Project)]
intro [Github]
Star
Deep Model Reassembly
Xingyi Yang, Daquan Zhou, Songhua Liu, Jingwen Ye, Xinchao Wang
NeurIPS'22

[Deep Model Reassembly (Project)]
intro [Github]
[Page]



AnyX

Title & Authors Intro Useful Links
Star
Caption Anything (Project)
Teng Wang, Jinrui Zhang, Junjie Fei, Yunlong Tang, Zhe Li, Mingqi Gao
intro [Github]
Star
Image2Paragraph:Transform Image into Unique Paragraph (Project)
Jinpeng Wang
intro Github
...



Paper List for Anything AI

A paper list for Anything AI

AnyObejct

Paper First Author Venue Topic
Segment Anything Alexander Kirillov Preprint'23 Segmentation
Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection Shilong Liu Preprint'23 Grouding+Detection
SegGPT: Segmenting Everything In Context Xinlong Wang Preprint'23 Segmentation
V3Det: Vast Vocabulary Visual Detection Dataset Jiaqi Wang Preprint'23 Dataset

AnyGeneration

Paper First Author Venue Topic
High-Resolution Image Synthesis with Latent Diffusion Models Robin Rombach CVPR'22 Text-to-Image Generation
Adding Conditional Control to Text-to-Image Diffusion Models Lvmin Zhang Preprint'23 Controlllable Generation
GigaGAN: Large-scale GAN for Text-to-Image Synthesis Minguk Kang CVPR'23 Large-scale GAN

AnyModel

Paper First Author Venue Topic
DepGraph: Towards Any Structural Pruning Gongfan Fang CVPR'23 Network Pruning
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark Yuhang Li NeurIPS'21 Network Quantization
OTOv2: Automatic, Generic, User-Friendly Tianyi Chen ICLR'23 Network Pruning
Deep Model Reassembly Xingyi Yang NeurIPS'22 Model Reuse

AnyTask

Paper First Author Venue Topic
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace Yongliang Shen Preprint'23 Modelzoo + LLM
TaskMatrix.AI: Completing Tasks by Connecting Foundation Models with Millions of APIs Yaobo Liang Preprint'23 Modelzoo + LLM
Generalized Decoding for Pixel, Image and Language Xueyan Zou CVPR'23 Multi Tasking
Pre-Trained Image Processing Transformer Chen, Hanting CVPR'21 Low-level Vision

awesome-anything's People

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Forkers

jeromyjsmith

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.