Topic: clip
Something interesting about clip
clip,An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
User: arrowluo
Home Page: https://arxiv.org/abs/2104.08860
clip,Android Easy Reveal Library
User: chrisvin
clip,CLIPort: What and Where Pathways for Robotic Manipulation
User: cliport
Home Page: https://cliport.github.io
clip,Effortless data labeling with AI support from Segment Anything and other awesome models.
User: cvhub520
clip,Official PyTorch implementation of "CLIPstyler: Image Style Transfer with a Single Text Condition" (CVPR 2022)
User: cyclomon
clip,CLIP + FFT/DWT/RGB = text to image/video
User: eps696
clip,Search photos on Unsplash using natural language
User: haltakov
clip,Search inside YouTube videos using natural language
User: haltakov
clip,[ECCV 2022] Official PyTorch implementation of the paper Image-Based CLIP-Guided Essence Transfer.
User: hila-chefer
clip,[ICCV 2021- Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-based network. Including examples for DETR, VQA.
User: hila-chefer
clip,PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)
User: j-min
Home Page: https://arxiv.org/abs/2205.13115
clip,Collection of AWESOME vision-language models for vision tasks
User: jingyi0000
clip,ZMJImageEditor is an image-editing component similar to the one in WeChat. It is powerful and easy to integrate, supporting drawing, text, rotation, cropping, stickers, and other functions.
User: keshiim
clip,Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam
User: leondgarse
clip,GenSim: Generating Robotic Simulation Tasks via Large Language Models
User: liruiw
Home Page: https://liruiw.github.io/gensim
clip,Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Organization: marqo-ai
Home Page: https://www.marqo.ai/
clip,"Video-ChatGPT" is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
Organization: mbzuai-oryx
Home Page: https://mbzuai-oryx.github.io/Video-ChatGPT
clip,Getting the latest versions of Disco Diffusion to work locally, instead of colab. Including how I run this on Windows, despite some Linux only dependencies ;)
User: mohamadzeina
clip,CLIP inference in plain C/C++ with no extra dependencies
User: monatis
clip,Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Organization: ofa-sys
clip,Official Pytorch Implementation for "Text2LIVE: Text-Driven Layered Image and Video Editing" (ECCV 2022 Oral)
User: omerbt
Home Page: https://text2live.github.io/
clip,Open-source evaluation toolkit for large vision-language models (LVLMs), supporting GPT-4v, Gemini, QwenVLPlus, 30+ HF models, and 15+ benchmarks
Organization: open-compass
Home Page: https://rank.opencompass.org.cn/leaderboard-multimodal
clip,OpenMMLab Pre-training Toolbox and Benchmark
Organization: open-mmlab
Home Page: https://mmpretrain.readthedocs.io/en/latest/
clip,Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Organization: opengvlab
clip,React component for truncating multi-line spans and adding an ellipsis.
User: pablosichert
Home Page: https://www.webpackbin.com/bins/-Kw6QnAkjmv1OD6Of-ZD
clip,Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Organization: paddlepaddle
clip,PASSL includes self-supervised image algorithms such as SimCLR, MoCo v1/v2, BYOL, CLIP, PixPro, simsiam, SwAV, BEiT, and MAE, as well as basic vision algorithms such as Vision Transformer, DEiT, Swin Transformer, CvT, T2T-ViT, MLP-Mixer, XCiT, ConvNeXt, and PVTv2.
Organization: paddlepaddle
clip,Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI (Nature Medicine). PLIP is a large-scale pre-trained model that can be used to extract visual and language features from pathology images and text description. The model is a fine-tuned version of the original CLIP model.
Organization: pathologyfoundation
clip,FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.
User: patrickjohncyh
clip,[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
User: pengsongyou
Home Page: https://pengsongyou.github.io/openscene
clip,Image to prompt with BLIP and CLIP
User: pharmapsychotic
clip,🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
User: qin2dim
Home Page: https://docs.captchax.top/
clip,Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥
Organization: roboflow
clip,Easily compute clip embeddings and build a clip retrieval system with them
User: rom1504
Home Page: https://rom1504.github.io/clip-retrieval/
clip,Rapid Android UI development, built to fix the many shortcomings of native widgets
User: ruffianzhong
clip,Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm
Organization: sense-gvt
clip,👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials]
User: skalskip
clip,An optimized text-to-image model based on Stable Diffusion. It accepts both Chinese and English text inputs and can generate high-quality images in several modern art styles.
User: skyworkaigc
Home Page: https://sky-paint.singularity-ai.com/index.html#/
clip,Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Organization: unum-cloud
Home Page: https://unum-cloud.github.io/uform/
clip,Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.
User: v-iashin
Home Page: https://v-iashin.github.io/video_features
clip,CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
Organization: xmed-lab
clip,Chinese NLP solutions (large models, data, models, training, inference)
User: yuanzhoulvpi2017
clip,Language Models Can See: Plugging Visual Controls in Text Generation
User: yxuansu
Home Page: https://arxiv.org/abs/2205.02655
clip,Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
User: yzhuoning