lily11223344,data,github

a-pytorch-project-to-image-caption

Image Caption with Attention | a PyTorch Project to Image Caption

a-pytorch-tutorial-to-image-captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

ad-nerf

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

attnganwithbert

Implementation of a text to image generator in ATTNGAN paper improved using BERT transformer

Live real-time avatars from your webcam in the browser. No dedicated hardware or software installation needed. A pure Google Colab wrapper for live First-order-motion-model, aka Avatarify in the browser. And other Colabs providing an accessible interface for using FOMM, Wav2Lip and Liquid-warping-GAN with your own media and a rich GUI.

basic_vqa

Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)

bert

TensorFlow code and pre-trained models for BERT

bidirectional_dalle

Translation-equivariant Image Quantizer for Bi-directional Image-Text Generation, Stage 2

butd_model

A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.

clip

Contrastive Language-Image Pretraining

clip-gen

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

clip-training

Code to train CLIP model

cmpc-refseg

Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.

cogview2

official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

controlgan-tensorflow

Simple Tensorflow implementation of "ControlGAN: Controllable Text-to-Image Generation" (NeurIPS 2019)

cross-attention-vizwiz-vqa

A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset originates from images and questions compiled by members of the visually impaired community and as such, highlights some of the challenges presented by this particular use case.

lily11223344 Goto Github PK

data's Projects

Recommend Projects

Recommend Topics

Recommend Org