Coder Social home page Coder Social logo

data's Projects

ad-nerf icon ad-nerf

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

attnganwithbert icon attnganwithbert

Implementation of a text to image generator in ATTNGAN paper improved using BERT transformer

avatars4all icon avatars4all

Live real-time avatars from your webcam in the browser. No dedicated hardware or software installation needed. A pure Google Colab wrapper for live First-order-motion-model, aka Avatarify in the browser. And other Colabs providing an accessible interface for using FOMM, Wav2Lip and Liquid-warping-GAN with your own media and a rich GUI.

basic_vqa icon basic_vqa

Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)

bert icon bert

TensorFlow code and pre-trained models for BERT

bidirectional_dalle icon bidirectional_dalle

Translation-equivariant Image Quantizer for Bi-directional Image-Text Generation, Stage 2

butd_model icon butd_model

A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.

clip icon clip

Contrastive Language-Image Pretraining

clip-gen icon clip-gen

CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP

cmpc-refseg icon cmpc-refseg

Code for Referring Image Segmentation via Cross-Modal Progressive Comprehension, CVPR2020.

cogview2 icon cogview2

official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

controlgan-tensorflow icon controlgan-tensorflow

Simple Tensorflow implementation of "ControlGAN: Controllable Text-to-Image Generation" (NeurIPS 2019)

cross-attention-vizwiz-vqa icon cross-attention-vizwiz-vqa

A self-evident application of the VQA task is to design systems that aid blind people with sight reliant queries. The VizWiz VQA dataset originates from images and questions compiled by members of the visually impaired community and as such, highlights some of the challenges presented by this particular use case.

css-vqa icon css-vqa

Counterfactual Samples Synthesizing for Robust VQA

ddpm icon ddpm

PyTorch DDPM implementation

deepsc-s icon deepsc-s

Semantic Communication Systems for Speech Transmission

evp icon evp

Code for paper 'Audio-Driven Emotional Video Portraits'.

gcn-glac icon gcn-glac

Graph convolution-based visual storytelling

glide-text2im icon glide-text2im

GLIDE: a diffusion-based text-conditional image synthesis model

graphvqa icon graphvqa

GraphVQA: Language-Guided Graph Neural Networks for Scene Graph Question Answering

image-captioning icon image-captioning

CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image caption or image to text on MS-COCO dataset. 图片描述

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.