Coder Social home page Coder Social logo

data's Projects

image-captioning-1 icon image-captioning-1

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]

image-captioning-2 icon image-captioning-2

Computer Vision: Generate captions that describe the contents of images using PyTorch

iqan icon iqan

Visaul Question Generation as Dual Task of Visual Question Answering (PyTorch Version)

irlc-vqa-counting icon irlc-vqa-counting

Code for Interpretable Counting for Visual Question Answering for ICLR 2018 reproducibility challenge.

lantern icon lantern

Lantern官方版本下载 蓝灯 翻墙 代理 科学上网 外网 加速器 梯子 路由 lantern proxy vpn censorship-circumvention censorship gfw accelerator

lscm-refseg icon lscm-refseg

Code for Linguistic Structure Guided Context Modeling for Referring Image Segmentation, ECCV2020.

lxmert-test icon lxmert-test

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

mac-network icon mac-network

Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)

mcan-vqa icon mcan-vqa

Deep Modular Co-Attention Networks for Visual Question Answering

mfas icon mfas

Implementation of CVPR 2019 paper "Mfas: Multimodal fusion architecture search"

mkgformer icon mkgformer

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

multilingual-vqa icon multilingual-vqa

Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.

naivevqa icon naivevqa

A Visual Question Answering model implemented in MindSpore and PyTorch. The model is a reimplementation of the paper *Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering*. It's our final project for course DL4NLP at ZJU.

rosita icon rosita

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

semantic-communication-systems icon semantic-communication-systems

pytorch implementation of "Deep Learning-Enabled Semantic Communication Systems with Task-Unaware Transmitter and Dynamic Data"

soho icon soho

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

ssl-vqa icon ssl-vqa

Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering

style-attngan icon style-attngan

Improves Text to Image synthesis from AttnGAN by integrating the scale-specific control from StyleGAN; can optionally use GPT-2 as text encoder

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.