lily11223344,data,github

image-captioning-1

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]

image-captioning-2

Computer Vision: Generate captions that describe the contents of images using PyTorch

image-captioning-3

Image captioning models "show and tell" + "show, attend and tell" in PyTorch

iqan

Visaul Question Generation as Dual Task of Visual Question Answering (PyTorch Version)

irlc-vqa-counting

Code for Interpretable Counting for Visual Question Answering for ICLR 2018 reproducibility challenge.

lantern

Lantern官方版本下载蓝灯翻墙代理科学上网外网加速器梯子路由 lantern proxy vpn censorship-circumvention censorship gfw accelerator

lscm-refseg

Code for Linguistic Structure Guided Context Modeling for Referring Image Segmentation, ECCV2020.

lxmert-test

PyTorch code for EMNLP 2019 paper "LXMERT: Learning Cross-Modality Encoder Representations from Transformers".

mac-network

Implementation for the paper "Compositional Attention Networks for Machine Reasoning" (Hudson and Manning, ICLR 2018)

mcan-vqa

Deep Modular Co-Attention Networks for Visual Question Answering

mfas

Implementation of CVPR 2019 paper "Mfas: Multimodal fusion architecture search"

mkgformer

Code for the SIGIR 2022 paper "Hybrid Transformer with Multi-level Fusion for Multimodal Knowledge Graph Completion"

multilingual-vqa

Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.

murel.bootstrap.pytorch

MUREL (CVPR 2019), a multimodal relational reasoning module for VQA

A Visual Question Answering model implemented in MindSpore and PyTorch. The model is a reimplementation of the paper *Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering*. It's our final project for course DL4NLP at ZJU.

neural-vqa-attention

:question: Attention-based Visual Question Answering in Torch

nmn-pytorch

Neural Module Network for VQA in Pytorch

pytorch-fastcampus

PyTorch로 시작하는 딥러닝 입문 CAMP (2017.7~2017.12) 강의자료

pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

qll

reasoningconsistency-vqa

relation-network-tensorflow

Tensorflow implementations of Relational Networks and a VQA dataset named Sort-of-CLEVR proposed by DeepMind.

rosita

ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

selfsupervisedimagetext

semantic-communication-systems

pytorch implementation of "Deep Learning-Enabled Semantic Communication Systems with Task-Unaware Transmitter and Dynamic Data"

soho

[CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning

ssl-vqa

Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering

style-attngan

Improves Text to Image synthesis from AttnGAN by integrating the scale-specific control from StyleGAN; can optionally use GPT-2 as text encoder

lily11223344 Goto Github PK

data's Projects

Recommend Projects

Recommend Topics

Recommend Org