Light

huangjh98 / awesome-text2video-retrieval Goto Github PK

View Code? Open in Web Editor NEW

This project forked from liuxiaolei88/awesome-text2video-retrieval

0.0 0.0 0.0 7 KB

一个近几年来各大视觉顶会关于视频文本检索的库，同步我的博客：https://blog.csdn.net/AAliuxiaolei/article/details/121433833

awesome-text2video-retrieval's Introduction

Awesome-Text-Video-Retrieval

一个近几年来各大顶会关于视频文本检索库，同步我的博客：https://blog.csdn.net/AAliuxiaolei/article/details/121433833

汇总与Video Retrieval 各大顶会的文章以及相关论文查找的链接

附一个比较好总结的GitHub仓库

https://github.com/danieljf24/awesome-video-text-retrieval

ICCV

2021 https://openaccess.thecvf.com/ICCV2021
TeachText: CrossModal Generalized Distillation for Text-Video Retrieval
HiT: Hierarchical Transformer With Momentum Contrast for Video-Text Retrieval
TACo: Token-Aware Cascade Contrastive Learning for Video-Text Alignment

2019 https://openaccess.thecvf.com/ICCV2019
Neighborhood Preserving Hashing for Scalable Video Retrieval
SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips

SIGIR

2021 https://sigir.org/sigir2021/accepted-papers/
Improving Video Retrieval by Adaptive Margin
Hierarchical Cross-Modal Graph Consistency Learning for Video-Text Retrieval

2020 http://www.sigir.org/sigir2020/accepted-papers/
Tree-augmented Cross-Modal Encoding for Complex-Query Video Retrieval

2019 http://sigir.org/sigir2019/program/accepted/
无

ACM MM

2021 https://2021.acmmm.org/main-track-list
Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training
HANet: Hierarchical Alignment Networks for Video-Text Retrieval
Discriminative Latent Semantic Graph for Video Captioning
Fine-grained Cross-modal Alignment Network for Text-Video Retrieval
Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval
Progressive Semantic Matching for Video-Text Retrieval
CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising∗

2020 https://2020.acmmm.org/main-track-list.html
Interpretable Embedding for Ad-Hoc Video Search
Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval
A W2VV++ Case Study with Automated and Interactive Text-to-Video Retrieval

2019 https://2019.acmmm.org/accepted-papers/index.html
You Only Recognize Once: Towards Fast Video Text Spotting

AAAI

2021 https://aaai.org/Conferences/AAAI-21/wp-content/uploads/2020/12/AAAI-21_Accepted-Paper-List.Main_.Technical.Track_.pdf
Mind-the-Gap! Unsupervised Domain Adaptation for Text-Video Retrieval

2020 https://aaai.org/Conferences/AAAI-20/wp-content/uploads/2020/01/AAAI-20-Accepted-Paper-List.pdf
无

2019 https://aaai.org/Conferences/AAAI-19/wp-content/uploads/2018/11/AAAI-19_Accepted_Papers.pdf
无

IJCAI

2021 https://www.ijcai.org/proceedings/2021/
Dig into Multi-modal Cues for Video Retrieval with Hierarchical Alignment

2020 https://www.ijcai.org/proceedings/2020/
Exploiting Visual Semantic Reasoning for Video-Text Retrieval

2019 https://www.ijcai.org/proceedings/2019/
无

CVPR

2021 https://openaccess.thecvf.com/CVPR2021
T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval
On Semantic Similarity in Video Retrieval
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval
Less is more: Clipbert for video-and-language learning via sparse sampling
Mdmmt: Multidomain multimodal transformer for video retrieval

2020 https://openaccess.thecvf.com/CVPR2020_search
ActBERT: Learning Global-Local Video-Text Representations
Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning

2019 https://openaccess.thecvf.com/CVPR2019_search
无

WACV

2021 https://openaccess.thecvf.com/WACV2021
Temporal Context Aggregation for Video Retrieval With Contrastive Learning

2019 https://openaccess.thecvf.com/WACV2020_search
无

ECCV

2020
Gabeur, Valentin, et al. "Multi-modal transformer for video retrieval." Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part IV 16. Springer International Publishing, 2020.
Graph Wasserstein Correlation Analysis for Movie Retrieval

ICLR

2022 https://openreview.net/group?id=ICLR.cc/2021/Conference
LEARNING CONTEXT-ADAPTED VIDEO-TEXT RETRIEVAL BY ATTENDING TO USER COMMENTS

2021 https://openreview.net/group?id=ICLR.cc/2021/Conference
PARAMETER EFFICIENT MULTIMODAL TRANSFORMERS FOR VIDEO REPRESENTATION LEARNING
Support-set bottlenecks for video-text representation learning

2020 https://openreview.net/group?id=ICLR.cc/2021/Conference
无

2019 https://openreview.net/group?id=ICLR.cc/2019/Conference
无

TIP

2021 https://ieeexplore.ieee.org/xpl/tocresult.jsp?isnumber=9263394&punumber=83&sortType=vol-only-seq&searchWithin=retrieval&pageNumber=6
Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video Retrieval

2020
无

2019
无

TPAMI

2021 https://ieeexplore.ieee.org/xpl/tocresult.jsp?isnumber=4359286&searchWithin=video&pageNumber=1
Generalized Few-Shot Video Classification with Video Retrieval and Feature Generation
Dual Encoding for Video Retrieval by Text
Universal Weighting Metric Learning for Cross-Modal Retrieval
Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics

awesome-text2video-retrieval's People

Contributors

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.