Topic: video-captioning Goto Github

Something interesting about video-captioning

👇 Here are 73 public repositories matching this topic...

abdelrhman-yasser / video-content-description

video-captioning,Video content description model for generating descriptions for unconstrained videos

User: abdelrhman-yasser

nlp deep-learning tensorflow keras-tensorflow video-captioning

acherstyx / cocap

video-captioning,[ICCV 2023] Accurate and Fast Compressed Video Captioning

User: acherstyx

Home Page: https://arxiv.org/abs/2309.12867

compressed-video iccv2023 video-captioning

acht7111020 / adlxmlds2017

video-captioning,Deep learning projects in ADL (2017 FALL) @ NTU. Implemented in TensorFlow and Python. Covers Sequence Labeling, Video Captioning, Game Playing, and Comics Generation.

User: acht7111020

video-captioning game-playing-agent comics-generation sequence-labeling deep-learning tensorflow

adit31 / captionomaly-deep-learning-toolbox-for-anomaly-captioning

video-captioning,Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos

User: adit31

python tensorflow anomaly-detection video-captioning anomaly-captioning keras ucfc-vd ucf101 msrvtt

aimagelab / mvad-names-dataset

video-captioning,M-VAD Names Dataset. Multimedia Tools and Applications (2019)

Organization: aimagelab

mvad-names-dataset captioning-videos video-captioning

amazon-science / crossmodal-contrastive-learning

video-captioning,CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021

Organization: amazon-science

multi-modality video video-text-retrieval video-captioning computer-vision natural-language-processing transformers contrastive-learning

antoyang / vidchapters

video-captioning,[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

User: antoyang

Home Page: http://arxiv.org/abs/2309.13952

dense-video-captioning multimodal-learning pre-training temporal-language-grounding video-captioning video-understanding vision-and-language weakly-supervised-learning vid2seq video-chapter-generation

arpitpatel1501 / capsum-joint-video-summarization-and-video-captioing

video-captioning,AI-based Video summarizer along with captioning.

User: arpitpatel1501

video-summarization video-captioning gcp-storage gcp-sql gcp-app-engine

awkrail / svpc

video-captioning,Official implementation of state-aware video procedural captioning (ACM MM 2021)

User: awkrail

video-captioning youcook2

b05902062 / tdconved

video-captioning,implementation of TDConvED for video captioning

User: b05902062

video-captioning

bytedance / shot2story

video-captioning,A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.

Organization: bytedance

Home Page: https://mingfei.info/shot2story

benchmark dataset large-language-models video-language video-language-pretraining video-question-answering video-summarization vision-language video-captioning video-story

crux82 / msr-vtt-it

video-captioning,A large scale dataset for Video Captioning in Italian

User: crux82

video-captioning deep-neural-networks dataset italian italiano

imshaikot / srt-webvtt

video-captioning,Convert SRT formatted subtitle to WebVTT on the fly over HTML5/browser environment

User: imshaikot

Home Page: https://imshaikot.github.io/srt-webvtt

video video-captioning web-vtt srt-subtitles converter html5 html5-video

jacobswan1 / video2commonsense

video-captioning,Video captioning baseline models on Video2Commonsense Dataset.

User: jacobswan1

Home Page: https://asu-active-perception-group.github.io/Video2Commonsense/

video-captioning video2commonsense commonsense-question-answering commonsense-story

jasoneppink / multicaptions

video-captioning,Multicaptions processes and displays subtitles on a graphic LCD or VFD display while simultaneously playing fullscreen HD video on a separate monitor

User: jasoneppink

raspberry-pi lcd-display subtitle video arduino captions video-captioning vfd

jasonyao81000 / mlds2018spring

video-captioning,Machine Learning and having it Deep and Structured (MLDS) in 2018 spring

User: jasonyao81000

mlds2018spring ntu hung-yi-lee mlds seq2seq sequence-to-sequence gan generative-adversarial-network reinforcement-learning policy-gradient

jayleicn / recurrent-transformer

video-captioning,[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

User: jayleicn

Home Page: https://arxiv.org/abs/2005.05402

pytorch video-captioning activitynet-captions youcook2

jayleicn / tvcaption

video-captioning,[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset

User: jayleicn

Home Page: https://tvr.cs.unc.edu/tvc.html

video-captioning dataset pytorch

jpthu17 / emcl

video-captioning,[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

User: jpthu17

cross-modal-retrieval neurips video-captioning video-question-answering video-retrieval

jssprz / attentive_specialized_network_video_captioning

video-captioning,Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*

User: jssprz

video-captioning msvd msr-vtt icpr2020 deep-learning video-to-text video-description

jssprz / video_captioning_datasets

video-captioning,Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*

User: jssprz

video-captioning video-description vision-and-language video-dataset video-to-text msvd msr-vtt activitynet-captions trecvid charades

jssprz / video_features_extractor

video-captioning,Python implementation of extraction of several visual features representations from videos

User: jssprz

visual-representation video-representation cnn c3d video-captioning msvd msr-vtt trecvid activitynet-captions vatex

jssprz / visual_syntactic_embedding_video_captioning

video-captioning,Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*

User: jssprz

video-captioning msvd msr-vtt wacv2021 deep-learning pos-tagging representation-learning encoder-decoder syntactic-representations video-description video-to-text

kamino666 / video-captioning-transformer

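video-captioning,A video captioning deep learning model implemented in PyTorch using the Transformer architecture. The video captioning task takes a video as input and outputs one sentence describing the content of the whole video (provided the video is short and can be described in a single sentence). The main goal of this repo is to help visually impaired people enjoy online videos and perceive their surroundings, and to promote the development of "accessible video".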

User: kamino666

pytorch transformer video-captioning

lvapeab / abivirnet

video-captioning,Attention Bidirectional Video Recurrent Net

User: lvapeab

video-captioning deep-learning keras theano attention-mechanism lstm python tensorflow

lvapeab / interactive-keras-captioning

video-captioning,Interactive multimedia captioning with Keras

User: lvapeab

Home Page: http://casmacat.prhlt.upv.es/interactive-seq2seq/

keras sequence-to-sequence transformer attention-mechanism rnn lstm image-captioning video-captioning interactive-machine-learning theano

mlvlab / meltr

video-captioning,MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)

Organization: mlvlab

cvpr2023 meta-learning multi-modal video-captioning video-question-answering video-retrieval

nasib-ullah / thvc

video-captioning,A PyTorch implementation of the paper Thinking Hallucination for Video Captioning.

User: nasib-ullah

accv2022 hallucinations video-captioning

nasib-ullah / video-captioning-models-in-pytorch

video-captioning,A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.

User: nasib-ullah

video-captioning deep-learning sequence-to-sequence msvd msrvtt s2vt pytorch pytorch-implementation video-captioning-models video

paritoshparmar / mtl-aqa

video-captioning,What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]

User: paritoshparmar

Home Page: https://arxiv.org/abs/1904.04346

action-quality-assessment mtl-aqa multitask-learning video-understanding video-processing video-captioning fine-grained-classification pytorch action-recognition fine-grained-action-recognition

pochih / video-cap

video-captioning,🎬 Video Captioning: ICCV '15 paper implementation

User: pochih

seq2seq video-captioning attention-mechanism tnesorflow deep-learning nlp computer-vision

rohit-gupta / video2language

video-captioning,Generating video descriptions using deep learning in Keras

User: rohit-gupta

deep-learning computer-vision natural-language-processing keras keras-models deep-video-analytics video-captioning video-to-text

scopeinfinity / video2description

video-captioning,Video to Text: Natural language description generator for some given video. [Video Captioning]

User: scopeinfinity

deep-neural-networks cnn-keras lstm-neural-networks image-captioning video-captioning video-processing audio-processing video-to-text

terry-r123 / awesome-captioning

video-captioning,A curated list of multimodal captioning research (including image captioning, video captioning, and text captioning)

User: terry-r123

image-captioning text-captioning video-captioning

thtang / adlxmlds2017

video-captioning,Deep learning works for ADLxMLDS (CSIE 5431) in NTU

User: thtang

Home Page: https://www.csie.ntu.edu.tw/~yvchen/f106-adl/index.html

seq2seq sequence-labeling video-captioning deep-reinforcement-learning atari-games gan conditional-gan dcgan s2vt image-generation

tomchang25 / whisper-auto-transcribe

video-captioning,Auto transcribe tool based on whisper

User: tomchang25

asr text-to-speech deep-learning speech-recognition speech-to-text language-model pytorch speech-processing voice-activity-detection gradio

tsujuifu / pytorch_empirical-mvm

video-captioning,A PyTorch implementation of EmpiricalMVM

User: tsujuifu

cvpr2023 pytorch pre-training video-captioning video-question-answering video-retrieval vision-and-language

txh-mercury / cosa

video-captioning,Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

User: txh-mercury

Home Page: https://arxiv.org/abs/2306.09085

video-captioning video-qa video-retrieval vision-language-pretraining video-language-pretrainng

uark-aicv / vlcap

video-captioning,[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning

Organization: uark-aicv

Home Page: https://ieeexplore.ieee.org/document/9897766

transformer video-captioning vision-and-language contrastive-learning

uark-aicv / vltint

video-captioning,[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning

Organization: uark-aicv

Home Page: https://uark-aicv.github.io/VLTinT/

aaai2023 transformer-architecture video-captioning vision-language pytorch video-paragraph-captioning

valterlej / zsarcap

video-captioning,Official code for Tell Me What You See: A Zero-Shot Action Recognition Method Based on Natural Language Descriptions (Multimedia Tools and Applications 2024)

User: valterlej

Home Page: https://link.springer.com/article/10.1007/s11042-023-16566-5

video-captioning zero-shot-learning cross-dataset-learning

vijayvee / video-captioning

video-captioning,This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. The system takes a video as input and generates an English caption describing it.

User: vijayvee

video-captioning tensorflow s2vt sequence-to-sequence multimodal-deep-learning seq2seq

willyfh / awesome-video-text-datasets

video-captioning,A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.

User: willyfh

dataset video-captioning video-description video-text video-to-text vision-language video-language video-retrieval

wingsbrokenangel / delving-deeper-into-the-decoder-for-video-captioning

video-captioning,Source code for Delving Deeper into the Decoder for Video Captioning

User: wingsbrokenangel

decoder msr-vtt msvd professional-learning semantics state-of-the-art tensorflow video-captioning

xiadingz / video-caption-opennmt.pytorch

video-captioning,Implementation of video captioning based on OpenNMT

User: xiadingz

pytorch video-captioning

xiadingz / video-caption.pytorch

video-captioning,PyTorch implementation of video captioning

User: xiadingz

deep-learning pytorch video-captioning

yangbang18 / care

video-captioning,(TIP) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information

User: yangbang18

concept-detection pytorch video-captioning

yehli / xmodaler

video-captioning,X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).

User: yehli

image-captioning video-captioning vision-and-language pretraining cross-modal-retrieval visual-question-answering tden

zjr2000 / llmva-gebc

video-captioning,Winning solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2023 workshop)

User: zjr2000

long-video-understanding pytorch-implementation video-captioning

zubairhussain / video-paragraph-captioning---keras

video-captioning,Generating paragraph captions for videos

User: zubairhussain

video-captioning keras python lstm
