Topic: video-captioning Goto Github
Some thing interesting about video-captioning
Some thing interesting about video-captioning
video-captioning,Video content description model for generating descriptions for unconstrained videos
User: abdelrhman-yasser
video-captioning,[ICCV 2023] Accurate and Fast Compressed Video Captioning
User: acherstyx
Home Page: https://arxiv.org/abs/2309.12867
video-captioning,Deep learning projects in ADL (2017 FALL) @ NTU. Implement in Tensorflow and Python. Work on Sequence Labeling, Video Caption, Game Playing, and Comics Generation.
User: acht7111020
video-captioning,Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos
User: adit31
video-captioning,M-VAD Names Dataset. Multimedia Tools and Applications (2019)
Organization: aimagelab
video-captioning,CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
Organization: amazon-science
video-captioning,[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
User: antoyang
Home Page: http://arxiv.org/abs/2309.13952
video-captioning,AI-based Video summarizer along with captioning.
User: arpitpatel1501
video-captioning,Official implementation of state-aware video procedural captioning (ACM MM 2021)
User: awkrail
video-captioning,implementation of TDConvED for video captioning
User: b05902062
video-captioning,A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.
Organization: bytedance
Home Page: https://mingfei.info/shot2story
video-captioning,A large scale dataset for Video Captioning in Italian
User: crux82
video-captioning,Convert SRT formatted subtitle to WebVTT on the fly over HTML5/browser environment
User: imshaikot
Home Page: https://imshaikot.github.io/srt-webvtt
video-captioning,Video captioning baseline models on Video2Commonsense Dataset.
User: jacobswan1
Home Page: https://asu-active-perception-group.github.io/Video2Commonsense/
video-captioning,Multicaptions processes and displays subtitles on a graphic LCD or VFD display while simultaneously playing fullscreen HD video on a separate monitor
User: jasoneppink
video-captioning,Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
User: jasonyao81000
video-captioning,[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
User: jayleicn
Home Page: https://arxiv.org/abs/2005.05402
video-captioning,[ECCV 2020] PyTorch code of MMT (a multimodal transformer captioning model) on TVCaption dataset
User: jayleicn
Home Page: https://tvr.cs.unc.edu/tvc.html
video-captioning,[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
User: jpthu17
video-captioning,Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*
User: jssprz
video-captioning,Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review*
User: jssprz
video-captioning,Python implementation of extraction of several visual features representations from videos
User: jssprz
video-captioning,Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding*
User: jssprz
video-captioning,这是一个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍者欣赏网络视频、感知周围环境,促进“无障碍视频”的发展。
User: kamino666
video-captioning,Attention Bidirectional Video Recurrent Net
User: lvapeab
video-captioning,Interactive multimedia captioning with Keras
User: lvapeab
Home Page: http://casmacat.prhlt.upv.es/interactive-seq2seq/
video-captioning,MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models (CVPR 2023)
Organization: mlvlab
video-captioning,A PyTorch implementation of the paper Thinking Hallucination for Video Captioning.
User: nasib-ullah
video-captioning,A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
User: nasib-ullah
video-captioning,What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
User: paritoshparmar
Home Page: https://arxiv.org/abs/1904.04346
video-captioning,🎬 Video Captioning: ICCV '15 paper implementation
User: pochih
video-captioning,Generating video descriptions using deep learning in Keras
User: rohit-gupta
video-captioning,Video to Text: Natural language description generator for some given video. [Video Captioning]
User: scopeinfinity
video-captioning,A curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)
User: terry-r123
video-captioning,Deep learning works for ADLxMLDS (CSIE 5431) in NTU
User: thtang
Home Page: https://www.csie.ntu.edu.tw/~yvchen/f106-adl/index.html
video-captioning,Auto transcribe tool based on whisper
User: tomchang25
video-captioning,A PyTorch implementation of EmpiricalMVM
User: tsujuifu
video-captioning,Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
User: txh-mercury
Home Page: https://arxiv.org/abs/2306.09085
video-captioning,[ICIP 2022] VLCap: Vision-Language with Contrastive Learning for Coherent Video Paragraph Captioning
Organization: uark-aicv
Home Page: https://ieeexplore.ieee.org/document/9897766
video-captioning,[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
Organization: uark-aicv
Home Page: https://uark-aicv.github.io/VLTinT/
video-captioning,Official code for Tell Me What You See: A Zero-Shot Action Recognition Method Based on Natural Language Descriptions (Multimedia Tools and Applications 2024)
User: valterlej
Home Page: https://link.springer.com/article/10.1007/s11042-023-16566-5
video-captioning,This repository contains the code for a video captioning system inspired by Sequence to Sequence -- Video to Text. This system takes as input a video and generates a caption in English describing the video.
User: vijayvee
video-captioning,A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or video retrieval.
User: willyfh
video-captioning,Source code for Delving Deeper into the Decoder for Video Captioning
User: wingsbrokenangel
video-captioning,implement video caption based on openNMT
User: xiadingz
video-captioning,pytorch implementation of video captioning
User: xiadingz
video-captioning,(TIP) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
User: yangbang18
video-captioning,X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
User: yehli
video-captioning,Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
User: zjr2000
video-captioning,Generating paragraph captions for videos
User: zubairhussain
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.