标签 - Video Captioning

论文笔记 VidChapters-7M Video Chapters at Scale Video Captioning 11-06

论文笔记 Vid2Seq Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning 11-05

论文笔记 Human-centric Behavior Description in Videos New Benchmark and Model 10-26

论文笔记 UCF-Crime Annotation A Benchmark for Surveillance Video-and-Language Understanding 10-23

基于梯度下降算法的Zero-shot Captioning方法 09-08

论文笔记 Zero-Shot Scene Graph Relation Prediction through Commonsense Knowledge Integration 02-08

End-to-end Generative Pretraining for Multimodal Video Captioning 论文笔记 10-12

GIT A Generative Image-to-text Transformer for Vision and Language 论文笔记 10-11

SwinBERT End-to-End Transformers with Sparse Attention for Video Captioning 论文笔记 10-10

Open-book Video Captioning with Retrieve-Copy-Generate Network论文笔记 09-30