共计 82 篇文章
2023
论文笔记 LanguageBind Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment 论文笔记 ImageBind One Embedding Space To Bind Them All 论文笔记 Video Event Restoration Based on Keyframes for Video Anomaly Detection 论文笔记 UnLoc A Unified Framework for Video Localization Tasks 论文笔记 VidChapters-7M Video Chapters at Scale Video Captioning 论文笔记 Vid2Seq Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning 论文笔记 SoccerNet-Caption Dense Video Captioning for Soccer Broadcasts Commentaries 论文笔记 Human-centric Behavior Description in Videos New Benchmark and Model 论文笔记 UCF-Crime Annotation A Benchmark for Surveillance Video-and-Language Understanding 论文笔记 A New Comprehensive Benchmark for Semi-supervised Video Anomaly Detection and Anticipation