Kamino
  • 首页
  • 归档
  • 分类
  • 标签
  • 关于
  • 友链
  •   
  •   

共计 15 篇文章


2023

论文笔记 VidChapters-7M Video Chapters at Scale Video Captioning 11-06 论文笔记 Vid2Seq Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning 11-05 论文笔记 Human-centric Behavior Description in Videos New Benchmark and Model 10-26 论文笔记 UCF-Crime Annotation A Benchmark for Surveillance Video-and-Language Understanding 10-23 基于梯度下降算法的Zero-shot Captioning方法 09-08 论文笔记 Zero-Shot Scene Graph Relation Prediction through Commonsense Knowledge Integration 02-08

2022

End-to-end Generative Pretraining for Multimodal Video Captioning 论文笔记 10-12 GIT A Generative Image-to-text Transformer for Vision and Language 论文笔记 10-11 SwinBERT End-to-End Transformers with Sparse Attention for Video Captioning 论文笔记 10-10 Open-book Video Captioning with Retrieve-Copy-Generate Network论文笔记 09-30
12

搜索

Hexo Fluid
载入天数... 载入时分秒...
总访问量 次 总访客数 人