Kamino

111 posts in total


2023

  • Paper notes: CoCa and VideoCoCa (03-21)
  • Paper notes: STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training (03-20)
  • Paper notes: mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (03-18)
  • How CIDEr, a common image captioning metric, works (03-16)
  • Paper notes: Self-critical Sequence Training for Image Captioning (03-16)
  • Study notes: the Gumbel-Softmax distribution (03-15)
  • Paper notes: two papers analyzing multi-head attention (03-13)
  • Paper notes: XCLIP Expanding Language-Image Pretrained Models for General Video Recognition (03-01)
  • Paper notes: BLIP-2 Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models (02-17)
  • Paper notes: Zero-Shot Scene Graph Relation Prediction through Commonsense Knowledge Integration (02-08)