学习笔记
41
论文笔记
26
论文笔记 X-VLM Multi-Grained Vision Language Pre-Training Aligning Texts with Visual Concepts
图神经网络学习笔记:从GCN到GAT再到Relation-aware GNN
《知识增强的预训练语言模型》论文简单翻译
DEKCOR:使用外部知识来进行常识QA任务
Detecting Twenty-thousand Classes using Image-level Supervision论文笔记(以及目标检测基础知识)
知识图谱(Knowledge Graph)与计算机视觉(Computer Vision)结合初见笔记
SmallCap:Lightweight Image Captioning Prompted with Retrieval Augmentation 论文笔记
End-to-end Generative Pretraining for Multimodal Video Captioning 论文笔记
GIT A Generative Image-to-text Transformer for Vision and Language 论文笔记
SwinBERT End-to-End Transformers with Sparse Attention for Video Captioning 论文笔记
More...