174 articles in total
2023
Paper notes: XCLIP: Expanding Language-Image Pretrained Models for General Video Recognition
Paper notes: BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Paper notes: Zero-Shot Scene Graph Relation Prediction through Commonsense Knowledge Integration
Paper notes: mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections
Paper notes: X-VLM: Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts
Graph neural network study notes: from GCN to GAT to Relation-aware GNN
A rough translation of the paper "Knowledge-Enhanced Pre-trained Language Models"
2022
DEKCOR: using external knowledge for commonsense QA tasks
Paper notes: Detecting Twenty-thousand Classes using Image-level Supervision (plus object detection basics)
First-look notes on combining Knowledge Graph and Computer Vision