Paper Notes (76)
Paper Notes: Language-conditioned Detection Transformer
Paper Notes: AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation
Paper Notes: RWKV: Reinventing RNNs for the Transformer Era
Paper Notes: MLP-Mixer: An all-MLP Architecture for Vision
Paper Notes: Proposal-based Multiple Instance Learning for Weakly-supervised Temporal Action Localization
Paper Notes: ActionFormer: Localizing Moments of Actions with Transformers
Paper Notes: InternVideo: General Video Foundation Models via Generative and Discriminative Learning
Paper Notes: Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization
Paper Notes: Multi-modal Prompting for Low-Shot Temporal Action Localization
Paper Notes: BatchNorm-based Weakly Supervised Video Anomaly Detection
Study Notes (47)