Repositories tagged with "video-retrieval"
InternVideo
OpenGVLab
โ[ECCV2024] Video Foundation Models & Data for Multimodal Understandingโ
ClipBERT
jayleicn
โ[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks. โ
VLM2Vec
TIGER-AI-Lab
โThis repo contains the code for "VLM2Vec" [ICLR 2025], "VLM2Vec-V2 [TMLR 2026]", and "MMEB-V3"โ
MiniGPT4-video
Vision-CAIR
โOfficial code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding โ
PreenCut
roothch
โAI-Powered Video Retrieval & Clipping Toolโ
moment_detr
โ[NeurIPS 2021] Moment-DETR code and QVHighlights datasetโ
collaborative-experts
albanie
โVideo embeddings for retrieval with natural language queriesโ
Youku-mPLUG
X-PLUG
โYouku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarksโ
QD-DETR
wjun0830
โOfficial pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)โ
mPLUG-2
โmPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)โ
visil
MKLab-ITI
โAuthors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]โ
TVRetrieval
โ[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrievalโ
EMCL
jpthu17
โ[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representationsโ
DiffusionRet
โ[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Modelโ
pytorch_violet
tsujuifu
โA PyTorch implementation of VIOLETโ
HBI
โ[CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learningโ
ndvr-dml
โAuthors official Tensorflow implementation of the "Near-Duplicate Video Retrieval with Deep Metric Learning" [ICCVW 2017]โ
HiREST
j-min
โHierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)โ