- CCKT-Det: Chuhan Zhang, Chaoyang Zhu, Pingcheng Dong, Long Chen, Dong Zhang. Cyclic Contrastive Knowledge Transfer for Open-Vocabulary Object Detection. ICLR 2025. [paper]
- Sambor: Xumeng Han, Longhui Wei, Xuehui Yu, Zhiyang Dou, Xin He, Kuiran Wang, Yingfei Sun, Zhenjun Han, Qi Tian. Boosting Segment Anything Model Towards Open-Vocabulary Learning. AAAI 2025. [paper]
- OV-DQUO: Junjie Wang, Bin Chen, Bin Kang, Yulin Li, YiChi Chen, Weizhi Xian, Huifeng Chang, Yong Xu. OV-DQUO: Open-Vocabulary DETR with Denoising Text Query Training and Open-World Unknown Objects Supervision. AAAI 2025. [paper] [code]
- Rohit Bharadwaj, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan. Enhancing Novel Object Detection via Cooperative Foundational Models. WACV 2025. [paper] [code]
- HD-OVD: Shenghao Fu, Junkai Yan, Qize Yang, Xihan Wei, Xiaohua Xie, Wei-Shi Zheng. A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection. TMM 2025. [paper]
- Hojun Choi, Junsuk Choe, Hyunjung Shim. Sampling Bag of Views for Open-Vocabulary Object Detection. arxiv 2024. [paper]
- OV-DINO: Hao Wang, Pengzhen Ren, Zequn Jie, Xiao Dong, Chengjian Feng, Yinlong Qian, Lin Ma, Dongmei Jiang, Yaowei Wang, Xiangyuan Lan, Xiaodan Liang. OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion. arxiv 2024. [paper] [code]
- NRAA: Sunyuan Qiang, Xianfei Li, Yanyan Liang, Wenlong Liao, Tao He, Pai Peng. Open-Vocabulary Object Detection via Neighboring Region Attention Alignment. arxiv 2024. [paper]
- AggDet: Yanhao Zheng, Kai Liu. Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation. arxiv 2024. [paper]
- OmDet-Turbo: Tiancheng Zhao, Peng Liu, Xuan He, Lu Zhang, Kyusong Lee. Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head. arxiv 2024. [paper] [code]
- MIPT: Guilin Li, Mengdan Zhang, Xiawu Zheng, Peixian Chen, Zihan Wang, Yunhang Shen, Mingchen Zhuge, Chenglin Wu, Fei Chao, Ke Li, Xing Sun, Rongrong Ji. Multimodal Inplace Prompt Tuning for Open-set Object Detectio. ACM MM 2024. [paper]
- CLIFF: Wuyang Li, Xinyu Liu, Jiayi Ma, Yixuan Yuan. CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection. ECCV 2024. [paper]
- OpenSight: Hu Zhang, Jianhua Xu, Tao Tang, Haiyang Sun, Xin Yu, Zi Huang, Kaicheng Yu. OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. ECCV 2024. [paper]
- CastDet: Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu. Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning. ECCV 2024. [paper] [code]
- MarvelOVD: Kuo Wang, Lechao Cheng, Weikai Chen, Pingping Zhang, Liang Lin, Fan Zhou, Guanbin Li. MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection. ECCV 2024. [paper] [code]
- LaMI-DETR: Penghui Du, Yu Wang, Yifan Sun, Luting Wang, Yue Liao, Gang Zhang, Errui Ding, Yan Wang, Jingdong Wang, Si Liu. LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction. ECCV 2024. [paper]
- T-Rex2: Qing Jiang, Feng Li, Zhaoyang Zeng, Tianhe Ren, Shilong Liu, Lei Zhang. T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy. ECCV 2024. [paper]
- YOLO-World: Tianheng Cheng, Lin Song, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan. YOLO-World: Real-Time Open-Vocabulary Object Detection. CVPR 2024. [paper] [code]
- SAS-Det: Shiyu Zhao, Samuel Schulter, Long Zhao, Zhixing Zhang, Vijay Kumar B.G, Yumin Suh, Manmohan Chandraker, Dimitris N. Metaxas. Taming Self-Training for Open-Vocabulary Object Detection. CVPR 2024. [paper] [code]
- LBP: Jiaming Li, Jiacheng Zhang, Jichang Li, Ge Li, Si Liu, Liang Lin, Guanbin Li. Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection. CVPR 2024. [paper]
- SHiNe: Mingxuan Liu, Tyler L. Hayes, Elisa Ricci, Gabriela Csurka, Riccardo Volpi. SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection. CVPR 2024. [paper]
- BIND: Heng Zhang, Qiuyu Zhao, Linyu Zheng, Hao Zeng, Zhiwei Ge, Tianhao Li, Sulong Xu. Exploring Region-Word Alignment in Built-in Detector for Open-Vocabulary Object Detection. CVPR 2024. [paper]
- DetCLIPv3: Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu. DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection. CVPR 2024. [paper]
- HyperLearner: Fanjie Kong, Yanbei Chen, Jiarui Cai, Davide Modolo. Hyperbolic Learning with Synthetic Captions for Open-World Detection. CVPR 2024. [paper]
- RALF: Jooyeon Kim, Eulrang Cho, Sehyung Kim, Hyunwoo J. Kim. Retrieval-Augmented Open-Vocabulary Object Detection. CVPR 2024. [paper] [code]
- InstaGen: Chengjian Feng, Yujie Zhong, Zequn Jie, Weidi Xie, Lin Ma. InstaGen: Enhancing Object Detection by Training on Synthetic Dataset. CVPR 2024. [paper] [code]
- Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Claudio Gennaro, Fabrizio Falchi. The Devil is in the Fine-Grained Details: Evaluating Open-Vocabulary Object Detectors for Fine-Grained Understanding. CVPR 2024. [paper] [code]
- Jang Hyun Cho, Philipp Krähenbühl. Language-conditioned Detection Transformer. CVPR 2024.[paper]
- WSOVOD: Jianghang Lin, Yunhang Shen, Bingquan Wang, Shaohui Lin, Ke Li, Liujuan Cao. Weakly Supervised Open-Vocabulary Object Detection. AAAI 2024. [paper]
- CLIM: Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Wentao Liu, Chen Change Loy. CLIM: Contrastive Language-Image Mosaic for Region Representation. AAAI 2024. [paper]
- SIC-CADS: Ruohuan Fang, Guansong Pang, Xiao Bai. Simple Image-level Classification Improves Open-vocabulary Object Detection. AAAI 2024. [paper] [code]
- ProxyDet: Joonhyun Jeong, Geondo Park, Jayeon Yoo, Hyungsik Jung, Heesu Kim. ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection. AAAI 2024. [paper] [code]
- OVDEval: Yiyang Yao, Peng Liu, Tiancheng Zhao, Qianqian Zhang, Jiajia Liao, Chunxin Fang, Kyusong Lee, Qing Wang. How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection. AAAI 2024. [paper] [code]
- Chau Pham, Truong Vu, Khoi Nguyen. LP-OVOD: Open-Vocabulary Object Detection by Linear Probing. WACV 2024. [paper]
- MMC-Det: Yifan Xu, Mengdan Zhang, Xiaoshan Yang, Changsheng Xu. Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection. TIP 2024. [paper]
- VTP-OVD: Yanxin Long, Jianhua Han, Runhui Huang, Xu Hang, Yi Zhu, Chunjing Xu, Xiaodan Liang. P3OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. TNNLS 2024. [paper]
- UOVN: Hengcan Shi, Munawar Hayat, Jianfei Cai. Unified Open-Vocabulary Dense Visual Prediction. TMM 2024. [paper]
- Chaoyang Zhu, Long Chen. A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future. TPAMI 2024. [paper]
- OpenSD: Shuai Li, Minghan Li, Pengfei Wang, Lei Zhang. OpenSD: Unified Open-Vocabulary Segmentation and Detection. arxiv 2024. [paper]
- PLAC: Sunghun Kang, Junbum Cha, Jonghwan Mun, Byungseok Roh, Chang D. Yoo. Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection. arxiv 2023. [paper]
- MIC: Zhao Wang, Aoxue Li, Fengwei Zhou, Zhenguo Li, Qi Dou. Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization. BMVC 2023. [paper]
- OWL-ST: Matthias Minderer, Alexey Gritsenko, Neil Houlsby. Scaling Open-Vocabulary Object Detection. NeurIPS 2023. [paper]
- CoDet: Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi. CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection. NeurIPS 2023. [paper] [code]
- SGDN: Hengcan Shi, Munawar Hayat, Jianfei Cai. Open-Vocabulary Object Detection via Scene Graph Discovery. ACM MM 2023. [paper]
- DE-ViT: Xinyu Zhang, Yuting Wang, Abdeslam Boularias. Detect Every Thing with Few Examples. GCPR 2023. [paper] [code]
- DITO: Dahun Kim, Anelia Angelova, Weicheng Kuo. Detection-Oriented Image-Text Pretraining for Open-Vocabulary Detection. arxiv 2023. [paper] [code]
- OpenSeeD: Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang. A Simple Framework for Open-Vocabulary Segmentation and Detection. ICCV 2023. [paper] [code]
- CFM-ViT: Dahun Kim, Anelia Angelova, Weicheng Kuo. Contrastive Feature Masking Open-Vocabulary Vision Transformer. ICCV 2023. [paper]
- EdaDet: Cheng Shi, Sibei Yang. EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment. ICCV 2023. [paper]
- CGG: Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy. Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. ICCV 2023. [paper] [code]
- Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng. What Makes Good Open-Vocabulary Detector: A Disassembling Perspective. KDD workshop 2023. [paper]
- SAS-Det: Shiyu Zhao, Samuel Schulter, Long Zhao, Zhixing Zhang, Vijay Kumar B. G, Yumin Suh, Manmohan Chandraker, Dimitris N. Metaxas. Improving Pseudo Labels for Open-Vocabulary Object Detection. arxiv 2023. [paper]
- Prannay Kaul, Weidi Xie, Andrew Zisserman. Multi-Modal Classifiers for Open-Vocabulary Object Detection. ICML 2023. [paper][code]
- Relja Arandjelović, Alex Andonian, Arthur Mensch, Olivier J. Hénaff, Jean-Baptiste Alayrac, Andrew Zisserman. Three Ways to Improve Feature Alignment for Open Vocabulary Eetection. arXiv 2023. [paper]
- Prompt-OVD: Hwanjun Song, Jihwan Bang. Prompt-Guided Transformers for End-to-End Open-Vocabulary Object Detection. arXiv 2023. [paper]
- PCL: Han-Cheol Cho, Won Young Jhoo, Wooyoung Kang, Byungseok Roh. Open-Vocabulary Object Detection using Pseudo Caption Labels. arXiv 2023. [paper]
- CORA: Xiaoshi Wu, Feng Zhu, Rui Zhao, Hongsheng Li. CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching. CVPR 2023. [paper] [code]
- OADP: Luting Wang, Yi Liu, Penghui Du, Zihan Ding, Yue Liao, Qiaosong Qi, Biaolong Chen, Si Liu. Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection. CVPR 2023. [paper] [code]
- BARON: Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy. Aligning Bag of Regions for Open-Vocabulary Object Detection. CVPR 2023. [paper] [code]
- RO-ViT: Dahun Kim, Anelia Angelova, Weicheng Kuo. Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers. CVPR 2023. [paper] [code]
- DetCLIPv2: Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu. DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CVPR 2023. [paper]
- CondHead: Tao Wang. Learning to Detect and Segment for Open Vocabulary Object Detection. CVPR 2023. [paper]
- F-VLM: Weicheng Kuo, Yin Cui, Xiuye Gu, AJ Piergiovanni, Anelia Angelova. F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models. ICLR 2023. [paper] [code]
- VLDet: Chuang Lin, Peize Sun, Yi Jiang, Ping Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai. Learning Object-Language Alignments for Open-Vocabulary Object Detection. ICLR 2023. [paper] [code]
- MEDet: Peixian Chen, Kekai Sheng, Mengdan Zhang, Yunhang Shen, Ke Li, Chunhua Shen. Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization. arXiv 2022. [paper] [code]
- LocOV: Maria A. Bravo, Sudhanshu Mittal, Thomas Brox. Localized Vision-Language Matching for Open-vocabulary Object Detection. DAGM German Conference on Pattern Recognition (GCPR) 2022. [paper] [code]
- Object-Centric-OVD: Hanoona Rasheed, Muhammad Maaz, Muhammad Uzair Khattak, Salman Khan, Fahad Shahbaz Khan. Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection. NeurIPS 2022. [paper] [code]
- VL-PLM: Shiyu Zhao, Zhixing Zhang, Samuel Schulter, Long Zhao, Vijay Kumar B.G, Anastasis Stathopoulos, Manmohan Chandraker, Dimitris Metaxas. Exploiting Unlabeled Data with Vision and Language Models for Object Detection. ECCV 2022. [paper] [code]
- PromptDet: Chengjian Feng, Yujie Zhong, Zequn Jie, Xiangxiang Chu, Haibing Ren, Xiaolin Wei, Weidi Xie, Lin Ma. PromptDet: Towards Open-vocabulary Detection using Uncurated Images. ECCV 2022. [paper] [code]
- OpenSeg: Golnaz Ghiasi, Xiuye Gu, Yin Cui, Tsung-Yi Lin. Scaling Open-Vocabulary Image Segmentation with Image-Level Labels. ECCV 2022. [paper] [code]
- OV-DETR: Yuhang Zang, Wei Li, Kaiyang Zhou, Chen Huang, Chen Change Loy. Open-Vocabulary DETR with Conditional Matching. ECCV 2022. [paper] [code]
- PB-OVD: Mingfei Gao, Chen Xing, Juan Carlos Niebles, Junnan Li, Ran Xu, Wenhao Liu, Caiming Xiong. Open Vocabulary Object Detection with Pseudo Bounding-Box Labels. ECCV 2022. [paper] [code]
- OWL-ViT: Matthias Minderer, Alexey Gritsenko, Austin Stone, Maxim Neumann, Dirk Weissenborn, Alexey Dosovitskiy, Aravindh Mahendran, Anurag Arnab, Mostafa Dehghani, Zhuoran Shen, Xiao Wang, Xiaohua Zhai, Thomas Kipf, Neil Houlsby. Simple Open-Vocabulary Object Detection with Vision Transformers. ECCV 2022. [paper] [code]
- RegionCLIP: Yiwu Zhong, Jianwei Yang, Pengchuan Zhang, Chunyuan Li, Noel Codella, Liunian Harold Li, Luowei Zhou, Xiyang Dai, Lu Yuan, Yin Li, Jianfeng Gao. RegionCLIP: Region-Based Language-Image Pretraining. CVPR 2022. [paper] [code]
- XPM: Dat Huynh, Jason Kuen, Zhe Lin, Jiuxiang Gu, Ehsan Elhamifar. Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling. CVPR 2022. [paper] [code]
- HierKD: Zongyang Ma, Guan Luo, Jin Gao, Liang Li, Yuxin Chen, Shaoru Wang, Congxuan Zhang, Weiming Hu. Open-Vocabulary One-Stage Detection With Hierarchical Visual-Language Knowledge Distillation. CVPR 2022. [paper] [code]
- DetPro: Yu Du, Fangyun Wei, Zihe Zhang, Miaojing Shi, Yue Gao, Guoqi Li. Learning to Prompt for Open-Vocabulary Object Detection with Vision-Language Model. CVPR 2022. [paper] [code]
- ViLD: Xiuye Gu, Tsung-Yi Lin, Weicheng Kuo, Yin Cui. Open-vocabulary Object Detection via Vision and Language Knowledge Distillation. ICLR 2022. [paper] [code]