🔥🔥🔥 This repository lists some awesome public object detection and recognition datasets.
- Awesome-Object-Detection-Datasets
- Summary
- General Detection and Recognition Datasets
- Autonomous Driving Datasets
- Adverse Weather Datasets
- Person Detection Datasets
- Anti-UAV Datasets
- Optical Aerial Imagery Datasets
- Low-light Image Datasets
- Infrared Image Datasets
- SAR Image Datasets
- Multispectral Image Datasets
- 3D Object Detection Datasets
- Vehicle-to-Everything Field Datasets
- Super-Resolution Field Datasets
- Face Detection and Recognition Datasets
- Blogs
-
-
wenhwu/awesome-remote-sensing-change-detection : List of datasets, codes, and contests related to remote sensing change detection.
-
ZHOUYI1023/awesome-radar-perception : A curated list of radar datasets, detection, tracking and fusion.
-
lartpang/awesome-segmentation-saliency-dataset : A collection of some datasets for segmentation / saliency detection. Welcome to PR...😄
-
TianhaoFu/Awesome-3D-Object-Detection : Papers, code and datasets about deep learning for 3D Object Detection.
-
xahidbuffon/Awesome_Underwater_Datasets : Pointers to large-scale underwater datasets and relevant resources.
-
M-3LAB/awesome-industrial-anomaly-detection : Paper list and datasets for industrial image anomaly detection.
-
ZhangXiwuu/Awesome_visual_place_recognition_datasets : A curated list of Visual Place Recognition (VPR)/ loop closure detection (LCD) datasets.
-
ari-dasci/OD-WeaponDetection : Datasets for weapon detection based on image classification and object detection tasks.
-
DLLXW/objectDetectionDatasets : 目标检测数据集制作:VOC,COCO,YOLO等常用数据集格式的制作和互相转换脚本。
-
codingonion/awesome-object-detection-and-recognition-datasets : A collection of some awesome public object detection and recognition datasets.
-
-
-
OpenDataLab : OpenDataLab 是上海人工智能实验室的大模型数据基座团队打造的数据开放平台,现已成为中国大模型语料数据联盟开源数据服务指定平台,为开发者提供全链条的 AI 数据支持,应对和解决数据处理中的风险与挑战,推动 AI 研究及应用。
-
Science Data Bank(ScienceDB) : Make your research data citable, discoverable and persistently accessible Satisfy flexible data sharing requirements Dedicate to facilitating data dissemination and reusing. Science Data Bank (ScienceDB) is a public, general-purpose data repository aiming to provide data services (e.g. data acquisition, long-term preservation, publishing, sharing and access) for researchers, research projects/teams, journals, institutions, universities, etc. It supports a variety of data acquisition and data licenses. ScienceDB is dedicated to promoting data findable, citable and reusable on the prerequisite of protecting the rights and interests of data owners and it is built and operated by Computer Network Information Center, Chinese Academy of Sciences.
-
中国科学数据 : 《中国科学数据(中英文网络版)》(China Scientific Data)(CN11-6035/N,ISSN 2096-2223)是目前中国唯一的专门面向多学科领域科学数据出版的学术期刊,作为国家网络连续型出版物的首批试点之一,由中国科学院主管,中国科学院计算机网络信息中心和ISC CODATA中国全国委员会合办,国家科技基础条件平台中心、中国科学院网络安全和信息化领导小组办公室指导,国内外公开发行,中英文,季刊。 中国科学引文数据库(CSCD)来源期刊,中国科技核心期刊 ,收录于中国科协高质量科技期刊分级目录。
-
飞桨AI Studio : 飞桨AI Studio开放数据集。
-
极市开发者平台 : 极市开发者平台开放数据集。
-
openvinotoolkit/datumaro : Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
-
-
-
-
Label Studio : Label Studio is a multi-type data labeling and annotation tool with standardized output format. labelstud.io
-
AnyLabeling : Effortless data labeling with AI support from YOLO and Segment Anything! AnyLabeling = LabelImg + Labelme + Improved UI + Auto-labeling.
-
LabelImg : 🖍️ LabelImg is a graphical image annotation tool and label object bounding boxes in images.
-
labelme : Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
-
DarkLabel : Video/Image Labeling and Annotation Tool.
-
AlexeyAB/Yolo_mark : GUI for marking bounded boxes of objects in images for training neural network Yolo v3 and v2.
-
Cartucho/OpenLabeling : Label images and video for Computer Vision applications.
-
CVAT : Computer Vision Annotation Tool (CVAT). Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
-
VoTT : Visual Object Tagging Tool: An electron app for building end to end Object Detection Models from Images and Videos.
-
WangRongsheng/KDAT : 一个专为视觉方向目标检测全流程的标注工具集,全称:Kill Object Detection Annotation Tools。
-
Rectlabel-support : RectLabel - An image annotation tool to label images for bounding box object detection and segmentation.
-
cnyvfang/labelGo-Yolov5AutoLabelImg : 💕YOLOV5 semi-automatic annotation tool (Based on labelImg)💕一个基于labelImg及YOLOV5的图形化半自动标注工具。
-
CVUsers/Auto_maker : 深度学习数据自动标注器开源 目标检测和图像分类(高精度高效率)。
-
MyVision : Computer vision based ML training data generation tool 🚀
-
wufan-tb/AutoLabelImg : auto-labelimg based on yolov5, with many other useful tools. AutoLabelImg 多功能自动标注工具。
-
MrZander/YoloMarkNet : Darknet YOLOv2/3 annotation tool written in C#/WPF.
-
mahxn0/Yolov3_ForTextLabel : 基于yolov3的目标/自然场景文字自动标注工具。
-
MNConnor/YoloV5-AI-Label : YoloV5 AI Assisted Labeling.
-
LILINOpenGitHub/Labeling-Tool : Free YOLO AI labeling tool. YOLO AI labeling tool is a Windows app for labeling YOLO dataset.
-
whs0523003/YOLOv5_6.1_autolabel : YOLOv5_6.1 自动标记目标框。
-
2vin/PyYAT : Semi-Automatic Yolo Annotation Tool In Python.
-
AlturosDestinations/Alturos.ImageAnnotation : A collaborative tool for labeling image data for yolo.
-
stephanecharette/DarkMark : Marking up images for use with Darknet.
-
2vin/yolo_annotation_tool : Annotation tool for YOLO in opencv.
-
sanfooh/quick_yolo2_label_tool : yolo快速标注工具 quick yolo2 label tool.
-
folkien/yaya : YAYA - Yet annother YOLO annoter for images (in QT5). Support yolo format, image modifications, labeling and detecting with previously trained detector.
-
pylabel-project/pylabel : Python library for computer vision labeling tasks. The core functionality is to translate bounding box annotations between different formats-for example, from coco to yolo.
-
opendatalab/labelU : Uniform, Unlimited, Universal and Unbelievable Annotation Toolbox.
-
-
-
Albumentations : Albumentations is a Python library for image augmentation. Image augmentation is used in deep learning and computer vision tasks to increase the quality of trained models. The purpose of image augmentation is to create new training samples from the existing data. "Albumentations: Fast and Flexible Image Augmentations". (Information 2020)
-
doubleZ0108/Data-Augmentation : General Data Augmentation Algorithms for Object Detection(esp. Yolo).
-
-
- YOLOExplorer : YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds. Explore, manipulate and iterate on Computer Vision datasets with precision using simple APIs. Supports SQL filters, vector similarity search, native interface with Pandas and more.
-
-
-
COCO : "Microsoft COCO: Common Objects in Context". (ECCV 2014)
-
PASCAL VOC : "The Pascal Visual Object Classes Challenge: A Retrospective". (IJCV 2015)
-
Objects365 : "Objects365: A Large-scale, High-quality Dataset for Object Detection". (ICCV 2019)
-
[V3Det](The dataset will be publicly available by June 2023.) : "V3Det: Vast Vocabulary Visual Detection Dataset". (arXiv 2023)
-
-
-
TT100K : "Traffic-Sign Detection and Classification in the Wild". (CVPR 2016)
-
CCTSDB : CSUST Chinese Traffic Sign Detection Benchmark 中国交通数据集由长沙理工大学综合交通运输大数据智能处理湖南省重点实验室张建明老师团队制作完成。 "A Real-Time Chinese Traffic Sign Detection Algorithm Based on Modified YOLOv2". (Algorithms, 2017)
-
CCTSDB2021 : "CCTSDB 2021: a more comprehensive traffic sign detection benchmark". (Human-centric Computing and Information Sciences, 2022)
-
- RESID : "Benchmarking Single-Image Dehazing and Beyond". (IEEE Transactions on Image Processing 2018)
-
INRIA Person : "Histograms of oriented gradients for human detection". (CVPR 2005)
-
CrowdHuman : "CrowdHuman: A Benchmark for Detecting Human in a Crowd". (arXiv 2018)
-
PANDA : "PANDA: A Gigapixel-Level Human-Centric Video Dataset". (CVPR 2020)
-
TinyPerson : "Scale Match for Tiny Person Detection". (WACV 2020)
-
TinyPerson v2 | SeaPerson : "Object Localization Under Single Coarse Point Supervision". (CVPR 2022)
- Anti-UAV : 🔥🔥Official Repository for Anti-UAV🔥🔥. (arXiv 2023)
-
COWC : "A large contextual dataset for classification, detection and counting of cars with deep learning". (ECCV 2016)
-
RSOD : "Accurate object localization in remote sensing images based on convolutional neural networks". (IEEE TGRS 2017)
-
LEVIR : "Random access memories: A new paradigm for target detection in high resolution aerial remote sensing images". (IEEE Transactions on Image Processing 2017)
-
LEVIR-Ship : "A Degraded Reconstruction Enhancement-based Method for Tiny Ship Detection in Remote Sensing Images with A New Large-scale Dataset". (IEEE TGRS 2022)
-
MASATI : "Automatic ship classification from optical aerial images with convolutional neural networks". (Remote Sensing 2018)
-
xView : "xView: Objects in Context in Overhead Imagery". (arXiv 2018)
-
DOTA : "DOTA: A Large-Scale Dataset for Object Detection in Aerial Images". (CVPR 2018). "Object Detection in Aerial Images: A Large-Scale Benchmark and Challenges". (IEEE TPAMI 2021).
-
ITCVD : "Deep Learning for Vehicle Detection in Aerial Images". (IEEE ICIP 2018)
-
Bridge Dataset : "A Tool for Bridge Detection in Major Infrastructure Works Using Satellite Images". (IEEE ICIP 2018)
-
DIOR : "Object detection in optical remote sensing images: A survey and a new benchmark". (ISPRS 2020)
-
PESMOD : "UAV Images Dataset for Moving Object Detection from Moving Cameras". (arXiv 2021)
-
AI-TOD : "Tiny Object Detection in Aerial Images". (IEEE ICPR 2021)
-
RsCarData : "DSFNet: Dynamic and Static Fusion Network for Moving Object Detection in Satellite Videos". (IEEE GRSL 2021)
-
VISO : "Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark". (IEEE TGRS 2021)
-
VisDrone : "Detection and Tracking Meet Drones Challenge". (IEEE TPAMI 2021)
-
FAIR1M : "FAIR1M: A benchmark dataset for fine-grained object recognition in high-resolution remote sensing imagery". (ISPRS 2021)
-
SeaDronesSee : "SeaDronesSee: A Maritime Benchmark for Detecting Humans in Open Water". (WACV 2022)
-
NightOwls : "NightOwls: A Pedestrians at Night Dataset". (ACCV 2018).
-
ExDark : "Getting to know low-light images with the exclusively dark dataset". (CVIU 2019). "Low-light image enhancement using Gaussian Process for features retrieval". (Signal Processing: Image Communication, 2019).
-
DARK FACE : DARK FACE: Face Detection in Low Light Condition. "Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study". (IEEE Transactions on Image Processing 2020).
-
SCUT_FIR_Pedestrian_Dataset : "Benchmarking a large-scale FIR dataset for on-road pedestrian detection". (Infrared Physics & Technology, 2019)
-
NUDT-SIRST : "Dense Nested Attention Network for Infrared Small Target Detection". (arXiv 2021)
-
SIRST : "Asymmetric Contextual Modulation for Infrared Small Target Detection". (WACV 2021)
-
SNL VideoSAR : "Developments in sar and ifsar systems and technologies at sandia national laboratories". (IEEE Aerospace Conference Proceedings, 2003)
-
MSTAR : MSTAR public dataset. "Object recognition results using MSTAR synthetic aperture radar data". (IEEE CVBVS 2000)
-
OpenSARShip : "OpenSARShip: A Dataset Dedicated to Sentinel-1 Ship Interpretation". (IEEE JSTAEORS 2017)
-
OpenSARShip 2.0 : "OpenSARShip 2.0: A large-volume dataset for deeper interpretation of ship targets in Sentinel-1 imagery". (IEEE BIGSARDATA 2017)
-
SSDD : "Ship detection in SAR images based on an improved faster R-CNN". (IEEE BIGSARDATA 2017). "基于深度学习的SAR图像舰船检测数据集及性能分析". (第五届高分辨率对地观测学术年会, 2018)
-
AIR-SARShip : "高分辨率SAR舰船检测数据集-2.0". "AIR-SARShip-1.0: 高分辨率 SAR 舰船检测数据集". (雷达学报 2019)
-
SAR-Ship-Dataset : "A SAR Dataset of Ship Detection for Deep Learning under Complex Backgrounds". (Remote Sensing, 2019)
-
OpenSARUrban : "OpenSARUrban: A Sentinel-1 SAR Image Dataset for Urban Interpretation". (IEEE JSTAEORS 2020)
-
HRSID : "HRSID: A High-Resolution SAR Images Dataset for Ship Detection and Instance Segmentation". (IEEE Access 2020)
-
FUSAR-Ship : 高分辨率船只数据集FUSAR-Ship1.0. (雷达学报). "FUSAR-Ship: building a high-resolution SAR-AIS matchup dataset of Gaofen-3 for ship detection and recognition". (Science China Information Sciences, 2020)
-
Official-SSDD : "SAR Ship Detection Dataset (SSDD): Official Release and Comprehensive Data Analysis ". (Remote Sensing, 2021)
-
FLIR_ADAS : Teledyne FLIR Free ADAS Thermal Dataset v2.
-
VEDAI : "Vehicle Detection in Aerial Imagery: A small target detection benchmark". (Journal of Visual Communication and Image Representation 2015)
-
KAIST_rgbt : "Multispectral Pedestrian Detection: Benchmark Dataset and Baseline". (CVPR 2015)
-
TNO : "The TNO multiband image data collection". (Data in brief, 2017)
-
MFNet : MFNet-pytorch, image semantic segmentation using RGB-Thermal images. "MFNet: Towards real-time semantic segmentation for autonomous vehicles with multi-spectral scenes". (IROS 2017). (MFNet Dataset : Multi-spectral Object Detection and Semantic Segmentation Datasets)
-
LLVIP : "LLVIP: A Visible-Infrared Paired Dataset for Low-Light Vision". (ICCV 2021)
-
MSRS : MSRS: Multi-Spectral Road Scenarios for Practical Infrared and Visible Image Fusion. "PIAFusion : A progressive infrared and visible image fusion network based on illumination aware". (Information Fusion, 2022)
-
TarDAL : "Target-Aware Dual Adversarial Learning and a Multi-Scenario Multi-Modality Benchmark To Fuse Infrared and Visible for Object Detection". (CVPR 2022). (M3FD Dataset)
-
DroneVehicle : "Drone-based RGB-Infrared Cross-Modality Vehicle Detection via Uncertainty-Aware Learning". (IEEE TCSVT 2022)
- Objectron : "Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations". (CVPR, 2021)
-
OpenCOOD|OPV2V : OpenCOOD is an Open COOperative Detection framework for autonomous driving. It is also the official implementation of the ICRA 2022 paper OPV2V. "OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication". (ICRA, 2022). mobility-lab.seas.ucla.edu/opv2v/
-
CoBEVT : "CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers". (CoRL, 2022).
-
Where2comm : "Where2comm: Communication-Efficient Collaborative Perception via Spatial Confidence Maps". (Neurips, 2022).
-
PJLab-ADG/LiDARSimLib-and-Placement-Evaluation : "Analyzing Infrastructure LiDAR Placement with Realistic LiDAR Simulation Library". (ICRA, 2023).
-
CoAlign : "Robust Collaborative 3D Object Detection in Presence of Pose Errors". (ICRA, 2023).
-
V2V4Real : "V2V4Real: A Real-World Large-Scale Dataset for Vehicle-to-Vehicle Cooperative Perception". (CVPR, 2023).
-
V2X-ViT|V2XSet : "V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer". (ECCV, 2022).
-
DAIR-V2X : "DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection". (CVPR, 2022). 全球首个车路协同自动驾驶数据集发布
-
V2X-Seq : "V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting". (CVPR, 2023). 全球首个大规模时序车路协同自动驾驶数据集发布
- VideoLQ : "Investigating Tradeoffs in Real-World Video Super-Resolution". (CVPR, 2022)
-
-
WIDER FACE : "WIDER FACE: A Face Detection Benchmark". (CVPR 2016)
-
UFDD : Unconstrained Face Detection Dataset(UFDD). "Pushing the Limits of Unconstrained Face Detection: a Challenge Dataset and Baseline Results". (IEEE BTAS 2018)
-
-
-
LFW : Labeled Faces in the Wild(LFW). "Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments". (Workshop on faces in'Real-Life'Images: detection, alignment, and recognition. 2008)
-
YouTube Faces (YTF) : "Face recognition in unconstrained videos with matched background similarity". (CVPR 2011)
-
CASIA-WebFace : "Learning Face Representation from Scratch". (arXiv 2014)
-
IJB-A : "Pushing the Frontiers of Unconstrained Face Detection and Recognition: IARPA Janus Benchmark A". (CVPR 2015)
-
MS-Celeb-1M : "MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition". (ECCV 2016)
-
MegaFace : "The MegaFace Benchmark: 1 Million Faces for Recognition at Scale". (CVPR 2016)
-
UMDFaces : "UMDFaces: An annotated face dataset for training deep networks". (IJCB 2017)
-
IJB-C : "IARPA Janus Benchmark - C: Face Dataset and Protocol". (ICB 2018)
-
VGGFace2 : "VGGFace2: A Dataset for Recognising Faces across Pose and Age". (FG 2018)
-
- 微信公众号「PandaCVer」
- 微信公众号「自动驾驶之心」
- 微信公众号「整数智能AI研究院」