Note
This is a Hugging Face dataset. Learn how to load datasets from the Hub in the Hugging Face integration docs.
Dataset Card for PKLot#

PKLot is a robust dataset for parking lot classification containing 12,416 images captured from three different parking lots (PUCPR, UFPR04, UFPR05) under various weather conditions (sunny, cloudy, rainy). Each image includes detailed annotations for individual parking spaces with occupancy status, resulting in approximately 695,900 segmented parking space instances.
This is a FiftyOne dataset with 12,416 samples.
Installation#
If you haven’t already, install FiftyOne:
pip install -U fiftyone
Usage#
import fiftyone as fo
from fiftyone.utils.huggingface import load_from_hub
# Load the dataset
# Note: other available arguments include 'max_samples', etc
dataset = load_from_hub("Voxel51/PKLot")
# Launch the App
session = fo.launch_app(dataset)
Dataset Details#
Dataset Description#
The PKLot dataset is a comprehensive parking lot classification dataset designed for computer vision research in parking space detection and occupancy classification. The dataset contains:
12,416 high-resolution images (1280Ă—720 pixels)
3 different parking lots: PUCPR (PontifĂcia Universidade CatĂłlica do Paraná), UFPR04, and UFPR05 (Federal University of Paraná)
3 weather conditions: Sunny, Cloudy, and Rainy
Time-series data: Images captured at 5-minute intervals throughout different days
~695,900 parking space instances: Each image contains 45-100 annotated parking spaces
Rich annotations: Each parking space includes polygon boundaries and occupancy status
The dataset is particularly valuable for:
Parking space detection algorithms
Occupancy classification models
Temporal analysis of parking patterns
Weather-robust computer vision systems
Smart city and intelligent transportation system research
Curated by: Paulo R. L. de Almeida, Luiz S. Oliveira, Alceu S. Britto Jr., Eunelson J. Silva Jr., Alessandro L. Koerich
Funded by [optional]: Federal University of Paraná (UFPR) and PontifĂcia Universidade CatĂłlica do Paraná (PUCPR)
Shared by [optional]: Vision, Robotics and Imaging Laboratory (VRI) - UFPR
Language(s) (NLP): Not applicable (computer vision dataset)
License: Creative Commons Attribution 4.0 International License
Dataset Sources#
Uses#
Direct Use#
The PKLot dataset is intended for:
Parking Space Detection: Training and evaluating algorithms to detect individual parking spaces in aerial/surveillance imagery
Occupancy Classification: Developing models to classify parking spaces as occupied or vacant
Temporal Analysis: Studying parking patterns over time and predicting future occupancy
Weather Robustness: Testing computer vision models under different weather conditions
Smart Parking Systems: Developing real-time parking availability systems
Benchmark Dataset: Comparing performance of different parking detection algorithms
Out-of-Scope Use#
This dataset should not be used for:
Identifying individuals or vehicles (images are not high-resolution enough for identification)
Real-time commercial applications without proper validation
Training models for different parking lot layouts without additional data
Applications requiring night-time or low-light conditions (dataset only contains daylight images)
Dataset Structure#
FiftyOne Dataset Fields#
Each sample in the FiftyOne dataset contains the following fields:
Field |
Type |
Description |
|---|---|---|
|
string |
Path to the image file |
|
string |
Parking lot identifier ( |
|
Classification |
Weather condition label ( |
|
date |
Date of image capture (YYYY-MM-DD) |
|
datetime |
Full timestamp of capture (YYYY-MM-DD HH:MM:SS) |
|
Polylines |
Collection of parking space polygons |
Parking Space Annotations (Polylines)#
Each parking space polyline contains:
Attribute |
Type |
Description |
|---|---|---|
|
string |
Always “parking_space” |
|
list |
Normalized polygon vertices [[x,y], …] in [0,1] range |
|
int |
Unique parking space ID (1-100) |
|
bool |
True (parking spaces are closed polygons) |
|
bool |
True (for visualization as filled polygons) |
|
string |
“occupied”, “not occupied”, or “unknown” |
|
int |
Parking space identifier |
Dataset Statistics#
Total Samples: 12,416 images
Parking Lots Distribution:
PUCPR: ~4,474 images
UFPR04: ~3,791 images
UFPR05: ~4,152 images
Weather Distribution:
Sunny: ~50% of images
Cloudy: ~35% of images
Rainy: ~15% of images
Temporal Coverage: September 2012 - April 2013
Capture Frequency: 5-minute intervals
Dataset Creation#
Curation Rationale#
The PKLot dataset was created to address the lack of robust, publicly available datasets for parking lot classification research. Key motivations included:
Standardized Benchmark: Providing a common dataset for comparing parking detection algorithms
Real-World Conditions: Capturing diverse weather conditions and lighting variations
Temporal Dynamics: Understanding parking patterns over time
Scale: Offering sufficient data for training deep learning models
Reproducible Research: Enabling researchers to compare results on the same dataset
Source Data#
Data Collection and Processing#
The data collection process involved:
Camera Setup: Fixed surveillance cameras installed at three parking lots
Capture Protocol: Automatic image capture every 5 minutes during daylight hours
Weather Diversity: Deliberate collection across different weather conditions
Time Period: Data collected from September 2012 to April 2013
Image Resolution: All images captured at 1280Ă—720 pixels
Quality Control: Manual verification of image quality and weather labels
Who are the source data producers?#
The data was produced by researchers at:
Federal University of Paraná (UFPR), Brazil
PontifĂcia Universidade CatĂłlica do Paraná (PUCPR), Brazil
Vision, Robotics and Imaging Laboratory (VRI)
Annotations#
Annotation process#
The annotation process consisted of:
Parking Space Delineation: Manual marking of parking space boundaries using rotated rectangles and polygons
Occupancy Labeling: Binary classification (0=vacant, 1=occupied) for each parking space
XML Format: Annotations stored in XML files with both rotated rectangle and contour representations
Consistency: Same parking space IDs maintained across all images from the same parking lot
Validation: Cross-checking of annotations for accuracy
Who are the annotators?#
Annotations were created by the research team at the Vision, Robotics and Imaging Laboratory (VRI) at UFPR, with quality control and validation performed by multiple team members.
Personal and Sensitive Information#
The dataset contains surveillance imagery of parking lots but:
Images are taken from elevated positions at resolution insufficient for personal identification
No license plates or individual features are distinguishable
Focus is on parking space occupancy, not vehicle or person identification
The dataset complies with privacy regulations for public space surveillance
Bias, Risks, and Limitations#
Known Limitations#
Geographic Bias: All data from two universities in Curitiba, Brazil
Temporal Bias: Limited to daylight hours (approximately 6 AM to 7 PM)
Seasonal Bias: Data from September 2012 to April 2013 only
Weather Distribution: Unbalanced weather conditions (more sunny than rainy days)
Parking Lot Types: Only university parking lots, may not generalize to other environments
Camera Angles: Fixed camera positions, limited viewpoint diversity
Technical Limitations#
No night-time or low-light conditions
No snow or extreme weather conditions
Fixed parking space layouts (no dynamic spaces)
Resolution limitations for fine-grained vehicle classification
Recommendations#
Users should be aware that:
Generalization: Models trained on this dataset may need adaptation for different geographic locations or parking lot types
Lighting Conditions: Additional data may be needed for 24-hour operation systems
Real-time Deployment: Validation on target deployment environment is essential
Privacy Considerations: Ensure compliance with local regulations when deploying models
Weather Robustness: Test model performance across all weather conditions in the dataset
Citation#
BibTeX:
@article{almeida2015pklot,
title={PKLot--A robust dataset for parking lot classification},
author={Almeida, Paulo and Oliveira, Luiz S and Silva Jr, Eunelson and Britto Jr, Alceu and Koerich, Alessandro},
journal={Expert Systems with Applications},
volume={42},
number={11},
pages={4937--4949},
year={2015},
publisher={Elsevier}
}
APA:
Almeida, P. R., Oliveira, L. S., Britto Jr, A. S., Silva Jr, E. J., & Koerich, A. L. (2015). PKLot–A robust dataset for parking lot classification. Expert Systems with Applications, 42(11), 4937-4949.
Glossary#
Parking Space: Individual parking slot/bay in a parking lot
Occupancy Status: Binary classification of whether a parking space contains a vehicle
Polyline: Closed polygon defining the boundary of a parking space
Rotated Rectangle: Bounding box with rotation angle for non-axis-aligned parking spaces
Normalized Coordinates: Coordinates scaled to [0,1] range relative to image dimensions
More Information#
For more information about the dataset, visit the official PKLot page or read the original paper.
Dataset Card Contact#
For questions about the original PKLot dataset, please contact the Vision, Robotics and Imaging Laboratory at UFPR.