Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Deep Learning
  3. LLVIP

LLVIP

Jupyter Notebookv1.0

A visible-infrared paired dataset for low-light vision tasks like pedestrian detection, image fusion, and image-to-image translation.

GitHubGitHub
819 stars74 forks0 contributors

What is LLVIP?

LLVIP is a visible-infrared paired dataset designed for low-light vision research, containing over 30,000 aligned image pairs with pedestrian annotations. It addresses the challenge of poor visibility in dark environments by providing thermal infrared data that highlights human subjects, enabling tasks like pedestrian detection, image fusion, and cross-modal translation. The dataset serves as a benchmark for developing and evaluating algorithms that leverage multimodal imaging.

Target Audience

Computer vision researchers and practitioners working on low-light applications, multimodal learning, pedestrian detection, or image fusion. It's particularly relevant for those developing algorithms for surveillance, autonomous driving, or night-vision systems.

Value Proposition

LLVIP offers a unique large-scale collection of precisely aligned visible-infrared pairs with clean annotations, filling a gap in publicly available low-light datasets. Its inclusion of baseline implementations and tools lowers the barrier to entry, while the Kaggle competition fosters community engagement and benchmarking.

Overview

LLVIP: A Visible-infrared Paired Dataset for Low-light Vision

Use Cases

Best For

  • Training pedestrian detection models for low-light surveillance systems
  • Researching image fusion techniques between visible and infrared modalities
  • Developing image-to-image translation models for night-vision enhancement
  • Benchmarking object detection algorithms in challenging lighting conditions
  • Studying multimodal learning with aligned thermal and visual data
  • Creating synthetic training data for low-light vision tasks

Not Ideal For

  • Commercial projects requiring licensing for product deployment
  • Research focused exclusively on visible-light imaging without infrared data
  • Real-time applications needing lightweight, fast-processing datasets
  • Studies in well-lit environments where low-light conditions are irrelevant

Pros & Cons

Pros

Large-Scale Aligned Pairs

With 30,976 precisely aligned visible-infrared images, it offers substantial data for robust multimodal model training, as emphasized in the dataset description.

Comprehensive Task Baselines

Provides implementations for image fusion, pedestrian detection, and image translation, including pre-trained models like pix2pixGAN, reducing initial setup effort for researchers.

Pedestrian-Focused Annotations

Includes bounding box labels for pedestrians, enabling direct benchmarking for object detection in low-light scenarios, with tools for format conversion.

Community and Competition

Hosts a Kaggle competition to foster algorithmic development, encouraging collaborative research and standardizing evaluation metrics.

Cons

Non-Commercial License Limitation

The dataset is restricted to non-commercial use, which excludes industry applications and requires alternative sources for commercial projects.

Outdated Dependencies and Setup

Baseline implementations rely on deprecated libraries like TensorFlow 1.14.0, making environment setup complex and prone to compatibility issues.

Annotation Quality Concerns

The README notes corrections to annotation errors, indicating potential inconsistencies that researchers must account for in their work.

Frequently Asked Questions

Quick Stats

Stars819
Forks74
Contributors0
Open Issues32
Last commit8 months ago
CreatedSince 2021

Tags

#image-translation#multimodal-data#deep-learning#yolov5#cnn#gan#generative-adversarial-networks#computer-vision#image-to-image-translation#dataset#object-detection

Built With

T
TensorFlow
O
OpenCV
P
Python
P
PyTorch

Included in

Deep Learning27.8k
Auto-fetched 6 hours ago

Related Projects

Fashion-MNISTFashion-MNIST

A MNIST-like fashion product database. Benchmark :point_down:

Stars12,723
Forks3,075
Last commit3 years ago
Open Images datasetOpen Images dataset

The Open Images dataset

Stars4,365
Forks606
Last commit4 years ago
DeepMind QA CorpusDeepMind QA Corpus

Question answering dataset featured in "Teaching Machines to Read and Comprehend

Stars1,296
Forks240
Last commit9 years ago
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub