Open-Awesome
CategoriesAlternativesStacksSelf-HostedExplore
Open-Awesome

© 2026 Open-Awesome. Curated for the developer elite.

TermsPrivacyAboutGitHubRSS
  1. Home
  2. Robotic Tooling
  3. semantic_slam

semantic_slam

GPL-3.0C++

Real-time 3D semantic mapping system using a handheld RGB-D camera, built on ROS with ORB_SLAM2 and PSPNet.

GitHubGitHub
716 stars175 forks0 contributors

What is semantic_slam?

Semantic SLAM is a real-time 3D mapping system that builds semantically annotated maps from a handheld RGB-D camera. It solves the problem of environment understanding for robotics by combining simultaneous localization and mapping (SLAM) with deep learning-based semantic segmentation to label objects (e.g., walls, furniture) in the generated 3D map.

Target Audience

Robotics researchers, computer vision engineers, and developers working on autonomous navigation, augmented reality, or environmental modeling who need real-time semantic 3D perception.

Value Proposition

It offers an open-source, integrated pipeline for semantic SLAM using commodity hardware, with configurable output (semantic or RGB maps) and multiple fusion methods, built on robust ROS and proven libraries like ORB_SLAM2 and OctoMap.

Overview

Real time semantic slam in ROS with a hand held RGB-D camera

Use Cases

Best For

  • Robotic navigation in dynamic indoor environments
  • Building 3D semantic maps for augmented reality applications
  • Research on real-time semantic SLAM algorithms
  • Environmental modeling with object-level understanding
  • Integrating deep learning semantic segmentation with SLAM systems
  • Educational projects in robotic perception and computer vision

Not Ideal For

  • Applications requiring semantic updates faster than 2-3 Hz for dynamic environments
  • Projects not using ROS or those with different middleware frameworks
  • Systems lacking GPU hardware for efficient deep learning inference
  • Teams needing support for modern RGB-D cameras beyond Asus Xtion or ROS bags

Pros & Cons

Pros

Real-Time Semantic Mapping

Integrates ORB_SLAM2, depth data, and PSPNet segmentation to generate 3D semantic octomaps at interactive rates, as demonstrated in the run-time analysis with 1 Hz map updates.

Configurable Output Formats

Allows switching between semantic octomaps with object labels and standard RGB octomaps via parameter settings, offering flexibility for different perception tasks.

Multiple Fusion Methods

Supports max-confidence or Bayesian fusion for semantic label integration, providing options for robustness in label assignment, as detailed in the project report.

ROS Integration

Implemented as ROS nodes with launch files, compatible with common RGB-D cameras like Asus Xtion and ROS bags, simplifying deployment in robotic systems.

Cons

Outdated Dependencies

Relies on PyTorch 0.4.0 and ORB_SLAM2, which are older versions that may cause compatibility issues with modern systems and lack ongoing updates.

Complex Installation Process

Requires manual building of ORB_SLAM2, handling multiple dependencies, and configuring parameters in YAML files, making setup non-trivial and error-prone.

Slow Semantic Update Rate

Semantic segmentation runs at only 2-3 Hz, as noted in the run-time section, which could bottleneck real-time applications in fast-moving scenarios.

Limited Sensor Support

Primarily tested with Asus Xtion camera and specific GPU hardware, with no clear documentation for adapting to other RGB-D sensors or edge devices.

Frequently Asked Questions

Quick Stats

Stars716
Forks175
Contributors0
Open Issues44
Last commit7 years ago
CreatedSince 2018

Tags

#robotics#3d-mapping#rgb-d#3d-reconstruction#semantic-segmentation#ros#computer-vision#pytorch#slam

Built With

R
ROS
P
PCL
O
OctoMap
P
PyTorch

Included in

Robotic Tooling3.8k
Auto-fetched 1 day ago

Related Projects

detectron2detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Stars34,442
Forks7,932
Last commit26 days ago
EasyOCREasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Stars29,389
Forks3,567
Last commit5 months ago
imgaugimgaug

Image augmentation for machine learning experiments.

Stars14,737
Forks2,455
Last commit1 year ago
meshroommeshroom

Node-based Visual Programming Toolbox

Stars12,703
Forks1,206
Last commit4 days ago
Community-curated · Updated weekly · 100% open source

Found a gem we're missing?

Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.

Submit a projectStar on GitHub