A lightweight encoder-decoder neural network for real-time semantic segmentation on resource-constrained devices.
LEDNet is a lightweight convolutional neural network designed for real-time semantic segmentation, a computer vision task that assigns a class label to each pixel in an image. It solves the problem of high computational cost in dense estimation tasks, enabling efficient deployment on mobile and embedded devices. The model uses an asymmetric encoder-decoder architecture with channel split/shuffle operations and an attention pyramid network to achieve high speed and accuracy.
Researchers and engineers working on real-time computer vision applications, particularly those deploying semantic segmentation models on resource-constrained hardware like mobile devices or edge computing platforms.
Developers choose LEDNet for its exceptional balance of speed and accuracy with minimal parameters (under 1M). It outperforms other lightweight models on standard benchmarks like Cityscapes while maintaining real-time inference speeds, making it ideal for production systems with latency constraints.
LEDNet: A Lightweight Encoder-Decoder Network for Real-time Semantic Segmentation
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
With under 1M parameters, LEDNet is optimized for memory-constrained environments, enabling deployment on mobile and embedded devices as emphasized in the README.
Processes over 71 FPS on a GTX 1080Ti GPU, making it suitable for latency-sensitive applications like autonomous driving or robotics that require fast pixel-wise predictions.
Uses channel split and shuffle operations in residual blocks to drastically reduce computational costs while maintaining accuracy, as described in the paper and implementation.
Achieves state-of-the-art results on Cityscapes with 70.6% IoU_cla and 87.1% IoU_cat among models of comparable speed, validating its speed-accuracy trade-off.
The code is tested with PyTorch 0.4.1 and CUDA 9.0, which are outdated and may cause compatibility issues with modern systems, requiring extra maintenance effort.
While Cityscapes is fully supported, other datasets like CamVid and VOC are listed as future work, forcing users to adapt data loaders for immediate use on non-Cityscapes data.
Installation involves specific environment setup, dataset downloading with conversion scripts, and dependency management, which can be cumbersome for quick prototyping or deployment.