TensorFlow implementation of weakly-supervised object localization using only image-level labels, without bounding box annotations.
Weakly_detector is a TensorFlow implementation of a weakly-supervised object localization method that can identify object locations in images using only image-level class labels. It implements the Class Activation Mapping (CAM) technique from the CVPR 2016 paper "Learning Deep Features for Discriminative Localization," which allows models to localize objects without requiring bounding box annotations during training. This solves the problem of expensive annotation costs in traditional object detection pipelines.
Computer vision researchers and practitioners working on object localization, especially those with limited annotated data or resources for bounding box labeling. It's also suitable for developers implementing weakly-supervised learning techniques in TensorFlow.
Developers choose Weakly_detector because it provides a clean TensorFlow implementation of a seminal weakly-supervised localization paper, enabling object detection with significantly reduced annotation requirements. The implementation is straightforward to adapt to custom datasets and integrates with standard pretrained models like VGG.
Tensorflow implementation of "Learning Deep Features for Discriminative Localization"
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Uses only image-level labels for training, eliminating the need for expensive bounding box annotations as highlighted in the paper's philosophy.
Provides a clean TensorFlow implementation of Class Activation Mapping, enabling heatmap generation to visualize discriminative regions without extra supervision.
Leverages VGG pretrained weights for transfer learning, making adaptation to custom datasets straightforward as mentioned in the README.
Tested on Caltech256 and Action40 datasets, and supports custom dataset training with minimal code changes, demonstrating practical applicability.
Relies on VGG networks from 2014, which are computationally inefficient and less accurate than modern architectures like ResNet or EfficientNet.
Requires manual download of external VGG weights and potential adjustments for TensorFlow version compatibility, adding initial setup overhead.
The README is brief, lacking detailed instructions for troubleshooting, evaluation metrics, or updates for newer TensorFlow versions.