A benchmark dataset and meta self-learning method for multi-source domain adaptation in scene text recognition.
Meta-SelfLearning is a research project that provides a benchmark dataset and method for multi-source domain adaptation in scene text recognition. It tackles the problem of domain shift by introducing a Meta Self-Learning approach, which combines pseudo-labeling with meta-learning to improve model performance across diverse text domains like car plates, documents, street signs, and handwritten text.
Researchers and practitioners in computer vision, particularly those working on domain adaptation, text recognition, or meta-learning. It is also relevant for developers building OCR systems that need to generalize across varied visual domains.
It offers the first multi-domain text recognition dataset with over five million images and a novel meta-learning-based method that outperforms traditional approaches in handling domain shift, providing a solid benchmark for future research.
Meta Self-learning for Multi-Source Domain Adaptation: A Benchmark
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.
Offers over five million images across five distinct domains (car, document, street, handwritten, synthetic), providing a comprehensive resource for robust text recognition training.
Combines self-learning with meta-learning to improve generalization, achieving state-of-the-art results on the benchmark, as shown in the paper and leaderboard badges.
Includes pre-trained models and extensive experiments, making it easy to reproduce and compare against baselines for multi-source domain adaptation research.
Supports various modules like ResNet, BiLSTM, and Attention, allowing customization for different experimental setups, as detailed in the training arguments.
Requires converting raw data to LMDB format with precise folder structures and label files, which is error-prone and adds overhead before training can begin.
Locked to Python 3.7 and PyTorch 1.7.0, which may cause compatibility issues with modern environments and limit access to newer library features.
Dataset download relies on external links like Baidu Cloud, which can be slow or inaccessible for international users, as noted in the README's data preparation section.