A PyTorch implementation of the DrQA model for reading comprehension and open-domain question answering.
DrQA is a PyTorch implementation of a reading comprehension model designed to answer open-domain questions by processing evidence text. It solves the problem of extracting precise answers from natural language paragraphs, as benchmarked by the SQuAD dataset, with a focus on simplicity and strong performance.
Researchers and developers in natural language processing who need a clean, modifiable codebase for experimenting with reading comprehension models and question-answering systems.
Developers choose this implementation for its lightweight, focused design that strips away unnecessary complexity, making it easier to understand, modify, and iterate on compared to bulkier official versions or chatbot frameworks.
A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.
Achieves Exact Match and F1 scores nearly identical to the original paper and official implementations, as validated in the results table with EM 69.64 and F1 78.76.
Strips away extraneous features like document retrieval, offering a lightweight implementation that is easy to read, modify, and experiment with for SQuAD tasks, as emphasized in the philosophy.
Uses spaCy for lemma, POS, and named entity tags, which is faster than Stanford CoreNLP while maintaining similar accuracy, as noted in the detailed comparisons section.
Optimized for single-GPU environments, reducing configuration complexity and making it accessible for research on standard hardware, unlike bulkier official versions.
Lacks the document retriever and interactive inference API present in the full DrQA system, limiting it to scenarios where evidence text is already provided, as admitted in comparisons.
Only supports single-GPU training, which can slow down training times and hinder scalability compared to implementations that leverage multiple GPUs, as noted in the README.
Preprocessing requires approximately 9GB of memory with default settings, as warned in the setup, which may be prohibitive for machines with limited RAM.
Implements a 2017 model without updates for modern advancements like transformers, making it less competitive with newer models that surpass DrQA on benchmarks.
TensorFlow code and pre-trained models for BERT
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to achieve a query-aware context representation without early summarization.
A Tensorflow implementation of QANet for machine reading comprehension
Tensorflow Implementation of R-Net
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.