A deep learning model for machine comprehension that uses bi-directional attention flow to answer questions about text passages.
Bi-directional Attention Flow (BiDAF) is a deep learning model for machine comprehension, specifically designed to answer questions based on a given text passage. It solves the problem of understanding and extracting relevant information from context to provide accurate answers, using a novel attention mechanism that flows in both directions between the question and the context. The model was a top performer on the SQuAD benchmark, demonstrating strong performance in reading comprehension tasks.
BiDAF is aimed at researchers and practitioners in natural language processing and machine learning who work on question answering, reading comprehension, or attention-based neural network models. It is particularly relevant for those benchmarking on the SQuAD dataset or developing similar NLP systems.
Developers choose BiDAF for its effective bi-directional attention mechanism that avoids early summarization, preserving context information and leading to higher accuracy on comprehension tasks. Its availability as an open-source implementation with pre-trained weights allows for easy reproduction of state-of-the-art results and further experimentation.
The Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents the context at different levels of granularity and uses a bi-directional attention flow mechanism to produce a query-aware context representation without early summarization.
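The core idea can be sketched in a few lines: attention flows context-to-query (each context word attends over query words) and query-to-context (the context words most relevant to any query word are highlighted), and the outputs are concatenated with the original context encodings rather than summarized into a single vector. The sketch below uses a plain dot-product similarity for brevity; the paper's trainable similarity function and the surrounding encoder/output layers are omitted.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def bidaf_attention(H, U):
    """Bi-directional attention between context H (T x d) and query U (J x d).

    NOTE: a simplified sketch. The paper uses a trainable similarity
    alpha(h, u) = w^T [h; u; h * u]; here we use a dot product instead.
    Returns the query-aware context representation G (T x 4d).
    """
    S = H @ U.T                       # (T, J) similarity matrix

    # Context-to-query: each context word attends over all query words.
    a = softmax(S, axis=1)            # (T, J) attention weights
    U_att = a @ U                     # (T, d) attended query vectors

    # Query-to-context: weight context words by their max similarity
    # to any query word, then broadcast the pooled vector over time.
    b = softmax(S.max(axis=1))        # (T,) attention weights
    h_att = b @ H                     # (d,) pooled context vector
    H_att = np.tile(h_att, (H.shape[0], 1))  # (T, d)

    # No early summarization: keep per-timestep vectors and concatenate.
    G = np.concatenate([H, U_att, H * U_att, H * H_att], axis=1)
    return G                          # (T, 4d)
```

In the full model, G is fed to a modeling layer (stacked bi-LSTMs) and an output layer that predicts answer span start and end positions; the key property shown here is that attention is computed at every timestep and flows in both directions, so no information is collapsed into a fixed-size summary before the modeling layer.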
Bi-directional attention flow avoids early summarization, preserving context information for more accurate question answering, as described in the paper and the key features.
Supports training and testing across multiple GPUs for faster processing and larger batches, detailed in the Multi-GPU section.
Includes model weights reproducing official SQuAD results, facilitating easy benchmarking and research, as noted in section 3.1.
Achieved state-of-the-art EM and F1 scores on SQuAD, with ensemble models reaching up to 73.3% EM, as shown in the results table.
Relies on TensorFlow r0.11 and Python 3.5.2, which are outdated and may cause compatibility issues with modern systems.
Requires at least 12GB of GPU RAM for training, limiting accessibility for users with less powerful hardware, as stated in the training section.
Setup involves multiple steps, including data downloading, preprocessing, and configuring optimization flags, making it time-consuming and error-prone.