A large-scale multi-domain dataset of over 20k annotated task-oriented dialogues for training and evaluating virtual assistants.
The Schema-Guided Dialogue (SGD) dataset is a large-scale collection of over 20,000 annotated, task-oriented conversations between humans and virtual assistants across 20 domains. It provides structured schemas defining service APIs and rich annotations to support the development and evaluation of dialogue systems for real-world applications like booking, information retrieval, and transactions. The dataset addresses the need for scalable, multi-domain benchmarks that reflect the complexity of actual virtual assistant interactions.
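To make the schema idea concrete, here is an abbreviated sketch of what one service schema entry looks like, following the field names of the published format; the concrete values are invented for illustration, not copied from the dataset:

```python
# Abbreviated sketch of one service schema entry, following the published
# SGD schema format; the values below are illustrative, not real data.
restaurant_schema = {
    "service_name": "Restaurants_1",
    "description": "A service for finding restaurants and reserving tables",
    "slots": [
        {
            "name": "city",
            "description": "City where the restaurant is located",
            "is_categorical": False,   # free-form slot, no fixed value set
            "possible_values": [],
        },
    ],
    "intents": [
        {
            "name": "ReserveRestaurant",
            "description": "Reserve a table at a restaurant",
            "is_transactional": True,  # performs a booking, not just a lookup
            "required_slots": ["restaurant_name", "city", "time"],
            "optional_slots": {"party_size": "2"},  # slot -> default value
            "result_slots": ["restaurant_name", "city", "time", "party_size"],
        },
    ],
}
```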
Researchers and engineers working on conversational AI, natural language understanding, and dialogue systems, particularly those focused on task-oriented virtual assistants, dialogue state tracking, and zero-shot generalization.
It offers scale and domain diversity beyond earlier public task-oriented corpora, with structured schema definitions enabling robust model training and evaluation. Unseen domains in the evaluation sets support zero-shot testing, and the companion SGD-X benchmark measures robustness to linguistic variation in schemas, making it a comprehensive tool for advancing real-world dialogue systems.
The Schema-Guided Dialogue Dataset
Covers 20 domains, several served by multiple services with overlapping APIs, providing a realistic testbed that mirrors the complexity virtual assistants face in production.
Annotates every turn with dialogue state, user/system dialogue acts, and service calls with their results, enabling research on natural language understanding (NLU), dialogue state tracking (DST), and policy learning; see the example turn after this list.
Holds out domains that appear only in the evaluation sets, enabling rigorous assessment of zero-shot generalization to unseen services, a key goal of the dataset design.
Ships with SGD-X, a companion benchmark of crowdsourced schema variants (five paraphrased versions of every schema) for measuring how well systems handle stylistic diversity in schema wording, a proxy for real-world robustness.
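As a concrete illustration of the per-turn annotations noted above, here is a hand-written sketch of a single user turn in the published dialogue format; the utterance and values are invented, and real turns carry one frame per active service:

```python
# Hand-written sketch of one annotated user turn, following the published
# SGD dialogue format; all values are illustrative.
user_turn = {
    "speaker": "USER",
    "utterance": "Book me a table in Palo Alto at 7 pm.",
    "frames": [
        {
            "service": "Restaurants_1",
            # Character-level slot spans within the utterance.
            "slots": [
                {"slot": "city", "start": 19, "exclusive_end": 28},
                {"slot": "time", "start": 32, "exclusive_end": 36},
            ],
            # Dialogue acts expressed by the user in this turn.
            "actions": [
                {"act": "INFORM", "slot": "city", "values": ["Palo Alto"]},
                {"act": "INFORM_INTENT", "slot": "intent",
                 "values": ["ReserveRestaurant"]},
            ],
            # Full dialogue state after this turn (the DST target).
            "state": {
                "active_intent": "ReserveRestaurant",
                "requested_slots": [],
                "slot_values": {"city": ["Palo Alto"], "time": ["7 pm"]},
            },
        },
    ],
}
```

System turns carry the same frame structure, with any service call and its results recorded in the frame instead of a dialogue state.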
The JSON representation of dialogues and schemas is deeply nested (dialogues contain turns, turns contain frames, and frames contain slots, actions, and state), so significant parsing and preprocessing effort stands between the raw data and quick experimentation; a minimal flattening sketch follows this list.
Dialogue outlines are generated by a simulator and then paraphrased by crowd workers, so conversations may not fully capture the spontaneity, disfluencies, and errors of natural human interactions, limiting realism for some applications.
While baseline code is linked, the dataset itself lacks built-in preprocessing or evaluation scripts, placing the burden on users to implement full pipelines from raw data to models.
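For the flattening sketch promised above: a minimal Python pass over one split, assuming the directory layout of the public GitHub release (train/dialogues_001.json and so on); the output record shape here is an illustrative choice, not an official format.

```python
"""Minimal sketch: flatten SGD dialogues into per-turn DST records.

Assumes the directory layout of the public GitHub release
(e.g. train/dialogues_001.json, train/dialogues_002.json, ...).
"""
import glob
import json


def flatten_split(split_dir):
    """Return one flat record per (user turn, frame) pair in a split."""
    records = []
    for path in sorted(glob.glob(f"{split_dir}/dialogues_*.json")):
        with open(path, encoding="utf-8") as f:
            dialogues = json.load(f)
        for dialogue in dialogues:
            for turn_idx, turn in enumerate(dialogue["turns"]):
                # Dialogue state annotations live on user turns.
                if turn["speaker"] != "USER":
                    continue
                for frame in turn["frames"]:
                    state = frame.get("state", {})
                    records.append({
                        "dialogue_id": dialogue["dialogue_id"],
                        "turn_idx": turn_idx,
                        "service": frame["service"],
                        "utterance": turn["utterance"],
                        "active_intent": state.get("active_intent", "NONE"),
                        "slot_values": state.get("slot_values", {}),
                    })
    return records


if __name__ == "__main__":
    records = flatten_split("train")
    print(f"{len(records)} user-turn frames")
    print(records[0])
```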