Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Budzianowski et al. - EMNLP 2018 - MultiWOZ - A...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
tosho
December 10, 2018
Research
360
0
Share
Budzianowski et al. - EMNLP 2018 - MultiWOZ - A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling
tosho
December 10, 2018
More Decks by tosho
See All by tosho
Experts, Errors, and Context: A Large-Scale Study of Human Evaluation for Machine Translation
tosho
0
320
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation
tosho
0
370
Shaham and Levy, 2021. Neural Machine Translation without Embeddings. NAACL2021
tosho
0
130
Liu et al., 2021. Pay Attention to MLPs. arXiv
tosho
0
180
Huang et al. 2020 Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting
tosho
0
490
Ive, Madhyastha, Specia_2019_EMNLP_Deep Copycat Networks for Text-to-Text Generation
tosho
0
160
Tan, Bansal_2019_EMNLP_LXMERT Learning Cross-Modality Encoder Representations from Transformers
tosho
0
260
Tsai et al._2019_ACL_Multimodal Transformer for Unaligned Multimodal Language Sequences
tosho
0
430
Zhou et al. 2019. Density Matching for Bilingual Word Embedding. NAACL
tosho
3
320
Other Decks in Research
See All in Research
2026年3月1日(日)福島「除染土」の公共利用をかんがえる
atsukomasano2026
0
540
【NICOGRAPH2025】Photographic Conviviality: ボディペイント・ワークショップによる 同時的かつ共生的な写真体験
toremolo72
0
220
Sequences of Logits Reveal the Low Rank Structure of Language Models
sansantech
PRO
1
190
AIエージェント時代のLLM-jpモデルのあるべき姿
k141303
0
290
AIスーパーコンピュータにおけるLLM学習処理性能の計測と可観測性 / AI Supercomputer LLM Benchmarking and Observability
yuukit
1
830
世界モデルにおける分布外データ対応の方法論
koukyo1994
7
2.1k
データサイエンティストをめぐる環境の違い2025年版〈一般ビジネスパーソン調査の国際比較〉
datascientistsociety
PRO
0
1.2k
An Open and Reproducible Deep Research Agent for Long-Form Question Answering
ikuyamada
0
410
進学校の生徒にはア行の苗字が多いのか
ozekinote
0
360
Dwangoでの漫画データ活用〜漫画理解と動画作成〜@コミック工学シンポジウム2025
kzmssk
0
220
Can We Teach Logical Reasoning to LLMs? – An Approach Using Synthetic Corpora (AAAI 2026 bridge keynote)
morishtr
1
200
Thirty Years of Progress in Speech Synthesis: A Personal Perspective on the Past, Present, and Future
ktokuda
0
210
Featured
See All Featured
So, you think you're a good person
axbom
PRO
2
2k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.8k
Digital Projects Gone Horribly Wrong (And the UX Pros Who Still Save the Day) - Dean Schuster
uxyall
0
1.2k
From π to Pie charts
rasagy
0
170
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
23k
The Language of Interfaces
destraynor
162
26k
Keith and Marios Guide to Fast Websites
keithpitt
413
23k
Skip the Path - Find Your Career Trail
mkilby
1
110
Leveraging LLMs for student feedback in introductory data science courses - posit::conf(2025)
minecr
1
240
Designing for humans not robots
tammielis
254
26k
Build your cross-platform service in a week with App Engine
jlugia
234
18k
Joys of Absence: A Defence of Solitary Play
codingconduct
1
350
Transcript
MultiWOZ – A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue
Modeling Tosho Hirasawa
0. Overview • -6<+?E$> • 4L3I/%) H2 • :@Multi-Domain
Wizard-of-Oz (MultiWOZ) • KJ • ("*72 GA/9F8 #!= 5 ,1 • 0.BD&* ' &(*;C
1. Introduction • Conversational Artificial Intelligence • human-level *)&($ •
#%' ! • Seneff and Polifroni, 2000 • "Raux et al., 2005 • Amazon AlexaRam et al., 2018
1. Introduction • \T@F [C0*%0# RA •
2DKU • =W:J • ?6) 8V • OXN3A • PH517 E2E ,"/LI • <];Z17MYB( >E • &!-0Q • " 9 • [C$+0_4D • GS5'.-0^
1. Introduction , , 2017
2. Related Works • >K&.(%3/9 ! • Machine-to-Machine • *5/4+"O6K"R
• HLJ-$) T DM6K\E ]X • Human-to-Machine • 7:=@^Y'(*0UZ9";I • G OE! :B • HLJ^Y'(*0 YS?,1$5&.(NI • Human-to-Human • G<QW &(+< • Twitter, Reddit, Ubuntu 6K"_8NI! • HLJ6KC[ AP#-*'25 FV
3. Data Collection Set-up • Wizard-of-Oz E4 • Dialogue Task:
• *,-@ ontology random sampling !'#%"8(6 • User Side: • (6=1 97CF.;A • System (Wizard) Side: • $ 2: 97/D • Wizard/User (6>, (6JG+ • (6)I30< • (6H5&?B)I30
3. Data Collection Set-up • Annotation of Dialogue Acts •
Dialogue Act = intent + slot-value pairs • intent: inform / request • slot-value: domain, price, … • Amazon Mechanical Turk +!" &$ dialogue acts .) • !" &$ '- /( • % ,*0.8843#0
4. MultiWOZ Dialogue Corpus •
: domain
4. MultiWOZ Dialogue Corpus : expensive : domain
4. MultiWOZ Dialogue Corpus • (turns in a
dialogue) • 8.93 (single-domain), 15.39 (multi-domain) • 115,434 turns • >70% 10 turns • (sentence length) • 11.75 (user), 15.12 (wizard)
4. MultiWOZ Dialogue Corpus • Dialogue Acts • 60% turns
action • %# • "$ • %# !"$
4. MultiWOZ Dialogue Corpus • •
• Multi-Domain, Dialogue Act
5. MultiWOZ as a New Benchmark • Dialogue modelling task
• Dialogue State Tracking • (,# '/ • &,.5-0)1 ontology • Dialogue-Context-to-Text Generation • (,Dialogue State, # '/ • &,!16 • Cam676/MultiWOZ 28 • % $"+* • RNN 473 • Cam676: GRU • MultiWOZ: LSTM
5. MultiWOZ as a New Benchmark • Dialogue-Act-to-Text Generation •
Structured meaning representation (Dialogue Act?) • • Semantically Conditioned LSTM (Wen+, 2015) • SFX MultiWOZ restaurant • SER = (missing slots + redundant slots) / total slots Wen+, 2015
6. Conclusion • )1"&7* 8 E2E #$20
• Modular-based (+%' • MultiWOZ 3 46 • !-53. github /,