Curriculum Prompt Learning with Self-Training for Abstractive Dialogue Summarization

Overwhelming amounts of dialogue are being recorded in instant messaging apps, Slack channels, customer service interactions, and so on. Wouldn't it be great if automated tools could provide us with short, succinct summaries?
However, challenges such as insufficient training data and low information density impede our ability to train abstractive summarization models. In this work, we propose a novel curriculum-based prompt learning method with self-training to address these problems. Specifically, prompts are learned using a curriculum learning strategy that gradually increases the degree of prompt perturbation, thereby improving the dialogue understanding and modeling capabilities of our model. Unlabeled dialogue is incorporated by means of self-training so as to reduce the dependency on labeled data. We further investigate topic-aware prompts to better plan for the generation of summaries. Experiments confirm that our model substantially outperforms strong baselines and achieves new state-of-the-art results on the AMI and ICSI datasets.

Published at EMNLP 2022
Paper: http://gerard.demelo.org/papers/dialogue-summarization.pdf

Gerard de Melo

December 19, 2022

Transcript

  1. Curriculum Prompt Learning with Self-Training
    for Abstractive Dialogue Summarization
    Changqun Li1, Linlin Wang1, Xin Lin1, Gerard de Melo2, Liang He1
    1 East China Normal University
    2 Hasso Plattner Institute / University of Potsdam


  2. Massive Amounts of Dialogue
    Image: REVE Chat
    Customer Service
    Instant Messaging
    Slack
    Etc.


  3. Dialogue Summarization


  4. Dialogue Summarization
    Challenge 1:
    Key information scattered
    across utterances
    (by different participants)


  5. Dialogue Summarization
    Challenge 1:
    Key information scattered
    across utterances
    (by different participants)
    Challenge 2:
    Topic Drift


  6. Bike Image: Adapted from https://www.flickr.com/photos/swambo/14119129185
    Conventional Transformer-based models
    (e.g. BART)
    Prior Work
    Incorporating dialogue characteristics,
    e.g. dialogue acts, discourse,
    topic segments
    Hierarchical architectures
    for long dialogues
    Challenge 1:
    Key information scattered
    across utterances
    (by different participants)
    Challenge 2:
    Topic Drift


  7. Bike Image: Adapted from https://www.flickr.com/photos/swambo/14119129185
    Conventional Transformer-based models
    (e.g. BART)
    Prior Work
    Incorporating dialogue characteristics,
    e.g. dialogue acts, discourse,
    topic segments
    Hierarchical architectures
    for long dialogues
    Challenge 1:
    Key information scattered
    across utterances
    (by different participants)
    Challenge 2:
    Topic Drift
    Challenge 3:
    Insufficient Training Data
    (e.g. just 137 meetings)


  8. Bike Image: Adapted from https://www.flickr.com/photos/swambo/14119129185
    Key Ideas
    1) Custom Prompt-based Learning
    for Better Dialogue Understanding
    Challenge 1:
    Key information scattered
    across utterances
    (by different participants)
    Challenge 2:
    Topic Drift
    Challenge 3:
    Insufficient Training Data
    (e.g. just 137 meetings)
    2) Exploit Unlabeled Data


  9. Approach
    Heterogeneous
    Prompts
    Self-Training


  10. Approach


  11. Curriculum Prompt Learning
    in Encoder


  12. Prompts
    Jane: Did the problem with your account get resolved?
    Evelyn: Well, I can sign in but can’t push code.
    Jane: Oh, let me have a look at the log files.
    ...


  13. Prompts: Text Prompts
    Jane: Did the problem with your account get resolved?
    Evelyn: Well, I can sign in but can’t push code.
    Jane: Oh, let me have a look at the log files.
    ...
    Summary of the dialogue:
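
    As a concrete illustration, the sketch below appends the text prompt shown on this slide to a dialogue and feeds it to an off-the-shelf BART summarizer from Hugging Face Transformers; the checkpoint and generation settings are illustrative assumptions, not necessarily those used in the paper.

    from transformers import BartForConditionalGeneration, BartTokenizer

    tokenizer = BartTokenizer.from_pretrained("facebook/bart-large")
    model = BartForConditionalGeneration.from_pretrained("facebook/bart-large")

    dialogue = (
        "Jane: Did the problem with your account get resolved? "
        "Evelyn: Well, I can sign in but can't push code. "
        "Jane: Oh, let me have a look at the log files."
    )

    # Text prompt: a fixed natural-language instruction appended to the dialogue.
    prompted_input = dialogue + " Summary of the dialogue:"

    inputs = tokenizer(prompted_input, return_tensors="pt", truncation=True)
    summary_ids = model.generate(**inputs, num_beams=4, max_length=60)
    print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))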


  14. Prompts: Soft Prompts
    Jane: Did the problem with your account get resolved?
    Evelyn: Well, I can sign in but can’t push code.
    Jane: Oh, let me have a look at the log files.
    ...
    P1
    P2
    ... Pn
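
    Soft prompts replace the fixed instruction with continuous, trainable vectors P1 ... Pn prepended to the encoder's input embeddings. A minimal PyTorch sketch, with the number of prompts and hidden size chosen only for illustration:

    import torch
    import torch.nn as nn

    class SoftPromptLayer(nn.Module):
        """Prepends n trainable prompt vectors P1 ... Pn to the token embeddings."""

        def __init__(self, n_prompts: int, d_model: int):
            super().__init__()
            self.prompts = nn.Parameter(torch.randn(n_prompts, d_model) * 0.02)

        def forward(self, token_embeds: torch.Tensor) -> torch.Tensor:
            # token_embeds: (batch, seq_len, d_model)
            batch_size = token_embeds.size(0)
            prompts = self.prompts.unsqueeze(0).expand(batch_size, -1, -1)
            return torch.cat([prompts, token_embeds], dim=1)

    # Example sizes (illustrative): 20 prompt vectors for a 1024-dim encoder.
    soft_prompts = SoftPromptLayer(n_prompts=20, d_model=1024)
    dialogue_embeds = torch.randn(2, 128, 1024)    # stand-in for embedded dialogue tokens
    encoder_input = soft_prompts(dialogue_embeds)  # shape: (2, 20 + 128, 1024)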


  15. Prompts: Perturbed Prompts
    Jane: Did the problem with your account get resolved?
    Evelyn: Well, I can sign in but can’t push code.
    Jane: Oh, let me have a look at the log files.
    ...
    P2
    P1
    ... 0
    Prevent Overfitting
    via:
    Random Swapping
    and Cutoff
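
    The two perturbations named on this slide can be sketched as follows: random swapping exchanges the positions of two prompt vectors, and cutoff zeroes out a fraction of them. The swap count and cutoff ratio below are assumptions for illustration, not the paper's exact settings.

    import torch

    def random_swap(prompts: torch.Tensor) -> torch.Tensor:
        """Swap the positions of two randomly chosen prompt vectors."""
        p = prompts.clone()
        i, j = torch.randperm(p.size(0))[:2].tolist()
        p[[i, j]] = p[[j, i]]
        return p

    def cutoff(prompts: torch.Tensor, ratio: float = 0.2) -> torch.Tensor:
        """Zero out a random fraction of the prompt vectors."""
        p = prompts.clone()
        n_cut = max(1, int(ratio * p.size(0)))
        p[torch.randperm(p.size(0))[:n_cut]] = 0.0
        return p

    clean_prompts = torch.randn(20, 1024)  # stand-in for the learned soft prompts
    perturbed_prompts = cutoff(random_swap(clean_prompts), ratio=0.2)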


  16. Prompts: Interpolated Prompts
    with Curriculum Schedule
    P2
    P1
    ... 0
    P1
    P2
    ... Pn
    MixUp
    interpolation
    Curriculum Learning:
    Gradually increase perturbation
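
    A sketch of the interpolation step: the clean prompts and their perturbed copies are mixed MixUp-style, with the mixing weight shifted toward the perturbed side as training progresses. The linear schedule below is an assumption standing in for whatever curriculum schedule the paper actually uses.

    import torch

    def curriculum_weight(step: int, total_steps: int) -> float:
        """Degree of perturbation, increased linearly from 0 to 1 over training."""
        return min(1.0, step / total_steps)

    def interpolate_prompts(clean: torch.Tensor, perturbed: torch.Tensor,
                            step: int, total_steps: int) -> torch.Tensor:
        """MixUp-style interpolation between clean and perturbed prompts."""
        lam = curriculum_weight(step, total_steps)
        return (1.0 - lam) * clean + lam * perturbed

    clean = torch.randn(20, 1024)              # soft prompts P1 ... Pn
    perturbed = clean.clone()
    perturbed[torch.randperm(20)[:4]] = 0.0    # e.g. a cutoff perturbation
    mixed = interpolate_prompts(clean, perturbed, step=1000, total_steps=10000)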


  17. Decoder


  18. Decoder with
    Topic-based Prompts
    David: I heard you’ve taken over Chris’s company? Is that true? Julie: Yes. ...
    Decoder
    Dialogue
    embeddings
    Summary


  19. Decoder with
    Topic-based Prompts
    David: I heard you’ve taken over Chris’s company? Is that true? Julie: Yes. ...
    DialoGPT
    Topic segmentation
    Topic Prompts
    Decoder
    Dialogue
    embeddings
    Summary
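
    One way to realize the pipeline on this slide is sketched below: a segmentation step (here a trivial fixed-size placeholder standing in for the DialoGPT-based segmenter) splits the dialogue into topic segments, from which short topic prompts are built and handed to the decoder. The segmentation criterion and prompt format are illustrative assumptions, not the paper's exact procedure.

    from typing import List

    def segment_topics(utterances: List[str], size: int = 4) -> List[List[str]]:
        """Placeholder for DialoGPT-based topic segmentation: fixed-size chunks."""
        return [utterances[i:i + size] for i in range(0, len(utterances), size)]

    def build_topic_prompt(segments: List[List[str]]) -> str:
        """One short topic cue per segment, here simply its first utterance."""
        return " | ".join(f"Topic {i + 1}: {seg[0]}" for i, seg in enumerate(segments))

    utterances = [
        "David: I heard you've taken over Chris's company? Is that true?",
        "Julie: Yes.",
    ]
    topic_prompt = build_topic_prompt(segment_topics(utterances))
    # The topic prompt is passed to the decoder together with the dialogue
    # embeddings, so that generation can be planned topic by topic.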


  20. Prompt Optimization
    with Self-Training


  21. Prompt Optimization
    with Self-Training
    Synthetic Data
    with different
    difficulty levels
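
    A high-level sketch of self-training in this spirit: a model trained on the labeled dialogues pseudo-labels the unlabeled ones, and the synthetic pairs are folded back into training from easier to harder. The helper callables and the difficulty-based ordering are assumptions for illustration, not the paper's exact API.

    from typing import Callable, List, Tuple

    def self_train(
        train_fn: Callable[[List[Tuple[str, str]]], None],   # fits the model on (dialogue, summary) pairs
        summarize_fn: Callable[[str], str],                   # generates a summary with the current model
        difficulty_fn: Callable[[str, str], float],           # scores how hard a synthetic pair is
        labeled: List[Tuple[str, str]],
        unlabeled: List[str],
        n_rounds: int = 3,
    ) -> None:
        """Self-training sketch: pseudo-label unlabeled dialogues, then retrain,
        presenting easier synthetic pairs before harder ones."""
        train_fn(labeled)                                          # start from the gold data
        for _ in range(n_rounds):
            synthetic = [(d, summarize_fn(d)) for d in unlabeled]  # pseudo-summaries
            synthetic.sort(key=lambda pair: difficulty_fn(*pair))  # easy-to-hard ordering
            train_fn(labeled + synthetic)                          # retrain on gold + synthetic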


  22. Experiments:
    Main Results
    In Paper: Evaluation on SamSum dataset.
    Human Evaluation of Fluency, Informativeness, Relevance


  23. Experiments:
    Few-Shot Results on SAMSum
    In Paper:
    Further Analyses and Comparisons


  24. Experiments:
    Ablation Study
    ICSI Dev. Set


  25. Example


  26. Curriculum Prompt Learning with Self-Training
    for Abstractive Dialogue Summarization
    Changqun Li, Linlin Wang, Xin Lin, G. de Melo, Liang He
    Contact:
    [email protected]
    http://gerard.demelo.org
    http://dialoguesystems.org/
    Dialogue Summarization
    benefits from:
    Prompt Perturbation
    with Curriculum Schedule
    Topic Prompts for the
    Decoder
    Prompt Optimization
    with Self-Training
    gdemelo
    gdm3000
    @[email protected]


  27. Details: Datasets
