Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
A Word-Complexity Lexicon and A Neural Readabil...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
onizuka laboratory
December 18, 2018
Research
140
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
A Word-Complexity Lexicon and A Neural Readability Ranking Model for Lexical Simplification
弊研究室で行なったEMNLP2018読み会の発表資料です。
onizuka laboratory
December 18, 2018
More Decks by onizuka laboratory
See All by onizuka laboratory
Phrase-Based & Neural Unsupervised Machine Translation
onilab
0
120
Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions
onilab
0
82
Card-660: A Reliable Evaluation Framework for Rare Word Representation Models
onilab
0
43
Integrating Transformer and Paraphrase Rules for Sentence Simplification
onilab
0
66
An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation
onilab
0
62
Generating More Interesting Responses in Neural Conversation Models with Distributional Constraints
onilab
0
110
Modeling Multi-turn Conversation with Deep Utterance Aggregation
onilab
0
100
Learning Semantic Sentence Embeddings using Pair-wise Discriminator
onilab
0
130
SGM: Sequence Generation Model for Multi-Label Classification
onilab
0
87
Other Decks in Research
See All in Research
第12回人と環境にやさしい交通をめざす全国大会/熊本都市圏「車1割削減、渋滞半減、公共交通2倍」をめざして
trafficbrain
0
110
オーストリア流 都市の公共交通サービス水準評価@公共交通オープンデータ最前線2026
trafficbrain
0
180
AGI4OPT:自然言語から数理最適化を導くエ ージェントスキル Translating Human Intent into Mathematical Optimization
mickey_kubo
0
130
コーディングエージェントとABNを再考
hf149
2
700
Any-Optical-Model: A Universal Foundation Model for Optical Remote Sensing
satai
3
810
YOLO26_ Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
satai
3
780
2026 東京科学大 情報通信系 研究室紹介 (大岡山)
icttitech
0
3.7k
人間中心の意思決定支援AI
yukinobaba
PRO
4
2.3k
AIエージェント時代のLLM-jpモデルのあるべき姿
k141303
0
440
「AIとWhyを深堀る」をAIと深堀る
iflection
0
470
敵対生成プロンプト同時探索による内省型プロンプト最適化
kinoue_smarthr
0
110
多様なデータを許容し学習し続ける模倣学習 / Advanced Imitation Learning for VLA
prinlab
0
210
Featured
See All Featured
The AI Search Optimization Roadmap by Aleyda Solis
aleyda
1
5.9k
Fashionably flexible responsive web design (full day workshop)
malarkey
408
66k
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
1.3k
Why You Should Never Use an ORM
jnunemaker
PRO
61
9.9k
The agentic SEO stack - context over prompts
schlessera
0
800
Optimising Largest Contentful Paint
csswizardry
37
3.7k
Build The Right Thing And Hit Your Dates
maggiecrowley
39
3.2k
Google's AI Overviews - The New Search
badams
0
1k
Navigating the moral maze — ethical principles for Al-driven product design
skipperchong
2
380
Noah Learner - AI + Me: how we built a GSC Bulk Export data pipeline
techseoconnect
PRO
0
190
Into the Great Unknown - MozCon
thekraken
41
2.5k
Agile that works and the tools we love
rasmusluckow
331
21k
Transcript
EMNLP A Word-Complexity Lexicon and A Neural Readability Ranking Model
2018/12/18 M1
• 2 • 15000 • SimplePPDB++
2
3 Complex Sentence The cat perched on the mat. Substitution
Generation perched : rested, sat Substitution Ranking #1 : sat, #2 : rested Complex Word Identification The cat perched on the mat. Simplification Sentence The cat sat on the mat.
$,52(% *60#94 -):3 • 60 • $;! '
. • foolishness7 vs folly1 • 60 foolishness • Google Ngram Corpus foolishness/;! • PPDB"&2272 • 21%60 8160 • 14%/;! 760 4 +2
- • Google Ngram Corpus • Wo 15000 • 11
L • 6 5 6 • e p bug n d • C Wo c • 1000 i 2-2.5h • 1 5-7 L • m l 5
- C 2 • 3% • L 0.55 → 0.64
• • ≦0.5 47% • ≦1.0 78% • ≦1.5 93% 6
2 7
• ,/+*23.0! •
SemEval2012$! "% • )-2*15Candidates • $! "% • %'&(30Target300Candidate • #% 171Target1710Candidate 8 TEXT When you think about it, that’s pretty terrible. Target terrible Candidates bad, awful, deplorable
9 P@1 1 S all binning WC R 15000
• PPDB P Ranking model • PPDB • • •
+ + + • PPDB D • 10B S 10
+ 11 SimplePPDB++
Target Candidate • 100 Target Candidate • 2 • Candidate
G • SimplePPDB++ 12
13
• n Target • PPs Candidate • MAP Candidate • P@1 Top1
I • SemEval2016 CWIG3G2 • C WC 14
15
• 2'"#( & • SOTA% • 15000'"#(
• !*$ CWI) • SimplePPDB++ 16