Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
An Effective Approach to Unsupervised Machine T...
Search
Ryusuke_Tanaka
November 21, 2019
Technology
0
110
An Effective Approach to Unsupervised Machine Translationの紹介
An Effective Approach to Unsupervised Machine Translationの紹介です。
教師なし翻訳に関するお話です。
Ryusuke_Tanaka
November 21, 2019
Tweet
Share
More Decks by Ryusuke_Tanaka
See All by Ryusuke_Tanaka
医師向けQAサイトのための推薦システム開発
ryusuketa
1
1.6k
Universal Decompositional Semantics on Universal Dependencies
ryusuketa
0
75
Learning Dual Retrieval Module for Semi-supervised Relation Extractionの紹介
ryusuketa
0
69
動画視聴を整数倍(最大値)で_効率化するchrome extension作った
ryusuketa
0
72
双曲空間への単語埋め込みと QAサービスでの自然言語処理を 用いた推薦システムについて
ryusuketa
0
530
Other Decks in Technology
See All in Technology
TanStack Start 技術選定の裏側 / Findy-Lunch-LT-TanStack-Start
iktakahiro
1
170
使えるデータ基盤を作る技術選定の秘訣 / selecting-the-right-data-technology
pei0804
10
1.6k
問 1:以下のコンパイラを証明せよ(予告編) #kernelvm / Kernel VM Study Kansai 11th
ytaka23
3
630
kernelvm-brain-net
raspython3
0
660
計測による継続的なCI/CDの改善
sansantech
PRO
7
2.1k
経済メディア編集部の実務に小さく刺さるAI / small-ai-with-editorial
nkzn
2
480
Google Cloud Next 2025 Recap マーケティング施策の運用及び開発を支援するAIの活用 / Use of AI to support operation and development of marketing campaign
atsushiyoshikawa
0
350
スイッチのBMC、つかってますか?
sonic
0
370
ユーザーコミュニティが海外スタートアップのDevRelを補完する瞬間
nagauta
1
200
Cursorを全エンジニアに配布 その先に見据えるAI駆動開発の未来 / 2025-05-13-forkwell-ai-study-1-cursor-at-loglass
itohiro73
2
730
VitePress & MCPでアプリ仕様のオープン化に挑戦する
hal_spidernight
0
130
AIフレンドリーなプロダクト開発を目指して 〜MCPを橋渡しにした環境移行〜
shinpr
0
130
Featured
See All Featured
Keith and Marios Guide to Fast Websites
keithpitt
411
22k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
5
580
It's Worth the Effort
3n
184
28k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
251
21k
Rails Girls Zürich Keynote
gr2m
94
13k
Intergalactic Javascript Robots from Outer Space
tanoku
271
27k
The Invisible Side of Design
smashingmag
299
50k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
PRO
19
1.2k
The Pragmatic Product Professional
lauravandoore
33
6.6k
VelocityConf: Rendering Performance Case Studies
addyosmani
329
24k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
53k
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
656
60k
Transcript
An Effective Approach to Unsupervised Machine Translation
None
?/= 8E 45": 3'209 40G0AIoT< :+;F<%$6 B@-(,F.
!)#7*&>12FM2 D1 CD!)#7ML
Unsupervised Machine Translation • 87=@16Statistical Machine Translation (SMT) Neural Machine
Translation (NMT))(95/&%$ ◦ .@.0:2?>! • -B"*< .@;3,A=@4+ ◦ Word translation without parallel data.[Alexis 2017], ◦ Learning bilingual word embeddings with (almost) no bilingual data [Artetxe 2017] • !#'5/ 87=@ NMT>!4+ ◦ UNSUPERVISED MACHINE TRANSLATION USING MONOLINGUAL CORPORA ONLY [Lample2018] ◦ Unsupervised statistical machine translation [Artetxe 2018]
Supervised Machine Translation NMT Back-translation !
#"BLEU http://deeplearning.hatenablog.com/entry/back_translation#f-726c04a7
!! • D8?8B;=/@[Alexis 2017] ◦ /@*;="%$#1: ◦ ;=B/@)3& A404 6
- 5.+=A'9C9 7> ◦ +=A( , +=2<EF
SMT https://www.nhk.or.jp/strl/publica/rd/rd168/pdf/P14-25.pdf
' 1. % $ 2. &! 3. SMT$
" 4. " refinement 5. NMT(#
&9 3+ • bi-gram embedding+A8: #6>$<[Artetxe 2018] • :
100=0/ softmax &952"* (e,f8: 4 :, τ1( ?.',%!7 ) ;- …@@
2<0K,A • 3N*6 5/2<0KPO • ex. “Sunday Telegraph”
→ “The Times of London” • =H. %'#& $"&MQ4 R(8-C WaveNet:1D+@9> IF !) 2<G@7JB; LS 7JE?/ T
Unsupervised SMT • Back-translation.CE/;> ◦ DF%"&*8L @3 DFB<+4DF%"&.C •
9H7Cycle GAN !#K65= ◦ -:02I ?HA M 1 : DF'! : ,G(#'$)'! : DF7J'!
+% • '$ SMT+% .0 .0 (), +% • SMT+%
.0!/1-*&# ()2"
NMT$ • "SMT$ %# NMT$ • % NMT#
: SMT%! : NMT%!
WMT2014 seq2seq
…