Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Char-rnn aurkezpena
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Manex Agirrezabal
March 14, 2016
Research
100
0
Share
Char-rnn aurkezpena
Manex Agirrezabal
March 14, 2016
More Decks by Manex Agirrezabal
See All by Manex Agirrezabal
The Flipped Classroom model for teaching Conditional Random Fields in an NLP course
manexagirrezabal
0
38
NLP for poetry generation and analysis
manexagirrezabal
0
81
Institut seminar 2020
manexagirrezabal
0
40
Automatic Scansion of Poetry (KU)
manexagirrezabal
0
670
RANLP talk
manexagirrezabal
0
79
Defense (Final version)
manexagirrezabal
0
75
Poesiaren eskantsio automatikoa: Bi hizkuntzen azterketa
manexagirrezabal
0
79
CodeFEST literature presentation
manexagirrezabal
0
66
Ongoing work (in mid 2016)
manexagirrezabal
0
28
Other Decks in Research
See All in Research
ブレグマン距離最小化に基づくリース表現量推定:バイアス除去学習の統一理論
masakat0
0
230
2026年3月1日(日)福島「除染土」の公共利用をかんがえる
atsukomasano2026
0
540
第66回コンピュータビジョン勉強会@関東 Epona: Autoregressive Diffusion World Model for Autonomous Driving
kentosasaki
0
570
typst の使い方:言語学を研究する学生のために
gitomochang
0
360
AIスーパーコンピュータにおけるLLM学習処理性能の計測と可観測性 / AI Supercomputer LLM Benchmarking and Observability
yuukit
1
830
Ankylosing Spondylitis
ankh2054
0
160
AIエージェント時代のLLM-jpモデルのあるべき姿
k141303
0
290
[Devfest Incheon 2025] 모두를 위한 친절한 언어모델(LLM) 학습 가이드
beomi
2
1.5k
データサイエンティストの業務変化
datascientistsociety
PRO
0
370
データサイエンティストをめぐる環境の違い2025年版〈一般ビジネスパーソン調査の国際比較〉
datascientistsociety
PRO
0
1.2k
COFFEE-Japan PROJECT Impact Report(海ノ向こうコーヒー)
ontheslope
0
1.4k
視覚から身体性を持つAIへ: 巧緻な動作の3次元理解
tkhkaeio
1
250
Featured
See All Featured
Understanding Cognitive Biases in Performance Measurement
bluesmoon
32
2.9k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
122
21k
VelocityConf: Rendering Performance Case Studies
addyosmani
333
25k
svc-hook: hooking system calls on ARM64 by binary rewriting
retrage
2
220
Building a Modern Day E-commerce SEO Strategy
aleyda
45
9k
技術選定の審美眼(2025年版) / Understanding the Spiral of Technologies 2025 edition
twada
PRO
118
110k
New Earth Scene 8
popppiees
3
2.1k
Marketing Yourself as an Engineer | Alaka | Gurzu
gurzu
0
180
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
16
1.9k
Organizational Design Perspectives: An Ontology of Organizational Design Elements
kimpetersen
PRO
1
680
Rails Girls Zürich Keynote
gr2m
96
14k
The Anti-SEO Checklist Checklist. Pubcon Cyber Week
ryanjones
0
120
Transcript
Poesiaren metrika DL bidez Manex Agirrezabal https://github.com/manexagirrezabal/char-rnn/
Proba ezberdinak TensorFlow: Sequence-to-sequence models https://www.tensorflow.org/versions/master/tutorials/seq2seq/index.html Torch: char-rnn (Andrew Karpathy)
https://github.com/karpathy/char-rnn/
Char-rnn http://karpathy.github.io/2015/05/21/rnn-effectiveness/ Karaktere mailako hizkuntz-ereduak sortzeko balio du. Sarrera gisa
testu hutsa.
Char-rnn Gure beharretarako moldatu behar: to swell the gourd and
plump the ha zel shells - ' - ' - ' - ' - ' wo man much missed how you call to me call to me ' - - ' - - ' - - ' - -
Char-rnn Dataset-a testu soil gisa: To_= swell_+ the_= gourd_+ and_=
plump_+ the_= ha_+ zel_= shells_+ To_= swell_+ the_= gourd_+ and_= plump_+ the_= hazel_+= shells_+ Wo_+ man_= much_= missed_+ how_= you_= call_+ to_= me_= call_+ to_= me_= Woman_+= much_= missed_+ how_= you_= call_+ to_= me_= call_+ to_= me_=
Char-rnn (training) $ th train.lua Parametroak: Model: [RNN, LSTM edo
GRU] rnn_size: LSTMaren (zelda) barruko tamaina num_layers: LSTMaren kapa kopurua seq_length: sekuentzian ikasteko karaktere kopurua
Char-rnn (prediction) $ th sample(mod).lua Parametroak: Model: eredu entrenatua Primetext:
sarrera testua (_ karakterearekin amaituta)
Char-rnn (prediction) Python programa bat (callSampleMod.py) aurreko programari deitzeko pausuz
pausu: $ th sampleMod.lua model M1 primetext “to_” = $ th sampleMod.lua model M1 primetext “to_= swell_” + $ th sampleMod.lua model M1 primetext “to_= swell_+ the_” = ...
Char-rnn (prediction) Arazoa: Hasieran, informazio gutxi duenez, batzuetan hanka sartzen
(+ propagatzen) du predikzioan. Adibidez, “to_” sarrerarekin Horrentzako soluzioa, predikzioa bi aldetara egitea.
Char-rnn (FW) Parametroak optimizatu nahi ditugu (seq_length, batch_size, rnn_size, ...)
Embedding-ak erabili nahi ditugu, baina gure hipotesia da ez dutela asko lagunduko.