Slide 14
Slide 14 text
5.2 Hyperparameters
14
連続ミニバッチ,330 epoch (early stopping)
学習率:0.01 to 0.001 (η = ( + )−, = 0.5)
RMSprop:λ = 10−5, α = 0.99
BASE:125 hidden units, 100-D word embedding,
25-D POS embedding, Dropout rate 0.33
BASE++:200 hidden units
+SHARED:200 hidden units, Dropout rate 0.66