30-sets train/test statistics evaluation

Mission boundary             P       R       F1
Baseline [1]                 0.8194  0.8375  0.8283
Baseline *[2]                0.8219  0.8163  0.8191
Our *[3]                     0.8030  0.8115  0.8073
Fselect (Baseline+Our) *[4]  0.8462  0.8837  0.8645

[1] commonw, prisma, time
[2] lev, wordr, peos_q1, prisma, time_diff, peos_q2, n_subst_X_q2
[3] crf_ih, crf_iml, crf_imt
[4] wordr, commonw, lev, inter_query_time, word_pov, prisma, crf_ih_state, crf_iml_state, crf_imt_state, word_suf
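
As a rough consistency check on the reported figures, the sketch below recomputes F1 from each row's P and R using the standard harmonic-mean formula F1 = 2PR / (P + R). The source does not say how the scores were aggregated over the 30 train/test sets, so applying the formula to the reported averages is an assumption; small deviations could simply reflect per-set averaging or rounding. The names `reported`, `f1_score`, and `check_rows` are illustrative and not taken from the original work.

```python
# Reported (P, R, F1) values, copied from the results table above.
reported = {
    "Baseline [1]": (0.8194, 0.8375, 0.8283),
    "Baseline *[2]": (0.8219, 0.8163, 0.8191),
    "Our *[3]": (0.8030, 0.8115, 0.8073),
    "Fselect (Baseline+Our) *[4]": (0.8462, 0.8837, 0.8645),
}


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    total = precision + recall
    if total == 0:
        return 0.0
    return 2 * precision * recall / total


def check_rows(rows, tol=5e-4):
    """Recompute F1 per row and flag whether it matches the reported value.

    The tolerance is an assumed allowance for four-decimal rounding and
    for averaging over train/test sets; it is not stated in the source.
    """
    results = {}
    for name, (p, r, f1_reported) in rows.items():
        f1_recomputed = f1_score(p, r)
        results[name] = (f1_recomputed, abs(f1_recomputed - f1_reported) <= tol)
    return results


if __name__ == "__main__":
    for name, (f1, ok) in check_rows(reported).items():
        status = "consistent" if ok else "differs"
        print(f"{name}: recomputed F1 = {f1:.4f} ({status})")
```

Under this assumption, a row flagged as "differs" would not necessarily indicate an error in the table, only that the reported F1 was probably not computed directly from the averaged P and R.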