Tokyo BISH Bash #5の発表資料
紹介した論文:
[Toshniwal et al. 2018] Toshniwal, S., Kannan, A., Chiu, C.C., Wu, Y., Sainath, T.N. and Livescu, K., 2018, December. A comparison of techniques for language model integration in encoder-decoder speech recognition. In 2018 IEEE spoken language technology workshop (SLT) (pp. 369-375). IEEE.
[Variani et al. 2020] Variani, E., Rybach, D., Allauzen, C. and Riley, M., 2020, May. Hybrid autoregressive transducer (HAT). In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 6139-6143). IEEE.
[Pundak et al. 2018] Pundak, G., Sainath, T.N., Prabhavalkar, R., Kannan, A. and Zhao, D., 2018, December. Deep context: End-to-end contextual speech recognition. In 2018 IEEE spoken language technology workshop (SLT) (pp. 418-425). IEEE.
[Zhao et al. 2019]: Zhao, D., Sainath, T.N., Rybach, D., Rondon, P., Bhatia, D., Li, B. and Pang, R., 2019. Shallow-Fusion End-to-End Contextual Biasing. In Interspeech (pp. 1418-1422).
[Jain et al. 2020] Jain, M., Keren, G., Mahadeokar, J., Zweig, G., Metze, F. and Saraf, Y., 2020. Contextual RNN-T for open domain asr. In Interspeech.