Shallow vs. Deep Sum-Product Networks [Olivier Delalleau and Yoshua Bengio, 2011] https://papers.nips.cc/paper/4350-shallow-vs-deep-sum-product-networks ಉ͡χϡʔϩϯΛ૿͢ͳΒ Λ͘͢ΔΑΓͷΛ૿͢ํ͕ ΑΓෳࡶͳؔΛදݱͰ͖Δͱ͢Δจ
ReLU ׆ੑԽ͍ͯ͠ΔݶΓେ͖ͳޯ ׆ੑԽ͍ͯ͠Δχϡʔϩϯ͕ͭͰ͋Ε ׆ੑԽ͕ؔݪҼͰޯ͕ࣦΘΕΔ͜ͱ ແ͘ͳͬͨ Rectified linear units improve restricted boltzmann machines [Vinod Nair and Geoffrey E. Hinton, 2010] https://dl.acm.org/citation.cfm?id=3104425
ϒϩοΫΛ௨͖ͬͯͨͱ ϒϩοΫͷೖྗΛࠞͥͯग़ྗ͢Δ ResNet Deep Residual Learning for Image Recognition [Kaiming He and Xiangyu Zhang and Shaoqing Ren and Jian Sun, 2015] https://arxiv.org/abs/1512.03385