houmei
April 24, 2018
140

# ぼくのかんがえたさいきょうCPU 2018 ベクトル演算

April 24, 2018

## Transcript

2. ### ಺༰ • ΅͘ͷ͔Μ͕͍͖͑ͨ͞ΐ͏CPUͷղઆ • Ϩʔϯؒͷԋࢉʹ͍ͭͯʢϕΫτϧʣ twitter : @houmei blog :

஛ԼੈքౝͷܭࢉػΑ΋΍·࿩ 18೥4݄24೔Ր༵೔
3. ### ϕΫτϧͱεΧϥ ɾσʔλશମΛ2ⁿόΠτ୯ҐͰ෼ׂͨ͠ ۠ըΛϨʔϯͱݺͿ ɾϨʔϯ͸͢΂ͯಉ͡ܕɺಉ͡αΠζ ɾϨʔϯ͕ͻͱ͚ͭͩͷ΋ͷΛεΧϥɺ ෳ਺͋Δ΋ͷΛϕΫτϧͱݺͿ FP32 FP32 FP32 FP32

lane 0 lane 1 lane 2 lane 3 0 31 63 95 127 18೥4݄24೔Ր༵೔

5. ### ϕΫτϧ×ϕΫτϧʢ̍ʣ ɾRaͷϨʔϯ਺͕RbͷϨʔϯ਺ͱಉ͡৔߹ →ରԠ͢ΔϨʔϯͲ͏͠Ͱԋࢉ Lane3 D d D+d Lane2 C c

C+c Lane1 B ʴ b = B+b Lane0 A a A+a Ra Rb Rd 18೥4݄24೔Ր༵೔
6. ### ϕΫτϧ×ϕΫτϧʢ̎ʣ ɾRaͷϨʔϯ਺͕RbΑΓগͳ͍৔߹ →ରԠ͢ΔRbͷԼҐϨʔϯͲ͏͠Ͱԋࢉ Lane3 d Lane2 c Lane1 B ʴ

b = B+b Lane0 A a A+a Ra Rb Rd 18೥4݄24೔Ր༵೔
7. ### ϕΫτϧ×ϕΫτϧʢ̏ʣ ɾRaͷϨʔϯ਺͕RbΑΓଟ͍৔߹ →RbͷϨʔϯΛ܁Γฦ͠ద༻͠ԋࢉ Lane3 D D+b Lane2 C C+a Lane1

B ʴ b = B+b Lane0 A a A+a Ra Rb Rd 18೥4݄24೔Ր༵೔
8. ### ϕΫτϧ×εΧϥ ɾRaͷϨʔϯ਺͕RbΑΓଟ͍৔߹ͱಉ͡ →RbΛ܁Γฦ͠ద༻͠ԋࢉ Lane3 D D+a Lane2 C C+a Lane1

B B+a Lane0 A ʴ a = A+a Ra Rb Rd 18೥4݄24೔Ր༵೔
9. ### εΧϥ×ϕΫτϧ ɾRaͷϨʔϯ਺͕RbΑΓগͳ͍৔߹ͱಉ͡ →Rbͷ࠷ԼҐϨʔϯͰԋࢉ Lane3 d Lane2 c Lane1 b Lane0

A ʴ a ʹ A+a Ra Rb Rd 18೥4݄24೔Ր༵೔