Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
ぼくのかんがえたさいきょうCPU 2018 ベクトル演算
Search
houmei
April 24, 2018
Technology
0
170
ぼくのかんがえたさいきょうCPU 2018 ベクトル演算
houmei
April 24, 2018
Tweet
Share
More Decks by houmei
See All by houmei
ぼくのかんがえたさいきょうCPU 2018 スカラ演算
houmei
0
150
ぼくのかんがえたさいきょうCPU 2018 DATA
houmei
0
150
2018-BKSC-ALU
houmei
0
290
2017 CPU Architeciture
houmei
0
310
Other Decks in Technology
See All in Technology
Oracle Audit Vault and Database Firewall 20 概要
oracle4engineer
PRO
3
1.7k
データプラットフォーム技術におけるメダリオンアーキテクチャという考え方/DataPlatformWithMedallionArchitecture
smdmts
5
650
AIエージェント最前線! Amazon Bedrock、Amazon Q、そしてMCPを使いこなそう
minorun365
PRO
15
5.4k
KubeCon + CloudNativeCon Japan 2025 Recap by CA
ponkio_o
PRO
0
140
米国国防総省のDevSecOpsライフサイクルをAWSのセキュリティサービスとOSSで実現
syoshie
2
1.2k
2025-06-26_Lightning_Talk_for_Lightning_Talks
_hashimo2
2
100
PHP開発者のためのSOLID原則再入門 #phpcon / PHP Conference Japan 2025
shogogg
4
890
How Community Opened Global Doors
hiroramos4
PRO
1
120
監視のこれまでとこれから/sakura monitoring seminar 2025
fujiwara3
11
4k
解析の定理証明実践@Lean 4
dec9ue
0
180
25分で解説する「最小権限の原則」を実現するための AWS「ポリシー」大全 / 20250625-aws-summit-aws-policy
opelab
9
1.2k
GeminiとNotebookLMによる金融実務の業務革新
abenben
0
240
Featured
See All Featured
The MySQL Ecosystem @ GitHub 2015
samlambert
251
13k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
107
19k
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
17
950
Keith and Marios Guide to Fast Websites
keithpitt
411
22k
A Tale of Four Properties
chriscoyier
160
23k
The Cost Of JavaScript in 2023
addyosmani
51
8.5k
Principles of Awesome APIs and How to Build Them.
keavy
126
17k
Unsuck your backbone
ammeep
671
58k
GraphQLとの向き合い方2022年版
quramy
49
14k
Fireside Chat
paigeccino
37
3.5k
Producing Creativity
orderedlist
PRO
346
40k
It's Worth the Effort
3n
185
28k
Transcript
΅͘ͷ͔Μ͕͑ͨ ͍͖͞ΐ͏CPU 2018.4.24 @houmei ϕΫτϧԋࢉ 184݄24Ր༵
༰ • ΅͘ͷ͔Μ͕͍͖͑ͨ͞ΐ͏CPUͷղઆ • Ϩʔϯؒͷԋࢉʹ͍ͭͯʢϕΫτϧʣ twitter : @houmei blog :
ԼੈքౝͷܭࢉػΑ· 184݄24Ր༵
ϕΫτϧͱεΧϥ ɾσʔλશମΛ2ⁿόΠτ୯ҐͰׂͨ͠ ۠ըΛϨʔϯͱݺͿ ɾϨʔϯͯ͢ಉ͡ܕɺಉ͡αΠζ ɾϨʔϯ͕ͻͱ͚ͭͩͷͷΛεΧϥɺ ෳ͋ΔͷΛϕΫτϧͱݺͿ FP32 FP32 FP32 FP32
lane 0 lane 1 lane 2 lane 3 0 31 63 95 127 184݄24Ր༵
ݪଇ (1)ԋࢉ݁ՌRdͷσʔλαΠζɺϨʔϯɺܕ Ұ൪ͷιʔε(Ra)ʹ߹ΘͤΔ (2)ೋ൪ͷιʔε(Rb)ͷϨʔϯ͕RaͷϨʔϯ ΑΓগͳ͍߹܁Γฦ͠ద༻ (3)ೋ൪ͷιʔε(Rb)ͷϨʔϯ͕RaͷϨʔϯ ΑΓଟ͍߹ԼҐͷϨʔϯΛద༻ (4)Ϩʔϯؒͷԋࢉʹ͍ͭͯεΧϥԋࢉͷϧʔ ϧΛద༻ 184݄24Ր༵
ϕΫτϧ×ϕΫτϧʢ̍ʣ ɾRaͷϨʔϯ͕RbͷϨʔϯͱಉ͡߹ →ରԠ͢ΔϨʔϯͲ͏͠Ͱԋࢉ Lane3 D d D+d Lane2 C c
C+c Lane1 B ʴ b = B+b Lane0 A a A+a Ra Rb Rd 184݄24Ր༵
ϕΫτϧ×ϕΫτϧʢ̎ʣ ɾRaͷϨʔϯ͕RbΑΓগͳ͍߹ →ରԠ͢ΔRbͷԼҐϨʔϯͲ͏͠Ͱԋࢉ Lane3 d Lane2 c Lane1 B ʴ
b = B+b Lane0 A a A+a Ra Rb Rd 184݄24Ր༵
ϕΫτϧ×ϕΫτϧʢ̏ʣ ɾRaͷϨʔϯ͕RbΑΓଟ͍߹ →RbͷϨʔϯΛ܁Γฦ͠ద༻͠ԋࢉ Lane3 D D+b Lane2 C C+a Lane1
B ʴ b = B+b Lane0 A a A+a Ra Rb Rd 184݄24Ր༵
ϕΫτϧ×εΧϥ ɾRaͷϨʔϯ͕RbΑΓଟ͍߹ͱಉ͡ →RbΛ܁Γฦ͠ద༻͠ԋࢉ Lane3 D D+a Lane2 C C+a Lane1
B B+a Lane0 A ʴ a = A+a Ra Rb Rd 184݄24Ր༵
εΧϥ×ϕΫτϧ ɾRaͷϨʔϯ͕RbΑΓগͳ͍߹ͱಉ͡ →Rbͷ࠷ԼҐϨʔϯͰԋࢉ Lane3 d Lane2 c Lane1 b Lane0
A ʴ a ʹ A+a Ra Rb Rd 184݄24Ր༵
ଈͷѻ͍ • ԋࢉͷୈೋΦϖϥϯυଈͷࢦఆ͕Մೳ • ଈͷεΧϥσʔλͱͯ͠ѻΘΕΔ • ଈͷαΠζRaͷϨʔϯͷαΠζʹ߹Θͤූ ߸֦ு͞ΕΔ(.uम০ࢠͰθϩ֦ு) 184݄24Ր༵
ɹ ͭͮ͘ ΅͘ͷ͔Μ͕͍͖͑ͨ͞ΐ͏CPU 184݄24Ր༵