Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Optimisation of short memory strategies in the ...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Nikoleta
June 04, 2017
Science
58
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Optimisation of short memory strategies in the Iterated Prisoners Dilemma
Wales Mathematics Colloquium 2017.
Nikoleta
June 04, 2017
More Decks by Nikoleta
See All by Nikoleta
A trip to earth science with python as a companion
nikoletav3
0
54
Arcas: Using Python to access open research literature
nikoletav3
1
180
Testing Research Software
nikoletav3
0
340
Arcas
nikoletav3
0
510
SSI Selection Day
nikoletav3
0
420
SWORDS-03-10-2016
nikoletav3
0
53
PyCon UK 2016
nikoletav3
0
170
Other Decks in Science
See All in Science
不動産業界における業界特化のデータ整備とAI活用 ─Vertical DataとVertical AI─
estie
1
550
(CVPR2026) Back to Basics: Let Denoising Generative Models Denoise
shumpei777
0
130
データベース02: データベースの概念
trycycle
PRO
2
1.2k
データベース04: SQL (1/3) 単純質問 & 集約演算
trycycle
PRO
0
1.5k
なぜエネルギーは保存する? 〜自由落下でわかる“対称性”とネーターの定理〜
syotasasaki593876
0
180
「遂行理論の未来」(松島斉教授最終講義記念セッションの発表資料)
shunyanoda
0
910
データベース01: データベースを使わない世界
trycycle
PRO
1
1.3k
CVPR2026_VGGTとその仲間たち
mickey_0226
0
800
知能とはなにか -ヒトとAIのあいだ-
tagtag
PRO
1
100
(メタ)科学コミュニケーターからみたAI for Scienceの同床異夢
rmaruy
0
240
防災デジタル分野での官民共創の取り組み (1)防災DX官民共創をどう進めるか
ditccsugii
0
660
機械学習 - 決定木からはじめる機械学習
trycycle
PRO
0
1.5k
Featured
See All Featured
Designing for humans not robots
tammielis
254
26k
Keith and Marios Guide to Fast Websites
keithpitt
413
23k
Marketing to machines
jonoalderson
1
5.4k
Primal Persuasion: How to Engage the Brain for Learning That Lasts
tmiket
0
360
HU Berlin: Industrial-Strength Natural Language Processing with spaCy and Prodigy
inesmontani
PRO
0
410
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
31
3.2k
Visual Storytelling: How to be a Superhuman Communicator
reverentgeek
2
560
AI Search: Where Are We & What Can We Do About It?
aleyda
0
7.6k
Unsuck your backbone
ammeep
672
58k
Groundhog Day: Seeking Process in Gaming for Health
codingconduct
0
200
Digital Ethics as a Driver of Design Innovation
axbom
PRO
1
310
What does AI have to do with Human Rights?
axbom
PRO
1
2.2k
Transcript
Optimisation of short memory strategies in the Iterated Prisoners Dilemma
Nikoleta E. Glynatsi Supervised by: Dr. Vincent Knight Dr. Jonathan Gillard
(3, 3) (0, 5) (5, 0) (1, 1)
(3, 3) (0, 5) (5, 0) (1, 1) (R, P,
S, T) = (3, 1, 0, 5)
1950 1955 1960 1965 1970 1975 1980 1985 1990 1995
2000 2005 2010 2015 0 20 40 60 80 100 number of records Articles per Year (N=1145)
CC CD DC DD C D C D C D
C D p1 1 − p1 p2 1 − p2 p3 1 − p3 p4 1 − p4 p = (p1 , p2 , p3 , p4 ) ∈ R4 [0,1]
Christopher Lee, Marc Harper, and Dashiell Fryer. The art of
war: Beyond memory-one strategies in population games. 2015.
How good are memory one strategies ?
CC CD DC DD
M = p1 q1 p1 (−q1
+ 1) q1 (−p1 + 1) (−p1 + 1)(−q1 + 1) p2 q3 p2 (−q3 + 1) q3 (−p2 + 1) (−p2 + 1)(−q3 + 1) p3 q2 p3 (−q2 + 1) q2 (−p3 + 1) (−p3 + 1)(−q2 + 1) p4 q4 p4 (−q4 + 1) q4 (−p4 + 1) (−p4 + 1)(−q4 + 1)
maxp uq (p) such that p ∈ R4 [0,1]
Lemma uq(p) = 1 2 pQpT + cT p +
a 1 2 p ¯ QpT + ¯ cT p + ¯ a Q, ¯ Q ∈ R4×4 c, ¯ c ∈ R4×1 a, ¯ a ∈ R
maxp uq (p) such that p ∈ R4 [0,1]
maxp uq (p) such that p ∈ R4 [0,1] subject
to p1 = p2 = p3 = p4 = p
Lemma uq(p) = n2p2 + n1p + n0 d1p +
d0 n2 = −(q1 − q2 − 2q3 + 2q4) n1 = −q1 + 2q2 + 5q3 − 7q4 − 1 n0 = q2 − 5q4 − 1 d1 = q1 − q2 − q3 + q4 d0 = q2 − q4 − 1
q = 1, 1, 0, 2 3 0 1 p
0 1 2 3 4 5 theoretic simulated
q = 1, 1, 0, 2 3 0 1 p
0 1 2 3 4 5 theoretic simulated uq (p) = −4p2 3 + 14p 3 − 10 3 2p 3 − 2 3
q = 1, 1, 0, 2 3 0 1 p
0 1 2 3 4 5 theoretic simulated uq (p) = −4p2 3 + 14p 3 − 10 3 2p 3 − 2 3 = −2p + 5
q = 1, 0, 1, 1 3 0 1 p
0 1 2 3 4 5 theoretic simulated
q = 1, 0, 1, 1 3 0 1 p
0 1 2 3 4 5 theoretic simulated uq (p) = p2 3 + 8p 3 − 10 3 p 3 − 4 3
q = 1, 0, 1, 1 3 0 1 p
0 1 2 3 4 5 theoretic simulated uq (p) = p2 3 + 8p 3 − 10 3 p 3 − 4 3 = p + 2
q = 2 3 , 0, 2 3 , 1
3 0 1 p 0 1 2 3 4 5 theoretic simulated
q = 2 3 , 0, 2 3 , 1
3 0 1 p 0 1 2 3 4 5 theoretic simulated uq (p) = 2p 3 − 8 3 p 3 − 4 3
q = 2 3 , 0, 2 3 , 1
3 0 1 p 0 1 2 3 4 5 theoretic simulated uq (p) = 2p 3 − 8 3 p 3 − 4 3 = 2
q = 2 3 , 1 3 , 1 3
, 0 0 1 p 0 1 2 3 4 5 theoretic simulated
q = 2 3 , 1 3 , 1 3
, 0 0 1 p 0 1 2 3 4 5 theoretic simulated uq (p) = p2 3 − 2p 3 − 2 3 −2 3
q = 2 3 , 1 3 , 1 3
, 0 0 1 p 0 1 2 3 4 5 theoretic simulated uq (p) = p2 3 − 2p 3 − 2 3 −2 3 = − p2 2 + p + 1
Lemma (Indifferent) −q1 + q2 + 2q3 − 2q4 =
0 and (q2 − q4 − 1)(q1 − 2q2 − 5q3 + 7q4 + 1) − (q2 − 5q4 − 1)(q1 − q2 − q3 + q4 ) = 0. Proof. uq (p) = n2 p2 + n1 p + n0 d1 p + d0 = a0 n2 p2 + n1 p + n0 = a0 d1 p + a0 d0 n2 = 0 n1 d0 = d1 n0
Lemma (Linear) (q1 q4 − q2 q3 + q3 −
q4 )(4q1 − 3q2 − 4q3 + 3q4 − 1) = 0 Proof. uq (p) = n2 p2 + n1 p + n0 d1 p + d0 = a1 p + a0 n2 p2 + n1 p + n0 = a1 d1 p2 + (d1 a0 + a1 d0 )p + a0 d0 n2 = d1 a1 n1 d0 = d1 n0 + a1 d0
Lemma (Quadratic) (q1 − q2 − q3 + q4 )
= 0, (q1 q4 − q2 q3 + q3 − q4 )(4q1 − 3q2 − 4q3 + 3q4 − 1) = 0 and q2 − q4 − 1 = 0 Proof. uq (p) = n2 p2 + n1 p + n0 d1 p + d0 = a2 p2 + a1 p + a0 n2 p2 + n1 p + n0 = d1 a2 p3 + (a1 d1 + d0 a2 )p2 + (d1 a0 + a1 d0 )p + a0 d0 a1 d1 = 0 n2 = d1 a1 + d0 an2 n1 d0 = d1 n0 + a1 d0
du dp = m2 p2 + m1 p + m0
(d1 p + d0 )2 p uq p− p+ p uq p− p+ p uq p− p+ p uq p− p+
Theorem (Optimization of purely random player) Sq = 0, p±
, 1 0 < p± < 1, p± = −d0 d1 p∗ = argmax p∈Sq uq (p)
q = 7 8 , 7 16 , 3 8
, 0 0.0 0.2 0.4 0.6 0.8 1.0 p 0 1 2 3 4 5 theoretical p* simulated
q = 1 3 , 2 3 , 1, 0
0.0 0.2 0.4 0.6 0.8 1.0 p 0 1 2 3 4 5 theoretical p* simulated
q(1), q(2), q(3) . . . q(N) max p 1
N N i=1 uq (i)(p)
q(1), q(2), q(3) . . . q(N) max p 1
N N i=1 uq (i)(p) max p u 1 N N i=1 q(i) (p)
0.0 0.2 0.4 0.6 0.8 1.0 0 1 2 3
4 5 Tournament size N=9 q u p* simulated
p∗ = argmaxS q(1),...,q(n) u(p) where, | Sq(1),...,q(n) |≤ 2N
+ 2
p∗ = argmaxS q(1),...,q(n) u(p) where, | Sq(1),...,q(n) |≤ 2N
+ 2 @NikoletaGlyn https://github.com/Nikoleta-v3