Slide 48
Synthetic Experiment: Setup
● Baselines
○ DM, IPS, DR (other baselines are compared in the paper)
○ MIPS (estimated weights) and MIPS (true)
● Basic Setting
○ n=10,000, |A|=1,000 (much larger than typical OPE experiments)
○ continuous rewards with some Gaussian noise
○ 3-dimensional categorical action embeddings
where the cardinality of each dimension is 10, i.e., |E|=10^3=1,000
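The basic setting above can be sketched in NumPy roughly as follows. This is only an illustration under simplifying assumptions that are not stated on the slide: there is no context, each action maps deterministically to one embedding vector, the expected reward is an additive function of the embedding, and the logging policy `pi_0` is a softmax over random logits. All variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

n = 10_000         # number of logged samples
n_actions = 1_000  # |A|
emb_dim = 3        # number of categorical embedding dimensions
emb_card = 10      # cardinality of each dimension, so |E| = 10**3 = 1,000

# Assumption: every action has one fixed 3-dimensional categorical embedding.
action_emb = rng.integers(0, emb_card, size=(n_actions, emb_dim))

# Assumption: the expected reward is a sum of per-dimension effects,
# so it depends on the action only through its embedding.
effects = rng.normal(size=(emb_dim, emb_card))
q = effects[np.arange(emb_dim), action_emb].sum(axis=1)  # shape (n_actions,)

# Assumption: the logging policy is a softmax over random logits.
logits = rng.normal(size=n_actions)
pi_0 = np.exp(logits - logits.max())
pi_0 /= pi_0.sum()

# Logged data: actions, their embeddings, and continuous rewards
# with additive Gaussian noise.
actions = rng.choice(n_actions, size=n, p=pi_0)
embeddings = action_emb[actions]
rewards = q[actions] + rng.normal(scale=1.0, size=n)
```

With 1,000 actions and only 10,000 samples, each action is observed about ten times on average, which is what makes this setting hard for estimators that weight individual actions.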
MIPS (true) provides the best achievable accuracy.
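As a rough illustration of two of the estimators being compared, the sketch below computes IPS and MIPS with true marginal weights on toy data of the same size. It is not the experiment code from the paper. It assumes a context-free problem, a deterministic embedding per action, rewards that depend on the action only through its embedding, and softmax policies over random logits; DM, DR, and MIPS with estimated weights are omitted. All names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n, n_actions, emb_dim, emb_card = 10_000, 1_000, 3, 10
n_emb = emb_card ** emb_dim  # |E| = 1,000


def softmax(z):
    z = z - z.max()
    p = np.exp(z)
    return p / p.sum()


# Assumption: one fixed embedding per action, encoded as an integer id.
action_emb = rng.integers(0, emb_card, size=(n_actions, emb_dim))
code = action_emb @ emb_card ** np.arange(emb_dim)  # values in [0, n_emb)

# Assumption: expected reward depends only on the embedding id.
g = rng.normal(size=n_emb)
q = g[code]

pi_0 = softmax(rng.normal(size=n_actions))  # logging policy
pi = softmax(rng.normal(size=n_actions))    # target policy

# Logged data collected under pi_0, with Gaussian reward noise.
a = rng.choice(n_actions, size=n, p=pi_0)
r = q[a] + rng.normal(size=n)

# IPS: weight each sample by pi(a) / pi_0(a).
w_a = pi / pi_0
v_ips = np.mean(w_a[a] * r)

# MIPS (true): weight each sample by p(e | pi) / p(e | pi_0),
# the ratio of marginal embedding distributions under the two policies.
p_e_pi = np.bincount(code, weights=pi, minlength=n_emb)
p_e_pi0 = np.bincount(code, weights=pi_0, minlength=n_emb)
w_e = np.divide(p_e_pi, p_e_pi0, out=np.zeros_like(p_e_pi), where=p_e_pi0 > 0)
v_mips = np.mean(w_e[code[a]] * r)

v_true = pi @ q  # ground-truth value of the target policy
print(f"true={v_true:.4f}  IPS={v_ips:.4f}  MIPS(true)={v_mips:.4f}")
print(f"max action weight={w_a.max():.2f}  max marginal weight={w_e.max():.2f}")
```

Each marginal weight is a `pi_0`-weighted average of the action weights that share the same embedding, so the largest marginal weight can never exceed the largest action weight. This is the intuition for why MIPS tends to have lower variance when many actions share embeddings.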