Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
R を用いた分析(補講) (2) — 人工データの生成 / Generating Artifi...
Search
Kenji Saito
PRO
January 25, 2024
Business
0
97
R を用いた分析(補講) (2) — 人工データの生成 / Generating Artificial Data
早稲田大学大学院経営管理研究科「企業データ分析」2023 冬のオンデマンド教材 第11回で使用したスライドです。
Kenji Saito
PRO
January 25, 2024
Tweet
Share
More Decks by Kenji Saito
See All by Kenji Saito
多重比較/相関分析 / Multiple Comparison and Correlation Analysis
ks91
PRO
0
49
アカデミーキャンプ 2025冬「考えるのは奴らだ」 / Academy Camp 2025 Winter - Live and Let Think DAY 3
ks91
PRO
0
35
アカデミーキャンプ 2025冬「考えるのは奴らだ」 / Academy Camp 2025 Winter - Live and Let Think DAY 2
ks91
PRO
0
35
アカデミーキャンプ 2025冬「考えるのは奴らだ」 / Academy Camp 2025 Winter - Live and Let Think DAY 1
ks91
PRO
1
63
インクルーシブな社会へ / Toward an Inclusive Society
ks91
PRO
0
10
P 値と有意差/分散分析 / P-value, Significant Difference and Analysis of Variance
ks91
PRO
0
55
関連2群のt検定/独立2群のt検定 / Related 2-group t-test and independent 2-group t-test
ks91
PRO
0
65
A Guide to Paper Writing Support with Generative AI - A Joint Zemi
ks91
PRO
0
21
正規分布と簡単な統計理論/t分布と信頼区間 / Normal distribution, simple statistical theory, t-distribution and confidence intervals
ks91
PRO
0
52
Other Decks in Business
See All in Business
システム思考ゲーム「ビールゲーム」
chibanba1982
PRO
0
510
면접으로 직행하는 데이터 분석 포트폴리오 | 2025년 1월 세미나
datarian
0
1.1k
プロダクトを次々にPMFさせるためのPlayBook - pmconf2024 落選セッションお披露目会
kubotaku
2
1.3k
Lablup at CES 2024: 우리의 CES 활용법
inureyes
PRO
0
240
企業向け謎解きゲーム「消えた提案書の謎」
chibanba1982
PRO
0
280
【エンジニア採用】BuySell Technologies会社説明資料
buyselltechnologies
3
56k
NewsPicks Expert説明資料 / NewsPicks Expert Introduction
mimir
0
10k
IT業界向けグループワーク「THEクリティカルパス オンライン版」
chibanba1982
PRO
0
390
2024年5月採用広報資料.pdf
gw_recruit
0
880
イークラウド会社紹介 ~ひとりひとりの想いをつなぎ、挑戦に力を~
ecrowd
1
2.4k
セルフケア研修用カードゲーム「攻略! きみのストレスを発見せよ!」
chibanba1982
PRO
0
190
IT業界向けグループワーク「THEクリティカルパス カード版」
chibanba1982
PRO
0
180
Featured
See All Featured
Raft: Consensus for Rubyists
vanstee
137
6.7k
Making the Leap to Tech Lead
cromwellryan
133
9k
Faster Mobile Websites
deanohume
305
30k
Product Roadmaps are Hard
iamctodd
PRO
50
11k
Into the Great Unknown - MozCon
thekraken
34
1.6k
The World Runs on Bad Software
bkeepers
PRO
66
11k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
251
21k
Build The Right Thing And Hit Your Dates
maggiecrowley
33
2.5k
A Philosophy of Restraint
colly
203
16k
Git: the NoSQL Database
bkeepers
PRO
427
64k
RailsConf 2023
tenderlove
29
980
Code Reviewing Like a Champion
maltzj
521
39k
Transcript
generated by Stable Diffusion XL v1.0 2023 12 R (
) (2) — (WBS) 2023 12 R ( ) (2) — — 2024-01 – p.1/14
https://speakerdeck.com/ks91/collections/corporate-data-analysis-2023-winter 2023 12 R ( ) (2) — — 2024-01
– p.2/14
( 20 ) 1 • 2 R • 3 •
4 • 5 • 6 ( ) • 7 (1) • 8 (2) • 9 R ( ) (1) — Welch • 10 R ( ) (2) — χ2 • 11 R ( ) (1) — • 12 R ( ) (2) — • 13 GPT-4 14 GPT-4 15 ( ) LaTeX Overleaf 8 (12/21 ) / (2 ) OK / 2023 12 R ( ) (2) — — 2024-01 – p.3/14
N(µ, σ2) ρ 2 ( : ˆ y = a
+ b1 x1 + b2 x2 + e ) 2023 12 R ( ) (2) — — 2024-01 – p.4/14
N(µ, σ2) “rnorm()” set.seed(173205) # # N(50, 10^2) 100 x
<- rnorm(n=100, mean=50, sd=10) # x # hist(x) mean(x) sd(x) 2023 12 R ( ) (2) — — 2024-01 – p.5/14
Histogram of x x Frequency 10 20 30 40 50
60 70 80 0 5 10 15 20 25 30 35 mean(x) : 50.06994 sd(x) : 10.30096 2023 12 R ( ) (2) — — 2024-01 – p.6/14
ρ 2 (1/2) MASS “mvrnorm()” “ .R” # r =
0.9 # t = 3.7 # r = 15.2 # t = 7.5 # = -0.5 # <- matrix(c( r^2, * t * r, * r * t, t^2 ), nrow=2) 2023 12 R ( ) (2) — — 2024-01 – p.7/14
“mvrnorm()” = S xx S xy S xy S yy
= S xx rS x S y rS x S y S yy ( r = S xy S x S y ) 2 x, y x, y, z, . . . 2023 12 R ( ) (2) — — 2024-01 – p.8/14
ρ 2 (2/2) MASS “mvrnorm()” “ .R” # set.seed(28284) <-
mvrnorm(n=100, mu=c( r, t), Sigma= ) <- pmin(pmax( [,1], 13.0), 19.9) <- pmin(pmax( [,2], 0.0), 20.0) “ [,1]” “ [,2]” plot 2023 12 R ( ) (2) — — 2024-01 – p.9/14
0 5 10 15 20 13 14 15 16 17
18 ㈇ࡢ┦㛵ࡢ 㐌ᙜࡓࡾࡢㄢእ㐠ື㛫 100m㉮ࡢࢱ࣒ (⛊) r : -0.5932345 ( ) -0.5884094 ( ) 2023 12 R ( ) (2) — — 2024-01 – p.10/14
(1/2) “ .R” n <- 50 # a <- 49.4
# ( (158cm ) ) # r_father <- 0.306 mean_father <- 168.78 sd_father <- 3.2 # r_mother <- 0.37 mean_mother <- 155.32 sd_mother <- 2.45 2023 12 R ( ) (2) — — 2024-01 – p.11/14
(2/2) “ .R” <- round(rnorm(n=n, mean=mean_father, sd=sd_father), digits=1) <- round(rnorm(n=n,
mean=mean_mother, sd=sd_mother), digits=1) e <- rnorm(n=n, mean=0, sd=2.8) # <- round(a + r_father * + r_mother * + e, digits=1) 1 “round()” plot 2023 12 R ( ) (2) — — 2024-01 – p.12/14
ፉ㌟㛗 160 165 170 175 152 156 160 164 160
165 170 175 ∗㌟㛗 152 156 160 164 150 154 158 150 154 158 ẕ㌟㛗 : 34.2484 : 0.3545 : 0.4137 : 0.2831 2023 12 R ( ) (2) — — 2024-01 – p.13/14
2023 12 R ( ) (2) — — 2024-01 –
p.14/14