Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
R を用いた分析(補講) (2) — 人工データの生成 / Generating Artifi...
Search
Kenji Saito
PRO
January 25, 2024
Business
0
110
R を用いた分析(補講) (2) — 人工データの生成 / Generating Artificial Data
早稲田大学大学院経営管理研究科「企業データ分析」2023 冬のオンデマンド教材 第11回で使用したスライドです。
Kenji Saito
PRO
January 25, 2024
Tweet
Share
More Decks by Kenji Saito
See All by Kenji Saito
コードや知識を組み込む / Incorporating Codes and Knowledge
ks91
PRO
0
130
シリアスゲームとしての RPG / RPGs as Serious Games
ks91
PRO
0
44
"September 12th" ゲームのプロンプトの構造 / "September 12th" Game Prompt Structure
ks91
PRO
0
36
ワールドカフェI /チューターを改良する / World Café I and Improving the Tutors
ks91
PRO
0
140
自然言語の扱いと翻訳のためのプロンプト / Natural Language Handling and Prompts for Translation
ks91
PRO
0
39
研究って何だっけ / What is Research?
ks91
PRO
0
34
ブロックチェーンと分散ファイナンス概論 / Introduction to Blockchain and Decentralized Finance
ks91
PRO
0
19
大規模言語モデルの原理と使いこなしの原則 / Principles of Large Language Models and Their Use
ks91
PRO
0
34
講師研究紹介 / Research Introduction of the Lecturer
ks91
PRO
0
44
Other Decks in Business
See All in Business
Spice Factory Co., Ltd. Culture Deck
spicefactory
0
3.9k
FinGo
hyunchang
0
220
ソニックガーデン会社説明資料(2025年1月)
kuranuki
0
260
プロダクトプランナー・ビジネスコンサルタント職種説明資料
lycorp_recruit_jp
0
11k
Chemican Overview
aswdv
0
230
メドピアグループ紹介資料
medpeer_recruit
10
130k
yamory事業紹介資料
assuredjp
0
920
日本トライスタイル採用説明資料
yamauguchishunta
0
240
採用ピッチ(2025年4月2日更新)
canvas_recruit
1
1.2k
Fantia株式会社 会社紹介資料
fantia
0
190
VISASQ: ABOUT DEV TEAM
eikohashiba
3
26k
NotebookLM + Agentspace を使った(開発)体験
satohjohn
1
440
Featured
See All Featured
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
233
17k
Reflections from 52 weeks, 52 projects
jeffersonlam
349
20k
How STYLIGHT went responsive
nonsquared
100
5.5k
The Invisible Side of Design
smashingmag
299
50k
Why You Should Never Use an ORM
jnunemaker
PRO
56
9.3k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
47
5.3k
Scaling GitHub
holman
459
140k
The MySQL Ecosystem @ GitHub 2015
samlambert
251
12k
What’s in a name? Adding method to the madness
productmarketing
PRO
22
3.4k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
41
2.3k
Why Our Code Smells
bkeepers
PRO
336
57k
4 Signs Your Business is Dying
shpigford
183
22k
Transcript
generated by Stable Diffusion XL v1.0 2023 12 R (
) (2) — (WBS) 2023 12 R ( ) (2) — — 2024-01 – p.1/14
https://speakerdeck.com/ks91/collections/corporate-data-analysis-2023-winter 2023 12 R ( ) (2) — — 2024-01
– p.2/14
( 20 ) 1 • 2 R • 3 •
4 • 5 • 6 ( ) • 7 (1) • 8 (2) • 9 R ( ) (1) — Welch • 10 R ( ) (2) — χ2 • 11 R ( ) (1) — • 12 R ( ) (2) — • 13 GPT-4 14 GPT-4 15 ( ) LaTeX Overleaf 8 (12/21 ) / (2 ) OK / 2023 12 R ( ) (2) — — 2024-01 – p.3/14
N(µ, σ2) ρ 2 ( : ˆ y = a
+ b1 x1 + b2 x2 + e ) 2023 12 R ( ) (2) — — 2024-01 – p.4/14
N(µ, σ2) “rnorm()” set.seed(173205) # # N(50, 10^2) 100 x
<- rnorm(n=100, mean=50, sd=10) # x # hist(x) mean(x) sd(x) 2023 12 R ( ) (2) — — 2024-01 – p.5/14
Histogram of x x Frequency 10 20 30 40 50
60 70 80 0 5 10 15 20 25 30 35 mean(x) : 50.06994 sd(x) : 10.30096 2023 12 R ( ) (2) — — 2024-01 – p.6/14
ρ 2 (1/2) MASS “mvrnorm()” “ .R” # r =
0.9 # t = 3.7 # r = 15.2 # t = 7.5 # = -0.5 # <- matrix(c( r^2, * t * r, * r * t, t^2 ), nrow=2) 2023 12 R ( ) (2) — — 2024-01 – p.7/14
“mvrnorm()” = S xx S xy S xy S yy
= S xx rS x S y rS x S y S yy ( r = S xy S x S y ) 2 x, y x, y, z, . . . 2023 12 R ( ) (2) — — 2024-01 – p.8/14
ρ 2 (2/2) MASS “mvrnorm()” “ .R” # set.seed(28284) <-
mvrnorm(n=100, mu=c( r, t), Sigma= ) <- pmin(pmax( [,1], 13.0), 19.9) <- pmin(pmax( [,2], 0.0), 20.0) “ [,1]” “ [,2]” plot 2023 12 R ( ) (2) — — 2024-01 – p.9/14
0 5 10 15 20 13 14 15 16 17
18 ㈇ࡢ┦㛵ࡢ 㐌ᙜࡓࡾࡢㄢእ㐠ື㛫 100m㉮ࡢࢱ࣒ (⛊) r : -0.5932345 ( ) -0.5884094 ( ) 2023 12 R ( ) (2) — — 2024-01 – p.10/14
(1/2) “ .R” n <- 50 # a <- 49.4
# ( (158cm ) ) # r_father <- 0.306 mean_father <- 168.78 sd_father <- 3.2 # r_mother <- 0.37 mean_mother <- 155.32 sd_mother <- 2.45 2023 12 R ( ) (2) — — 2024-01 – p.11/14
(2/2) “ .R” <- round(rnorm(n=n, mean=mean_father, sd=sd_father), digits=1) <- round(rnorm(n=n,
mean=mean_mother, sd=sd_mother), digits=1) e <- rnorm(n=n, mean=0, sd=2.8) # <- round(a + r_father * + r_mother * + e, digits=1) 1 “round()” plot 2023 12 R ( ) (2) — — 2024-01 – p.12/14
ፉ㌟㛗 160 165 170 175 152 156 160 164 160
165 170 175 ∗㌟㛗 152 156 160 164 150 154 158 150 154 158 ẕ㌟㛗 : 34.2484 : 0.3545 : 0.4137 : 0.2831 2023 12 R ( ) (2) — — 2024-01 – p.13/14
2023 12 R ( ) (2) — — 2024-01 –
p.14/14