Paranormal Statistics: Computing what doesn't add up.

Para-normal Statistics: Analyzing what doesn't add up. Steven Lembark Workhorse
Computing [email protected]

Normality We expect data is normal. It's what we are
trained for. Chi-Squared, F depend on it. It's the guts of ANOVA. Theory guarantees it, sort of.

What is "normal"? Normal data is: Parametric Real Symmetric Unimodal

Ab-normal data Not all data is parametric.

Ab-normal data Not all data is parametric: "Bold" + "Tide"
/ 2 ==

Ab-normal data Not all data is parametric: Nominal Data "Bold"
+ "Tide" / 2 == ?? "Bald" - "Harry" >= 0 ??

Ab-normal data Not all data is parametric: Ordinal Data "On
a scale of 1 to 5 how would you rate..." Is the average really 3? Are differences between ranks unform?

Ab-normal data Not all data is parametric: Ordinal Data "On
a scale of 1 to 5 how would you rate..." Is the average really 3? For different people?

Ab-normal data Not all data is unimodal, symmetric. Bi-modal data
has higher sample variance. Positive data is skewed.

Ab-normal data Counts usually Binomial or Poission. Binomial: Coin flips.
Poisson: Sample success/failure.

Power of Positive Thinking Binomial: Count of success from IID
experiments. Mean = np Variance = npq

Power of Positive Thinking Poisson: Count of occurrances in sample
size n. Mean = np Variance = np

Power of Positive Thinking Curves all positive. Right tailed. Binomial
has highest power if sample data is binomial. Result: Smaller n for given Beta.

Kinda normal Approximations work...

Kinda normal Approximations work some of the time. Rule: npq
> 5 for binomial approximation. Goal: Keep mean > 3σ so normal is all positive. Q: How good an approximation? A: It depends...

The middle way Binomial: n=20, p=0.5 Normal: µ 10, σ
= 2.23 Decent approximation.

Off to one side Binomial: n=20, p=0.3 Normal: µ =
6, σ = 2.0 Drifting negative.

Life on the edge Binomial: n=20, p=0.1 Normal: µ =
2, σ = 1.3 Significant negative.

Neverneverland Binomial: n=20, p=0.0013 Normal: µ = 0.26 σ =
0.16 Heavily negative.

General rule: npq > 5 Small or large p is
skewed. Six-sigma range should be positive. At that point n > 5 / pq. For p = 0.0013, n = 3582. Sample size around 4000?

When we assume we make... Assuming normal data leaves a
less robust conclusion. Stronger, less robust: Sensitive to individual datasets. Not reproducable.

Non-parametric Statistics Origins in Psychology, Biology, Marketing. Analyze counts, ranks.
Tests based on discrete distributions.

Common in Quality Frequency of failures. QC with No-Go guages.
Variations between batch runs. Customer feedback.

Example: Safety study Q: Are departments equally "safe"? Q: Is
a new configuration any "safer"? Compare sample populations.

What is "safe"? Fewer reported injurys? What is P( injury
) per operation?

) per operation? 0.5? 0.3?

) per operation? 0.5? 0.1? A whole lot less?

) per operation? 0.5? 0.1? A whole lot less? N(0.01, 0.01) is heavily negative.

Severe? Parametric measure of injurys?

Severe? Parametric ranking of injurys? ( Finger + Thumb )
/ 2 == ?

Severe? Parametric ranking of injurys? ( Finger + Thumb )
/ 2 == ? ( Hand + Eye ) == Arm ? ( Hand + Hand ) == 2 * Hand ?

Ordinal Data Ranked data, not scaled.

Ordinal Data Ranked data, not scaled. Hangnail < Finger Tip
< Finger < Hand < Arm "Fuzzy Buckets" Have p( accident ) from history.

Kolomogrov-Smirnov Got tonic?

Kolomogrov-Smirnov Nope, not Vodka. Like F or ANOVA: Populations are
"different".

K-S Test Compare cumulative data (blue) vs. Expeted (red). Measure
is largest difference (arrow).

K-S for safety Rank the injurys on relative scale. Compare
counts by bucket. Cumulative distribution: accomodates empty cells. minor mis-catagorization.

A good datum is hard to find, You always get
the other kind. Apologies to Bessie Smith Sliding-scale questions: "How would you rate..." "How well did..." "How likely are you to..."

the other kind. Apologies to Bessie Smith Reproducability: Variable skill. Variable methods. Variable data handling.

the other kind. Apologies to Bessie Smith Big Data: Multiple sources. Multiple populations. Multiple data standards.

Repeatable Analysis Variety of NP tests for "messy" data. Handle
protocol, sampling variations. Robust conclusions with real data.

Summary Non-parametric data: counts, nominal, ordinal data. Non-parametric analysis avoids
NID assumptions. Robust analysis of real data. Even the para-normal.

Questions?

References: N-P http://www.uta.edu/faculty/sawasthi/Statistics/stnonpar.html http://www.ncbi.nlm.nih.gov/pmc/articles/PMC153434/ Nice writeups.

References: K-S http://itl.nist.gov/div898/handbook/eda/section3/eda35g.htm Exploratory data analysis is worth exploring. https://en.wikipedia.org/wiki/Kolmogorov%E2%80%93Smirnov_test
As always... really good writeup of the test definition, math.

References: Robust Analysis https://en.wikipedia.org/wiki/Robust_statistics https://en.wikipedia.org/wiki/Robust_regression Decent introductions. Also look up
"robust statistics" at nist.gov or "robust statistical analysis" at duckduckgo.

References: This talk http://slideshare.net/lembark Along with everything else I've done...

Paranormal Statistics: Computing what doesn't a...

Paranormal Statistics: Computing what doesn't add up.

More Decks by Steven Lembark

Other Decks in Technology

Featured

Transcript