# Data Visualization: Then and Now

This deck was first presented at SAP Inside Track Netherlands 2012
and updated for SAP Online Track on 2020-05-30.

May 30, 2020

## Transcript

Vitaliy Rudnytskiy, SAP
2020-05
Data Visualization:
Then and Now

Let me introduce myself
- Віталій Рудницький, aka Vitaliy Rudnytskiy, aka Witalij Rudnicki, aka @Sygyzmundovych
- SAP’s Developer Relations team
- 12 years as a BI Technology Consultant
- SAP Mentor 2010-2014
- Self-proclaimed King of Data Geeks :)
- Organizer of Wrocław’s SAP Community meetups and SIT
- https://people.sap.com/vitaliy.rudnytskiy

4. Visualization: A bit of theory

Picture Superiority Effect – Understanding a meaning
„…is a simple shape in Euclidean geometry, which is the set of all points in a plane that are at a given distance from a given point; equivalently it is the curve traced out by a point that moves so that its distance from a given point is constant.”
Source: Wikipedia

Picture Superiority Effect – Understanding numbers

| I: x | I: y | II: x | II: y | III: x | III: y | IV: x | IV: y |
|------|------|-------|-------|--------|--------|-------|-------|
| 10.0 | 8.04 | 10.0 | 9.14 | 10.0 | 7.46 | 8.0 | 6.58 |
| 8.0 | 6.95 | 8.0 | 8.14 | 8.0 | 6.77 | 8.0 | 5.76 |
| 13.0 | 7.58 | 13.0 | 8.74 | 13.0 | 12.74 | 8.0 | 7.71 |
| 9.0 | 8.81 | 9.0 | 8.77 | 9.0 | 7.11 | 8.0 | 8.84 |
| 11.0 | 8.33 | 11.0 | 9.26 | 11.0 | 7.81 | 8.0 | 8.47 |
| 14.0 | 9.96 | 14.0 | 8.10 | 14.0 | 8.84 | 8.0 | 7.04 |
| 6.0 | 7.24 | 6.0 | 6.13 | 6.0 | 6.08 | 8.0 | 5.25 |
| 4.0 | 4.26 | 4.0 | 3.10 | 4.0 | 5.39 | 19.0 | 12.50 |
| 12.0 | 10.84 | 12.0 | 9.13 | 12.0 | 8.15 | 8.0 | 5.56 |
| 7.0 | 4.82 | 7.0 | 7.26 | 7.0 | 6.42 | 8.0 | 7.91 |
| 5.0 | 5.68 | 5.0 | 4.74 | 5.0 | 5.73 | 8.0 | 6.89 |

| Property | Value |
|----------|-------|
| Mean of x in each case | 9 (exact) |
| Sample variance of x in each case | 11 (exact) |
| Mean of y in each case | 7.50 (to 2 decimal places) |
| Sample variance of y in each case | 4.122 or 4.127 (to 3 decimal places) |
| Correlation between x and y in each case | 0.816 (to 3 decimal places) |
| Linear regression line in each case | y = 3.00 + 0.500x (to 2 and 3 decimal places, respectively) |

http://en.wikipedia.org/wiki/Anscombe%27s_quartet

Picture Superiority Effect – Understanding numbers
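
The summary statistics listed for Anscombe's quartet can be re-derived with a few lines of Python. This is only a minimal sketch using the standard library, not part of the original deck: the names `QUARTET` and `summarize` are illustrative, and the data are the values tabulated on the previous slide.

```python
from statistics import mean, variance

# Anscombe's quartet: sets I-III share the same x values.
X_123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X_123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X_123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X_123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(x, y):
    """Return means, sample variances, Pearson correlation and the
    least-squares line y = intercept + slope * x for one data set."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)                    # sum of squares of x
    syy = sum((b - my) ** 2 for b in y)                    # sum of squares of y
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))   # cross products
    slope = sxy / sxx
    return {
        "mean_x": mx,
        "var_x": variance(x),
        "mean_y": my,
        "var_y": variance(y),
        "corr": sxy / (sxx * syy) ** 0.5,
        "slope": slope,
        "intercept": my - slope * mx,
    }


for name, (x, y) in QUARTET.items():
    stats = summarize(x, y)
    print(name, {key: round(value, 3) for key, value in stats.items()})
```

All four sets should print nearly identical numbers, which is the point of the slide: only plotting the points reveals how different the sets really are.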


Picture Superiority Effect – Remembering
„Cool Infographics: Effective Communication with Data Visualization and Design” by Randy Krum, John Wiley & Sons © 2014
after: „Brain Rules: 12 Principles for Surviving and Thriving at Work, Home, and School” by John Medina, Pear Press © 2008


Picture Superiority Effect – Context

10. Data Visualization:
Shapes, locations and colors
based on „SAP Lumira: Data Visualization Handbook” and other sources

The human eye can distinguish between 7 and 10 million colors
How many can you?
source: https://en.wikipedia.org/wiki/Color_vision#/media/File:1Mcolors.png
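
The linked file name hints at an image with one million colors. As a rough sketch (an assumption for illustration, not how the Wikipedia image was produced), the following Python builds one million distinct RGB values and writes them to a simple PPM image so you can check how many you can tell apart. The function names, the grid of 100 levels per channel and the output file name are all hypothetical.

```python
from itertools import product


def million_colors(levels: int = 100) -> list:
    """Return levels**3 distinct RGB triples, evenly spread over 0..255."""
    # 100 evenly spaced levels per channel give 100 * 100 * 100 = 1,000,000 colors.
    steps = [round(i * 255 / (levels - 1)) for i in range(levels)]
    return list(product(steps, repeat=3))


def write_ppm(path: str, pixels, width: int, height: int) -> None:
    """Write RGB pixels as a binary PPM (P6) file, a simple uncompressed format."""
    if len(pixels) != width * height:
        raise ValueError("pixel count does not match image size")
    with open(path, "wb") as f:
        f.write(f"P6\n{width} {height}\n255\n".encode("ascii"))
        f.write(bytes(channel for pixel in pixels for channel in pixel))


if __name__ == "__main__":
    pixels = million_colors()
    # 1000 x 1000 pixels, each with a different color.
    write_ppm("million_colors.ppm", pixels, 1000, 1000)
```

Many image viewers open PPM files directly; otherwise the file can be converted to PNG with common image tools.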


The Colour Wheel




Example: SAP → Brand Elements → Visual → Color Palette
Our range of primary colors consists
of SAP Gold, white, black and the
supportive neutral colors (SAP light,
medium and dark gray)
We use our secondary color palette
for graphs and tables, as well as for
accent and highlighting purposes
https://www.sapbrandtools.com/color-palette/#palette-details


Visual Perception
„SAP Lumira Data Visualization Handbook” http://getlumira.sapstore.com/data-visualization-handbook/

Chart Picker
“…precise representation of the true data value…”

https://info.sapdigital.com/The-Data-Visualization-Handbook_WC.html

18. Data Visualization: A short history
sources: https://de.wikipedia.org/wiki/Microsoft_Excel

First known chart?
Inclinations of the planetary orbits, from a monastery school’s textbook
Time series (10th-11th century)


First known column chart?
The Frenchman Nicole Oresme used a column chart in a 14th-century publication,
"The Latitude of Forms", to plot the velocity of a constantly accelerating object against time.


Klaer Lightende Spiegel der Verfkonst (1692)


Introduction of Relational Graphics: neither Time nor Geo
Apparently the first use of pie charts: The Statistical Breviary, by William Playfair, 1801


The most famous geospatial chart
Inspired by Picquet, Dr John Snow used the same principle to depict cholera deaths in Central London in 1854.

Nightingale’s Rose diagram (1858):
"a true pioneer in graphical representation of statistics"
"Diagram of the causes of mortality in the army in the East" by Florence Nightingale


International Year of the Nurse and Midwife
#Nightingale2020
https://www.smithsonianmag.com/history/the-worlds-most-famous-nurse-florence-nightingale-180974155/

Escaping 2D: Introduction of 3D
Chart by Luigi Perozzo, from the Annali di Statistica, 1880
The population of Sweden from 1750 to 1875 by age groups, printed in color in a way that enhances the perception of depth.


1975-85: the Golden Age of statistical graphics


1990s: explosion of computer-aided visualizations


29. IBCS

Since 2013 the Creative Commons IBCS® project has been governed by the newly founded IBCS Association.
The further development of the IBCS® Standards is an ongoing process, which is managed by the not-for-profit IBCS Association.
The International Business Communication Standards (IBCS®) are practical proposals for the design of reports, presentations, dashboards and the diagrams and tables contained therein.
This involves the conception of the content, the visual perception and the application of a semantic notation.

International Business Communication Standards are proposals for the conceptual and visual design of
management reports and presentations.
The purpose of IBCS is to foster the level of understanding in published material for business communication.
Business communication meets the IBCS® Standards when it complies with the rules of the seven areas:
- SAY: Convey a message
- UNIFY: Apply semantic notation
- CONDENSE: Increase information density
- CHECK: Ensure visual integrity
- EXPRESS: Choose proper visualization
- SIMPLIFY: Avoid clutter
- STRUCTURE: Organize content

SAP and IBCS



34. EDA: Exploratory Data Analysis

Modelling (Schema) → Data Collection → Data Analysis → Data Storytelling
Data Collection → Data Analysis → Modelling → Data Storytelling
Exploratory Data Analysis


36. Demo:
https://github.com/SAP-samples/hana-ml-samples/tree/master/Python-API/usecase-examples/estimate-car-price

Data Storytelling…

