Data Visualization for Business With Tableau

profileDaraja
ANL201StudyUnit3ver20200108.pdf

Visualisation for Business

ANL 201

The Art of Data Visualisation

Study Unit 3

January 2020

Visual Cues

3

Visual Cues The eight components of visual cues

1. Position (e.g., scatterplot)

2. Length (e.g., bar chart)

3. Angle (e.g., pie chart)

4. Direction (e.g., line graph)

5. Shape (e.g., scatterplot)

6. Area (e.g., square area graph)

7. Volume

8. Colour

4

Visual Cues

Mun Teng
Sticky Note
normally attributed to the scatter plot, useful for spotting outliers, good for relatively comparison between points, not so good for telling you the exact data points and when many data points are close to one another
Mun Teng
Sticky Note
always start from 0 scale
Mun Teng
Sticky Note
the larger the circle of a chart, the bigger the size

5

Visual Cues

6

Visual Cues Colour — the Red-Green-Blue (RGB) colour system

‣ The basic idea of the RGB colour system is that any coloured light can be matched by a weighted sum of any three distinct primary colours

C ≡ rR + gG + bB,

where

C is the colour to be matched

R, G, and B are primary sources to be used to create a match

r, g, and b are the amounts of each primary source

≡ denotes a perceptual match

Mun Teng
Sticky Note
colors are good at segmenting categories
Mun Teng
Sticky Note
only supplementary on math, need to know this only thoroughly for this module

7

Visual Cues Colour — the CIE colour system

‣ The CIE colour system uses a set of abstract primaries called tristimulus values that are labelled XYZ. These values are chosen for their mathematical

properties, and not because they match any set of actual lights

‣ The CIE colour system is by far the most widely adopted colour system to measure coloured lights. We should always use the CIE colour system when

precise colour specification is required

8

Visual Cues Colour — the HSV colour system

‣ The HSV colour system uses colour hue, colour saturation, and black-white brightness (i.e., value) to specify the surface colours

‣ In the HSV colour system, hue refers to which part of the rainbow colour map a colour belongs to, such as red or green. Saturation refers to how rich a colour

hue is, for example, neon colours are very saturated, while pastel colours are

less saturated. Value denotes how bright a colour is, or in other words, how close

a colour is to pure white or pure black

Coordinate Systems

10

Coordinate Systems The cartesian coordinate system

‣ The cartesian coordinate system specifies each data point on a plane by a pair of numerical coordinates. The numerical coordinates are the signed distances

from the data point to the two fixed perpendicular reference lines, called the x-

axis and y-axis

‣ Both axes meet at a point, called the origin, which is usually represented by the ordered pair (0, 0)

‣ The numerical coordinates can also be expressed as a signed distance from the origin

Mun Teng
Sticky Note
distance can be plus or minus that's why it is signed distance

Coordinate Systems

The cartesian coordinate system

12

Coordinate Systems The polar coordinate system

‣ In the polar coordinate system, each data point is determined by the distance between a fixed point and an angle from a fixed direction

‣ The fixed point, which is analogous to the origin in the cartesian coordinate system, is called the pole. The ray or half-line from the pole in the fixed direction

is called the polar axis. The distance from the pole is called the radial coordinate

or radius, and the angle is called the angular coordinate, polar angle or azimuth

Coordinate Systems The polar coordinate system

Mun Teng
Sticky Note
Pole
Mun Teng
Sticky Note
Polar Axis
Mun Teng
Sticky Note
Radial Coordinate, Angular Coordinate

14

Coordinate Systems The geographic coordinate system

‣ The geographic coordinate system enables every location on the earth to be specified by a set of numbers or letters. To represent the location, the coordinate

system commonly uses latitude and longitude. Sometimes the coordinate system

may also use elevation

‣ Latitude lines run east and west, which indicates north and south positions on the globe. Longitude lines run north and south, which indicates east and west

positions. Elevation can be thought of as a third dimension

Mun Teng
Sticky Note
longitude is the vertical axis
Mun Teng
Sticky Note
latitude is horizontal axis
Mun Teng
Sticky Note
wont be using this for the module, just theory

Coordinate System

Source: http://image.slidesharecdn.com/projectionsandarcgis-131008191137- phpapp01/95/understanding-coordinate-systems-and-projections-for-arcgis-4-638.jpg?cb=1381259844

The geographic coordinate system

16

Coordinate Systems The geographic coordinate system — projections (1/4)

‣ The surface of the earth is wrapped around a spherical mass, but we usually want to display a location on earth on a two-dimensional surface, like a piece of

paper or a computer screen

‣ Therefore, there is a variety of ways to map the surface of the Earth on a two- dimensional surface, which are called projections

17

Coordinate Systems

1. Equirectangular — typically used for thematic mapping and it does not

preserve any area or angle

2. Albers — does not preserve scale and shape, and angle is minimally distorted

3. Mercator — preserves angles and shapes in small area, so it is good for

direction

The geographic coordinate system — projections (2/4)

Mun Teng
Sticky Note
extra reading material for pg 17 till 19

18

Coordinate Systems

4. Lambert Conformal Conic — better used for showing smaller areas and it is

often used for aeronautical maps

5. Sinusoidal — preserves area and it is useful for showing areas near the prime

meridian

6. Polyconic — used to show the map of the U.S. in the mid-1900s. There are

little distortions in small areas near the meridian

The geographic coordinate system — projections (3/4)

19

Coordinate Systems

7. Winkel Tripel — a good choice for showing the world map, because it

minimises area, angle and distance distortions

8. Robinson — a good choice for showing the world map because it compromises

preserving areas and angles

9. Orthographic — represents a three-dimensional object in a two-dimensional

space. Using this method, the user needs to rotate to the area/location of

interest

The geographic coordinate system — projections (4/4)

Scales

21

Scales Comparing the different scales (1/2)

‣ With a linear scale, the visual spacing between each of the data points is the same regardless where the data points are on the axis

‣ The logarithmic scale condenses the distance between each of the data points when the value of the data points increase

‣ A percent scale is usually linear, but when it is used to represent part of the whole data, its maximum is 100 percent

22

Scales Comparing the different scales (2/2)

‣ We use a categorical scale when we want to provide visual separation of categorical data, such as country of residence or gender

‣ We use the time scale when we want to plot temporal data on a linear scale, or to divide the temporal data on a categorical scale, such as by year, month or day

Context

24

Context The big idea

‣ Context is a data visualisation component that lends to better understanding of who, what, when, where and why of the data. Context can make the data clearer

for interpretation

‣ When we would like to enable viewers to see the data visualisation object of primary interest in full detail, and at the same time get an overview within the

context (i.e., surrounding information) available, this is known as a focus-context

problem in data visualisation

25

Context The three premises of the focus-context problem

1. The viewer needs both context (i.e., overview of the information), and focus

(i.e., details of the information) simultaneously

2. The information needed in the overview may be different from that needed in

the detail

3. These two types of information need to be combined within a single interactive

data visualisation

26

Context

‣ Spatial related problems are common to all Data Visualisation that use maps

‣ Structural related problem arises when we try to visualise data that have structural components at many levels

‣ Temporal related problem involves understanding the timing of data at very different scales

Sub-types of the focus-context problem

27

Context

‣ The distortion technique spatially distorts a data presentation to give more room to the designated points of interest, and to decrease the space given to regions

away from those points

Solving the focus-context problem — distortion

Context Distortion – Hyperbolic Tree Browser

29

Context

‣ The rapid zooming technique allows viewers to zoom rapidly in and out of points of interest

Solving the focus-context problem — rapid zooming

30

Context

‣ The elision technique hides parts of a structure from viewers until they are needed

Solving the focus-context problem — elision

Context Elision – Fish Eye Technique

32

Context

‣ The multiple windows technique allows viewers to have one window that shows an overview of the data, and several other windows that show the expanded

details

Solving the focus-context problem — multiple windows

Discussion

Source: https://zylab.files.wordpress.com/2010/09/hyperbolic_tree.png ; http://www.ceh.ac.uk/sites/default/files/hyrad-static-and-rapid.jpg ; http://tulip.labri.fr/TulipDrupal/sites/default/files/uploadedFiles/images/scatterplot_view_detail_fisheye.preview.png; https://www.devexpress.com/products/net/dashboard/i/demos/winforms-hr-dashboard.png

Can you identify the four focus-context data visualisation techniques

(a) (b)

(c) (d)

Mun Teng
Sticky Note
zooming technique, rapid zooming
Mun Teng
Sticky Note
fish eye technique, structure is blown up for certain areas
Mun Teng
Sticky Note
distortion

Tableau (Class Activity)

Tableau (Class Activity)

1. Sit with your GBA’s team mates

2. Follow your instructor for the following exercises:

- Data:

- global_superstore_2016.xlsx (orders)

- Coffee Chain.xlsx and Office City.xlsx

Calculation: Aggregate VS Record-Level

Aggregate Functions

1. Aggregation of a measure

2. Aggregation of a dimension

More information:

• https://help.tableau.com/current/pro/desktop/en-us/calculations_aggregation.htm

• https://help.tableau.com/current/pro/desktop/en-us/calculations_calculatedfields_aggregate_create.htm

Exercise 1. Create a calculated field: for each state, calculate combined sales from office city.xlsx

and coffee chain.xlsx

Quick Table Calculation

percent of total

Quick Table Calculation

running total

Quick Table Calculation

running total for each year: Is this chart correct?

Quick Table Calculation running total for each year:

Quick Table Calculation

More Information:

https://help.tableau.com/current/pro/desktop/en-us/calculations_tablecalculations.htm

suss.edu.sg

Course Homepage https://canvas.suss.edu.sg/courses/21575

Study Guide https://ibookstore.suss.edu.sg/

Tableau Desktop https://www.tableau.com/products/trial

Tableau Tutorials https://www.tableau.com/learn/get-started/creator

Academic Calendar https://www.suss.edu.sg/docs/default-

source/contentdoc/cel/ft-2020acadcalendar.pdf