Six Sigma, Statistics expert needed

profilesajhal-1
Distribution_Basics.pdf

Distribution Basics

Statistics

• Descriptive statistics – Those methods involving the collection, summarisation, presentation

and characterisation of a set of data in order to properly describe the various features of that

set of data.

• Inferential statistics – Those methods that make possible the estimation of a characteristic of

a population or the making of a decision concerning a population based only on a sample.

Population vs. sample

• A population is a total collection of observations or measurements of interest

• A sample is a subset of measurements or observations from a population

Sample vs. population

m = Population mean

s = Population standard deviation

Estimate Sample statistics

Population parameters

SAMPLE POPULATION

s = Sample standard deviation (s)̂

X = Sample mean (m)̂

Describing the sample

• Central tendency – a distinct tendency to cluster about a central point (an average)

• Dispersion – amount of variation or spread in the data

• Shape – manner in which data are distributed

Statistical functions

• f(x) - Probability Density Function (PDF)

• Models a histogram

• F(x) - Cumulative Distribution Function (CDF)

• The area under f(x) to the left of x

• Probability of less than: integral of f(x) from negative infinity to x

• R(x) - Reliability function

• The area under f(x) to the right of x

• 1-F(x)

• Probability of greater than: integral of f(x) from x to infinity

• h(x) - Hazard function

• Instantaneous failure rate: f(x)/R(x)

• A measure of proneness to fail

Probability Density Function Definition

Symmetric vs. skewed data

• Mean > Median: Positive or right-skewness

• Mean = Median: Symmetry or zero-skewness

• Mean < Median: Negative or left-skewness

positively skewed (right) negatively skewed (left)

symmetric

Moment Generating Functions

1st moment generating function about the origin

2nd moment generating function about the mean

Skewness = the 3rd moment generating function about the origin

Example • Given the density function f(x) = ax, 0 < x < 10

• Determine the value of a that makes f(x) a valid density function

• Determine the mean and variance of the distribution

• Derive expressions for the reliability and hazard functions

( ) ( )( )ax dx a

x a

a 0

10 2

0

10

2 2 1

2 1

2 10 0 1

1

50  =  =  − =  =

( )E x x x

dx x

dx( ) .= = = =  50 50

1

150 10 6 667

0

10 2

3

0

10

( ) ( )V x x dx( ) . . .= − = − = 1

50 6 667

1

200 10 44 449 5 55

3 2 4

0

10

Example Continued

Probability Density Function (PDF)

0.59

0.71

0.83

0.95

1.07

1.19

1.31

1.43

1.55

1.67

1.79

1.91

2.03

2.15

2.27

2.39

2.51

2.63

0

10

20

30

40

Count

Probability Density Function

0

x

f(x)

a b

6 8 Diameter

4 6 8 0

0.2

0.4

0.6

0.8

1

Diameter

C u

m u

la ti

ve D

is tr

ib u

ti o

n

4 0

0.002

0.004

0.006

0.008

P ro

b a

b il

it y

D e

n si

ty

Cumulative Distribution Function (CDF)

Reliability Function

0 x

1

F(x) R(x)

Hazard Function

• Instantaneous failure rate