SYSEN 5300 Assignment 8 / Takehome Final Factorial Design at Two Levels and Response Surface Method

rehi_anyupcomingassignmentuneedhelpwith.zip

Home >Engineering homework help >Mechanical Engineering homework help >SYSEN 5300 Assignment 8 / Takehome Final Factorial Design at Two Levels and Response Surface Method

Lecture 19 Slides Application of Response Surface Methods.ppt

SYSEN 5300 (5310, 5320) - Systems Engineering and Six-Sigma for Systems Reliability and Quality

Introduction System Reliability (FMEA, Fault Tree) Six-sigma & Stat. Control Six-sigma & Systems Improvement (DOE) Six-sigma & Systems Improvement (RSM)

SYSEN5300 Lecture 23 Design of Experiments: Application of Response Surface Methods (RSM)

H. Oliver Gao *

Overview

Aspects of RSM
Using RSM to improve a product design
Simplification of a complicated response function by data transformation
Using RSM to determine and exploit active and inert factor spaces for multiple-response data
Using RSM to exploit inert canonical spaces
Using RSM to move from empiricism to mechanism

H. Oliver Gao *

Iterative experimentation to improve a product design

An identical problem faced by different experimenters
Different factors could have been chosen for the study
Different ranges for the factors could have been selected
Different choices could have been made for qualitative and blocking factors
Different transformations for the factors might have been employed
Different responses and their metrics might have been chosen
Different models could have been considered
Arbitrary choices  conclusions from a single experiment are doubtful  an iterative sequence of experiments (scientific iteration tends to be self-correcting)

H. Oliver Gao *

Design of a paper helicopter

Use statistical design for scientific discovery in a real investigation—sequential unfolding of a problem.
You need to experience when you perform your own experiments using experimental design and discovering your own iterative path to a solution. “the art of investigation cannot be found just by playing with someone else’s data.
A prototype helicopter design: the objective was to find an improved design giving longer flight times.

H. Oliver Gao *

Screening Experiment

To get some idea as to which factors might be important for increasing flight times, a fractional factorial arrangement was used for testing.

H. Oliver Gao *

Screening Experiment (cont’d)

Explore the possibility of increasing the flight time by changing what factors along what path.

H. Oliver Gao *

Screening Experiment (cont’d)

The linear model for estimating the mean flight times

Contour diagram

H. Oliver Gao *

Steepest Ascent

Construct a series of helicopters along the steepest ascent path: the factors were changed simultaneously in proportion to the coefficients of the fitted equation.
I.e., For every increase of 28 units in x2, x3 was reduced by 13 units and x4 by 8 units. The units were the scale factors: lscale=0.875, Lscale=0.875, Wscale=0.375, which were the changes in l, L, and W corresponding to a change of one unit in x2, x3, and x4.
A helicopter with a 4-inch wing length l was first tested on the steepest ascent path and then additional helicopters were built along this path with wing length l increased by ¾-inch increments and the other dimensions adjusted accordingly.

H. Oliver Gao *

Steepest Ascent (continued)

Data for flight helicopters built along the path of steepest ascent

Observation: among the 5 new designs, design 3 gave the longest average flight time of 347 centiseconds—an impressive improvement with flight times increased by more than 50%.

In the practical development of a manufactured product, if a new product design had been discovered that was this much ahead of current competition, a management decision of “cash in”.

What’s next?

H. Oliver Gao *

An even better design—A sequentially assembled composite design

Since none of the qualitative factors so far tried seemed to produce any positive effects, it was decided for the present to fix these features.
We explore further 4 helicopter dimensions—wing length l, wing width w, body length L, and body width W.
In addition, a discussion with an engineer led to the suggestion that a better characterization of the wing dimensions: wing area A=lw and the length-to-width ration R=l/w.
A 24 factorial in A, R, W, and L was run with two added center points with the expectation that, if necessary, additional runs could be added to the design to allow the fitting of a second-order model

H. Oliver Gao *

Helicopter data for 24 factorial design

H. Oliver Gao *

Normal plot of coefficients

It is evident that some two-factor interactions now approach the size of the main effects.

H. Oliver Gao *

Additional runs for a Central Composite Arrangement

Allow for the fitting of a second-order model. The added runs consisted of points placed at +2 and -2 units along each of the four axes.

H. Oliver Gao *

Additional runs for a Central Composite Arrangement (continued)

The estimated second-order model, allowing for possible mean differences between blocks.
4 linear coefficient in the second line
4 quadratic coefficients on the third line
6 two-factor interaction on the final lines.

To the right are the estimated SEs of the coefficients in that line

H. Oliver Gao *

Additional runs for a Central Composite Arrangement (continued)

ANOVA—goodness of fit of the model

Residual MS=9.7. Overall F ratio (207.6/9.7>20) for the fitted second-degree equation, exceeding its 5% significance level F0.05, 14, 14=2.48 by a factor of 8.6.

H. Oliver Gao *

Canonical analysis

The fitted second-order model goes as follows:

It had seemed likely to the experimenters that a maximum might now occur at S. However, the positive coefficient (3.27) suggests that the response surface almost certainly had a minimum at S in the direction of X3.
It’s possible to move from point S in either dir. Of X3 to increase flight times.

H. Oliver Gao *

Canonical analysis (cont’d)

In terms of centered variables

Thus, beginning at S, one direction of ascent along the X3 axis would be such that for each increase in of 0.52 units would be reduced by 0.45 units, reduced by 0.45 units, and increased by 0.57 units. To follow the opposite direction of ascent, you would make precisely the opposite changes.
Helicopters were now designed and constructed for 16 points along this axis.

H. Oliver Gao *

Canonical analysis (cont’d)

Experimental Data Employing Canonical Factor X3

H. Oliver Gao *

Canonical analysis (cont’d)

Characteristics of helicopters along axis X3.

H. Oliver Gao *

Helicopter Example Summary

Key: encourage us to experience process improvement and discovery by employing our imagination in studies. We can test our ideas employing different starting points, varying different factors, and so forth. Specifically

Experience the catalysis of the scientific method obtained by the use of stat. methods

Factorial designs for screening

Follow an improvement trend with steepest ascent

Sequential assembly of a composite design by adding axial points and center points to a factorial

Unexpected surprise produced from canonical analysis

SYSEN5300FA18Lecture26CompareMultipleEntities&ANOVA.pptx

11/28/18

SYSEN 5300 Systems Engineering and Six-Sigma for the Design and Operation of Reliable Systems

Lecture 26 Compare a Number of Entities and ANOVA

Dr. Oliver H. Gao and Dr. Wenqi Yi

11/28/18

Outline

Compare two entities

Compare a number of entities and ANOVA

Factorial design at two levels

Comparing Two Entities

Null hypothesis: Two means may be considered to be equal

Test/analysis:

t-test (unknown and equal variance) (Z test for known variance modified t-test for unknown and unequal variance)

Experimental strategies:

Physical randomization

Randomized paired (block) comparison

Example 1

Example 2

A gardener conducted an experiment to discover whether a change in fertilizer mixture would result in improved tomato yield. 11 plants set out in a single row; 5 with standard fertilizer A and 6 with improved mixture B.

Need for Randomization in Example 1

The negative autocorrelation produces a reduction in the std. by a factor of 0.7. Thus the reference distr. obtained from past data has a smaller spread than the corresponding scaled t distribution RANDOMIZED DESIGN

Physical Randomization in Example 2

11!/(5!6!)=462, 154 of the possible 462 arrangements provide differences greater than 1.69. Significance probability: 154/462=33%. No significant difference.

Example 3

10 boys’ shoes: amount of wear of the soles (standard material A and a cheaper one B)

Tests were run in pairs—each boy wore a special pair of shoes (one with A and the other with B, randomized)

Some boys scuffed their shoes more than other, however for each boy his two shoes were subject to the same treatment.

Randomized Paired Comparison Design in Example 3

Increase precision by making comparisons within matched pairs of experimental material

By working with the 10 differences B-A most of the boy-to-boy variation could be eliminated

Randomization: distribution: 2^10=1024.

A difference of 0.41 is quite unusual (3 of 1024 differences), probability below 0.5%  significant increase in the wear with B

T-test?

Blocking and Randomization

A block is a portion of the experimental material that is expected to be more homogeneous than the aggregate.

By confining comparisons to those within blocks, greater precision is usually obtained because the differences associated between the blocks are eliminated.

Pairs (blocks) in time and space

Block what you can and randomize what you can not to deal with unavoidable sources of variability

Comparison, Replication, Randomization, and Blocking in Simple Experiments

Conduct experiments to assess treatment A & B

Experiments should be comparative: modified and unmodified procedures should be run side by side

Genuine replication: variation among replicates can provide an accurate measure of errors

Blocking (pairing) should be used to reduce error

Randomization planned for homogeneous errors of both A and B

None of the above will necessarily alert you to the influence of bad values look at the original data

Sensitive to violation of NIID

One- vs. Two-Sided Tests

Conventional significance level: somewhat convinced at the 5% level and fairly confident at the 1% level. Confidence Interval?

Example: boys’ shoes. (1-alpha)=95% CI for the B-A. The observed average difference in wear was 0.41, its standard error was 0.12, and there were nine DOFs. The 5% level for such a t distribution is Pr(|t|>2.262)=5%. Thus

In general, the 1-alpha CI for delta would be

Confidence Interval for Differences in Means (paired design)

Example: tomato plant. (1-alpha)=95% CI for the B-A.

In general, the 1-alpha CI for delta would be

Confidence Interval for Differences in Means (Unpaired Design)

Testing the Ratio of Two Variance

A sample of n1 observations randomly drawn from a normal distr. with variance , a second sample of n2 observations from a second normal distri. with variance

Example: inexperienced chemist 1 and experienced chemist 2,

Compare a Number of Entities

Null hypothesis: All means may be considered to be equal

Test analysis: Analysis of variance (ANOVA): a generalization of the t-test used to compare two entities

Experimental strategies:

Completely randomized design

Randomized block design

Comparing a number of Entities Example Blood Coagulation Time

24 animals receiving four different diets A, B, C, D. Animals were randomly allocated to the diets, and the testing was done in the random order.

Question: Is there real difference between the mean coagulation times for the 4 diets?  Analysis of Variance (ANOVA) table

Analysis of Variance (ANOVA) Table

Arithmetic breakup of deviation from grand mean=64

Entries in the ANOVA Table

Sum of Squares: SD, ST, SR

Degrees of Freedom (DOF)

Mean Squares: mT, mR

Geometry and the ANOVA Table

The 24 numbers in each of the Tables D, T, and R constitute vectors D, T, and R

<T, R>=0 T is orthogonal to R

Since the vector D is the hypotenuse of a right triangle with sides T and R, by extending Pythagoras’ theorem to n dimensions, SD=ST+SR

Exercise

Each of 21 student athletes, grouped into 3 teams A, B, and C, attempts to toss a basketball through a hoop. The number of successes is given. Are there real differences between three teams? Construct an ANOVA.

One Way ANOVA, an Additive Model

The underlying model

F-test

Graphical Checks (diagnostics) on Violation of Assumptions

Assumptions (additivity, IID errors, normality, constant variance): the ANOVA is quite robust (insensitive) to moderate nonnormailty and to moderate inequality of group variances. However, it is sensitive to serial correlation if testing was not well randomized.

Graphical checks are used for examining: outliers (plotting residuals), serial correlation (randomization can nullify the serious effect of autocorrelation), constant variance across treatment groups, systematic drift occurring during the experiment…

Graphical Checks (diagnostics) on Violation of Assumptions

Residual for each diet

Residual versus estimated values

Residual in time sequence

Randomized Block Design

By general randomization the effect of noise is homogenized between treatment and error comparisons and thus validates the experiment.

Example:

penicillin

yield

By randomly assigning the order in which the four treatment were run within each blend (block), validity and simplicity were maintained while blend differences were largely eliminated.

Randomized Block Design (cont’d)

R=D-B-T, the vectors R, B, and T are mutually orthogonal.

Randomized Block Design (cont’d)

Increase in Efficiency by Elimination of Block Differences

Advantage of using the randomized block arrangement: of the total sum of squares not associated with treatments or with the mean, almost half is accounted for by block-to-block variation.

If the experiment had been arranged on a completely randomized basis with no blocks, the error variance would have been much smaller/larger?

With randomized block design these errors were considerably less: of the total of SD=560, SB=264 has been removed by blocks.

The randomized block design greatly increased the sensitivity of experiment and made it possible to detect smaller treatment differences.

Implications of the Additive Model

Diagnostic Checks

Latin Squares: more than one blocking component

Experiment example: test the feasibility of reducing air pollution by modifying a gasoline mixture with very small amounts of chemicals A, B, C, and D. These 4 treatments were tested with 4 different drivers and 4 different cars. Two block factors: drivers and cars

The latin square design was used to help eliminate from the treatment comparisons possible differences between the drivers, and between the cars

Latin Squares: more than one blocking component (cont’d)

Each treatment (A, B, C, or D) appears once in every row (driver) and once in every column (car). Randomization was used.

Advantage: a wider inductive basis for conclusion

Latin Squares: more than one blocking component (cont’d)

Conclusions? No convincing evidence for differences between the treatments, but the Latin square design has been effective in eliminating a large component of variation due to drivers

Graeco-Latin Squares

A Graeco-Latin square is a k by k pattern that permits the study of k treatment simultaneously with three different blocking variables each at k levels. Example: one extra blocking variable in car emissions

Could be used to eliminate possible differences between, say days.

Hyper-Graeco-Latin Squares in Martindale Wear Tester

Martindale Wear Tester: a machine used for testing the wearing quality of types of cloth or other such materials. Record of weight loss suffered by the test piece in one machine cycle (rubbed against a std. grade of emery paper)

Four types of cloth (treatments) A, B, C, D are mounted in four specimen holder 1, 2, 3, 4. Each holder can be in any one of four positions P1, P2, P3, P4. Each emery paper sheet alpha, bita, gama, delta was cut into four quarters.

Objective: (1) Make accurate comparison of the treatments; (2) Understand variability caused by various factors-holders, positions, emery papers, and cycles

Hyper-Graeco-Latin Squares in Martindale Wear Tester (cont’d)

The design was effective both in removing sources of extraneous variation and in indicating their relative importance.

Because of the elimination of these disturbances, the residual variance was reduced by a factor of 8

We could detect much smaller differences in treatment

Hyper-Graeco-Latin Squares in Martindale Wear Tester (cont’d)

The F-stat=5.39 with 3, 9 DOF, significant at 2% level.

By using a design which makes it possible to remove the effects of many larger disturbing factors, differences between treatment were made detectable.

The analysis identified the large contributions to the total variation due to cycles and to emery papers. This suggested improvements which later led to changes in the design of the machine.

Hyper-Graeco-Latin Squares in Martindale Wear Tester (cont’d)

In graphical analysis, position P2 gives much less wear than others, indicating a need of improvement

Balanced Incomplete Block Designs

Suppose the Martindale wear tester allowed only three samples to be included in each cycle, but you had 4 treatment A, B, C, and D to compare

You have 4 treatments but a block size of 3, too small to accommodate all the treatments simultaneously balanced incomplete block design

Property: every pair of treatment occurs together in a block the same number of times.

Youden Squares: Doubly Balanced Incomplete Block Designs

Example: comparing 7 treatment in seven blocks of size 4 (e.g., test 7 types of cloth A, B, C, D, E, F, and G, but only 4 test pieces could be compared simultaneously in a single machine cycle)

Also had the opportunity to eliminate a second source of block variation, machine positions

Principles for Valid and Efficient Experiments

Make use of the specialist’s knowledge and experience. Statistical techniques are an adjunct, not a replacement, for special subject matter expertise

Involve the people responsible for operation, testing, and sampling

Be sure that everyone knows what it is they are supposed to do and try to make certain that the experiments are run precisely as required.

Use appropriate randomization so that the effect of noise on the treatment responses and on the residual errors is homogenized

Provide suitable statistical analysis, both computational and graphical

)

(

var

with

but

not

iance

The

...

)

Pr(|

Testing

Hypothesis

Sided

Two

)

Pr(

Testing

Hypothesis

Sided

One

262

where

)

(

)

(

262

)

262

Pr(|

)

(

)

(

)

(

Thus

dof

with

distr

The

)

(

)

(

)

(

)

(

)

(

where

)

(

)

(

)

183

(

)

062

(

)

Pr(

)

062

183

(

mean

this

does

what

distribute

Under

Alternativ

Null

Testing

Hypothesis

)

(

)

(

)

(

estimate

and

both

Then

hypothesis

Null

treatments

four

the

difference

were

there

IID

assumed

error

associated

treatment

produced

deviation

the

effect

treatment

mean

grand

overall

group

diet

observatio

ith

the

where

alone

noise

ator

deno

the

noise

plus

signal

numerator

The

distribute

would

ratio

the

that

hypothesis

null

Under

tly

independen

distribute

would

and

then

NIID

distribute

normally

were

the

that

assumed

further

could

min

)

(

int

mod

exp

treatments

and

blocks

between

occur

said

would

eraction

additive

not

were

effects

treatment

and

block

the

would

together

both

increase

the

response

the

increases

block

and

increment

provide

treatment

Additive

response

ected

underlying

The

where

Lecture 16 Slides Multiple Regression and Comparing Two Entities(1).ppt

SYSEN 5300 (5310, 5320) - Systems Engineering and Six-Sigma for Systems Reliability and Quality

Introduction System Reliability (FMEA, Fault Tree) Six-sigma & Stat. Control Six-sigma & Systems Improvement (DOE) Six-sigma & Systems Improvement (RSM)

Introduction to Design of Experiments: Least Squares, Multiple Regression, and Why DOE

H. Oliver Gao *

Why experiments?

Example 1: a process change was made. Is it an improvement? By how much?

Example 2: Car is rated at 30 mpg. Is the rating justified?

Example 3: Data are available on the performance of multiple machines. Do the machines perform alike?

H. Oliver Gao *

Experimental Error

An operation/experiment repeated under nearly the same condition, the observed results are NEVER identical
Experimental error: fluctuation that occurs from one repetition to another
Sources of error include: measurement, analysis, sampling
Awareness of the possible experimental error is essential in analysis of data AND planning the generation of the data (i.e., experiment design)

H. Oliver Gao *

Multiple Regression

H. Oliver Gao *

Multiple Regression Example (1)

An investigator wants to determine the relationship of a key process output variable, product strength, to two key process input variables:
Hydraulic pressure during a forming process
Acid concentration

H. Oliver Gao *

Multiple Regression Example (2)

Some of the entries in this output are more important than others.:
The predictor and coef. describe the prediction model
The p-value give the significance level for each model term (p<=0.5)
The coefficient of determination (R2) is presented as R-Sq and R-Sq(adj). This value represents the proportion of the variability accounted for by the model. In this example, the model accounts for a very large percentage of the variability

H. Oliver Gao *

Multiple Regression Example (3)

In the analysis of variance portion of the output the F value is used to determine an overall P value for the model fit. In this case the resulting p value of 0.000 indicates a very high level of significance.
The regression and residual sum of squares (SS) and mean square (MS) values are interim steps toward determining the F value

H. Oliver Gao *

Least Squares Estimation, an example (1)

H. Oliver Gao *

Least Squares Estimation, an example

H. Oliver Gao *

Least Squares Estimation, an example (2)

H. Oliver Gao *

Least Squares Estimation, an example (3)

H. Oliver Gao *

Example: Multiple regression best subset analysis (1)

Results from a cause-and-effect matrix lead to a passive analysis of factors A, B, C, and D on Thruput
Plastic molding process: thruput response might be shrinkage as a function of the input factors temp. 1, temp. 2, pressure 1, and holt time
We’d like to create a model that provides a good estimate with the fewest number of terms

H. Oliver Gao *

Example: Multiple regression best subset analysis (2)

A best subsets computer regression analysis yielded

From this output we note:
R-Sq: look for the highest value when comparing models with the same # of predictors
Adj. R-Sq: look for the highest value when comparing models with the same # of predictors
Cp: Look for models where Cp is small and close to the number of parameters in the model (e.g., look for a model with Cp close to four for a three-predictor model that has an intercept constant (often we just look for the lowest Cp)
s: We want s, the estimate of the standard deviation about the regression, to be as small as possible.

H. Oliver Gao *

Example: Multiple regression best subset analysis (3)

The regression equation for a 3-parameter model from a computer program is

H. Oliver Gao *

Example: Indicator variables with covariate (1)

Consider the data set, which has created indicator variables and a covariate.
The covariate might be a continuous variable such as process temp. or dollar amount for an invoice

H. Oliver Gao *

Example: Indicator variables with covariate (2)

H. Oliver Gao *

Example: Binary logistic regression (1)

Binary logistic regression is applicable when the response is pass or fail, and the inputs are continuous variables.
Example: Ingots prepared with different heating and soaking times are tested for readiness to be rolled

H. Oliver Gao *

Example: Binary logistic regression (2)

Heat would be considered statistically significant;
Question: which levels are important?

H. Oliver Gao *

Example: Binary logistic regression (3)

From the p chart on the right, it appears that heat at the 51 level causes a larger portion of not readys

H. Oliver Gao *

Benefits to DOE

Koselka (1996) lists the following applications
Reducing the rejection rate of a touch-sensitive computer screen from 25% to less than 1% within months
Maintaining paper quality at a mill while switching to a cheaper grade of wood
Reducing the risks of misusing a drug in a hospital by incorporating a standardized instruction sheet with patient-pharmacist discussion
Reducing the defect rate of the carbon-impregnated urethane form used in bombs from 85% to zero
Improving the sales of shoes by using an inexpensive arrangement of shoes by color in a showcase, rather than an expensive, flashy alternative.
Reducing errors on service orders while at the same time improving response time on service calls
Improving bearing durability by a factor of five.

H. Oliver Gao *

Residuals and Degrees of Freedom

In later application you will encounter examples where, because of the need to calculate several sample quantities to replace unknown population parameters, several constraints are necessarily placed on the residuals.

When there are p independent linear constraints on n residuals, their sum of squares and resulting sample variance and standard deviation are all said to have n-p DOF

H. Oliver Gao *

Student’s t Distribution

H. Oliver Gao *

Sampling Distribution of a Sum and a Difference

H. Oliver Gao *

Random Sampling from a Normal Population

A random sampling of n observations from a normal distribution

H. Oliver Gao *

The Chi-Square and F Distribution

Random sampling from normal distr.
Chi-square distr. from which you can derive the distribution of the sample variance

F-distr. from which you can obtain the ratio of two sample variances

H. Oliver Gao *

Comparing Two Entities

Comparing two entities experimentally to decide whether the differences are genuine (statistically significant) or merely due to chance.

H. Oliver Gao *

Comparing Two Entities (cont.)

F 3.1
F3.2 and 3.3

H. Oliver Gao *

Comparing Two Entities (cont.)

H. Oliver Gao *

Comparing Two Entities (cont.)

H. Oliver Gao *

Randomized Design

Example: A gardener conducted an experiment to discover whether a change in fertilizer mixture would result in improved tomato yield. 11 plants set out in a single row; 5 with standard fertilizer A and 6 with improved mixture B. How did he randomize?
Fisher argued that physical randomization would make it possible to conduct a valid significant test without making assumptions of independent errors and normality. Why?

H. Oliver Gao *

Randomized Design (cont’d)

11!/(5!6!)=462, 154 of the possible 462 arrangements provide differences greater than 1.69. Significance probability: 154/462=33%. No significant difference.

H. Oliver Gao *

Randomized Design (cont’d)

T-test

H. Oliver Gao *

Randomized Paired Comparison Design

Increase precision by making comparisons within matched pairs of experimental material
Example: 10 boys’ shoes: amount of wear of the soles (standard material A and a cheaper one B)
Tests were run in pairs—each boy wore a special pair of shoes (one with A and the other with B, randomized)
Some boys skuffed their shoes more than other, however for each boy his two shoes were subject to the same treatment.
By working with the 10 differences B-A most of the boy-to-boy variation could be eliminated

H. Oliver Gao *

Randomized Paired Comparison Design (cont’d)

Null hypothesis: B=A

H. Oliver Gao *

Randomized Paired Comparison Design (cont’d)

Randomization
distribution: 2^10=1024.

A difference of 0.41 is quite unusual (3 of 1024 differences), probability below 0.5%  significant increase in the wear with B
T-test?

H. Oliver Gao *

Blocking and Randomization

A block is a portion of the experimental material that is expected to be more homogeneous than the aggregate.
By confining comparisons to those within blocks, greater precision is usually obtained because the differences associated between the blocks are eliminated.
Pairs (blocks) in time and space
Block what you can and randomize what you can not to deal with unavoidable sources of variability

H. Oliver Gao *

Comparison, Replication, Randomization, and Blocking in Simple Experiments

Conduct experiments to assess treatment A & B
Experiments should be comparative: modified and unmodified procedures should be run side by side
Genuine replication: variation among replicates can provide an accurate measure of errors
Blocking (pairing) should be used to reduce error
Randomization planned for homogeneous errors of both A and B
None of the above will necessarily alert you to the influence of bad values look at the original data
Sensitive to violation of NIID

H. Oliver Gao *

One- and Two-Sided Tests

Conventional significance level: somewhat convinced at the 5% level and fairly confident at the 1% level. Confidence Interval?

H. Oliver Gao *

In general, the 1-alpha CI for delta would be

Confidence Interval for Differences in Means (paired design)

H. Oliver Gao *

Example: tomato plant. (1-alpha)=95% CI for the B-A.

In general, the 1-alpha CI for delta would be

Confidence Interval for Differences in Means (Unpaired Design)

H. Oliver Gao *

Testing the Ratio of Two Variance

A sample of n1 observations randomly drawn from a normal distr. with variance , a second sample of n2 observations from a second normal distri. with variance
Example: inexperienced chemist 1 and experienced chemist 2,

...,

given valu

for

value

predicted

the

where

...

equation

prediction

for the

)

...

(

(LSE)

Estimates

Squares

Least

the

data

from

determine

regression

multiple

object

The

...

reduces

model

general

the

factors),

(or

variables

predictors

are

there

terms

polynomial

wihtout

situation

For the

DOE

use

great

on x

model

quadratic

full

This

error.

random

and

parameters

unknown

are

Where

e.g.,

variables

one

terms

polynomial

includes

model

general

that

minimizing

coefficien

unknown

the

estimate

)

model

the

from

calculated

values

the

and

values

data

the

between

ies,

discrepanc

the

squares

sum

for the

equation

consider t

Now

expect to

was

relationsh

the

x1,

and

ranges

relevant

the

over

and

zero,

were

and

both x0

when

zero

y was

formation

rate

mean

The

dimer x1

ion

concentrat

the

(2)

and

monomer

ion x0

concentrat

the

(1)

factors

on two

depend

iimpurity

undesirabl

formation

rate

initial

the

how

determine

experiment

from

data

set

illustrati

small

shows

table

following

The

LSE

DOF

have

squares

their

sum

the

hence

residuals

The

residuals

the

constra

linear

constitute

)

(

int

)

(

)

1908

(

)

(

Gosset

chemist

distributi

Student

have

known

std

sample

with

substitute

Suppose

unknown

always

almost

practice

)

(

)

(

)

(

)

(

)

(

)

(

)

(

)

(

)

(

)

(

)

(

)

(

)

(

)

(

)

(

var

iance

and

mean

with

)

(

var

with

but

not

iance

The

...

)

Pr(|

Testing

Hypothesis

Sided

Two

)

Pr(

Testing

Hypothesis

Sided

One

262

where

)

(

)

(

262

)

262

Pr(|

)

(

)

(

)

(

Thus

dof

with

distr

The

)

(

)

(

)

(

)

(

)

(

where

)

(

)

(

)

183

(

)

062

(

)

Pr(

)

062

183

(

mean

this

does

what

distribute

Under

Alternativ

Null

Testing

Hypothesis

Lec 17 Statistical Process Control--other useful Charts.ppt

SYSEN 5300 (5310, 5320) - Systems Engineering and Six-Sigma for Systems Reliability and Quality

Introduction System Reliability (FMEA, Fault Tree) Six-sigma & Stat. Control Six-sigma & Systems Improvement (DOE) Six-sigma & Systems Improvement (RSM)

Lecture 17 Statistical Process Control: Other Useful Charts

H. Oliver Gao *

Managing a Process

Monitoring, controlling, and improving a process

Risks: risk of false alarm, risk of not detecting a process shift
Costs: off-target products, sampling, corrective actions

This involves special circumstances not considered by the traditional variable and attribute control charts

H. Oliver Gao *

Five additional control charts

Risk-based charts: explicitly manage the two risks of making wrong decisions
modified limit charts: useful when it is uneconomical for frequent adjustment (high capability and adjustment cost)
charts to detect small shifts: for rapid detection of small but sustained shifts (e.g., low capability process)
short-run charts: the same process used to produce multiple products
charts for nonnormal distri: departure from a normal distribution

H. Oliver Gao *

Two risks of making a wrong decision with control chart

: Probability of concluding that a process is out of control when it is not. Leads to false alarms and wasted efforts to detect a process shift when not exists

: Probability of concluding that a process is in control when it is not. Implies inability to detect process shifts when they have occurred.

Both can be controlled by a proper selection of control limits and subgroup size

Risk-based control charts

H. Oliver Gao *

Control limits and risks

Narrower control limits?

H. Oliver Gao *

Subgroup size and risks

n increases from 1 to 4:

the Std of subgroup mean reduces by a factor of two

The control limits tighten and become half as wide

For a fixed alpha risk, increasing subgroup size reduces the bita risk

H. Oliver Gao *

By properly selecting control limits and subgroup size, any desired alpha and bita risks can be obtained

H. Oliver Gao *

Risk-Based Chart

For process mean chart, the control limits and subgroup size can be determined to meet any specified alpha and bita risks based upon a “single point outside control limits” as the out-of-control rule

H. Oliver Gao *

Example

Approximate subgroup size for 3 sigma limit process mean charts

Bita is the probability of not detecting a shift in the first subgroup after the shift

For d<1.5, a typical n of 3-5 usually won’t detect the shift

With n=5, shifts greater than 1.5 sigma can be detected with…

Larger n is necessary to immediately detect smaller shifts

H. Oliver Gao *

Detecting sustained shifts

The probability that a shift will be detected on the kth subgroup following the shift is

The expected number of subgroups to detect a sustained shift

E.g., for n=4, the probability of detecting a 1.5 sigma shift in the first subgroup is 50%, in the second is 25%, in the third is 12.5%. Avg=2

Exercise: n=6, 1.5 sigma shift

Point: If a shift is detected in the kth subgroup, it may have occurred not just in the most recent interval, but much prior to that.

H. Oliver Gao *

Product weight example

Sigma=1.074

For mean shift Delta=1.5 sigma, subgroup size=5

What is the bita risk?

How to interpret this? (sporadic vs. sustained shift)

What is the desirable subgroup size to detect a 1.5 sigma shift in one period with a bita risk of 10%

H. Oliver Gao *

X chart

Subgroup size fixed at 1

The bita risk is uncontrolled and is generally very large for a chart of individual values.

The X chart has a very limited ability to detect shifts rapidly

H. Oliver Gao *

Risk-based attribute charts

For alpha=0.3%, the sample size for p and u charts is approximately determined by

For attribute charts, very large sample sizes are required to achieve meaningfully small bita risk.

E.g., p=0.05, a shift delta=0.02, bita=20%Zβ=0.84, d= 0.02/sqrt(0.05*0.95)=0.092 n=?

If a subgroup size of 100 is used, bita=?

1742

H. Oliver Gao *

Modified control limit chart

The usual Shewhart 3 sigma CL charts: distinguish between common and special causes; uneconomical when Cpk is high and the cost is high for corrective action
Modified CL charts: reduce cost of corrective action while ensuring no out-of-specification product
How? –by letting the process drift just high enough and just low enough before taking action
Also know as acceptance control charts (Montgomery)

H. Oliver Gao *

Modified control limit chart (cont.)

Sigma=0.002
Shewhart CL: 0.006
Cpk: 5.0

Modified CLs have a different function: approximate economic action limits intended to signal the need for action

H. Oliver Gao *

Chart design

Determining economic action limits should explicitly consider

The cost of sampling
The cost of identifying and taking corrective actions
The cost of off-target products

Montgomery charts provide appro. Economic action limits assuming

The cost of identifying and correcting special causes >> the cost of off-target products
The range chart is in control and the within-subgroup std. is constant
Large # of obs. available to estimate the within-subgroup std., sigma is known
The product characteristic is equally acceptable as long as it anywhere within the SLs
Montgomery charts can not be designed in all cases (wider limits, permissible Cpk and sample size n)

H. Oliver Gao *

Chart design (cont.)

H. Oliver Gao *

Chart design (cont.)

H. Oliver Gao *

Chart design (cont.)

Minimum required Cpk to implement modified control limit charts

H. Oliver Gao *

Modified limit chart example

H. Oliver Gao *

Moving Average Control Chart

X-bar chart with usual subgroup size can not easily detect small shifts in the mean
To detect small shifts in the mean, use moving average (MA)

X-bar chart: good for detecting large sporadic shifts
MA chart: good for detecting small sustained shifts

H. Oliver Gao *

MA Control Chart (example)

H. Oliver Gao *

Short-run control charts

Short-run: a particular product is manufactured only for a short period of time
Difficult to use conventional control charts
Conversion from short- to long-run situation

Cutoff lengths:

Targeted 20 for A and 30 for B

H. Oliver Gao *

Short-run control charts

It would be beneficial if data from multiple products could be charted on the same control chart select a statistic to plot that has a fixed distribution over time regardless of the product.

Individual product control chart

Proliferation of control chart per process
A longer time period is needed for meaningful CL
Not applicable to few of a kind products
Fragments the continuous running record of the process

H. Oliver Gao *

Short-run individual and MR charts

Process out of control

Delta X and mR charts: if products differ only in terms of target values and the variability is constant from product to product, simply plot

Delta x = x-Ti  target value for product I

Centerline =0, std. is sigma 

Control limits delta x =0 + and/or – 2.66 mR

H. Oliver Gao *

Delta X and R charts: if products differ only in terms of target values and the variability is constant from product to product
Z bar and W bar charts: variability also changes from product to product

CV constant: A special case when each product has a different mean and standard deviation but in a manner as to keep the coefficient of variation CV constant. We can control chart , with mean equal to 1 and std. =CV/sqrt(n)

Short-run average and range charts

H. Oliver Gao *

Short-run variable charts

H. Oliver Gao *

Short-run attribute charts

H. Oliver Gao *

Charts for nonnormal distributions

Situations where the characteristic of interest does not have a normal distribution
E.g., Microbiological counts and particulate counts; time intervals between events; waiting times
For nonnormal distributions, the 6 sigma limits do not enclose 99.73% of the pop.; the alpha risk changes; not a significant issue for X-bar charts
But a big issue for charts of individual values
Two approaches
Identify the dist. of the data to some know dist., then construct centerline and CLs using their parameters
Transform the dist. of X into normal distribution: Y=f(X)

H. Oliver Gao *

Charts for lognormal distribution

Transform the dist. of X into normal distribution: Y=f(X)
Y=Ln(X)

risks

shift

ecting

not

chance

risk

alarm

false

risk

)

(

det

risk

)

(

shif

detecting

not

chance

alarm

false

shift

same

ecting

not

chance

risk

alarm

false

risk

)

(

det

shift

ecting

not

chance

risk

alarm

false

risk

)

(

det

data

historical

from

obtained

may

and

values

individual

std.

term

short

the

)

(

lim

where

UCL

size

subgroup

select

its

Control

)

(

)

(

354

product

per

defects

average

denotes

defective

fraction

denotes

)

(

charts

the

for

chart

the

for

spec.

out

prob.

the

drifts

mean

process

the

even

LSL

USL

)

(

100

)

(

observed

the

such that

drawn

LCL

The

)

(

observed

the

such that

drawn

UCL

The

spec

out

prob

hence

that

sure

are

UCL

LCL

within

long

prob

LCL

prob

UCL

)

LSL

LCL

Modified

)

USL

UCL

Modified

then

0.0013,

are

and

both

for

gives

USL

UCL

Shewhart

UCL

Modified

USL

then

closer

USL

0013

)

(

)

(

centerline

Let

limits.

Shewhart

than

wider

limits

modified

for the

Cpk

minimum

required

screening

and

action

corrective

chart

limit

control

modified

the

Implement

Step

ion

considerat

usual

upon the

based

interval

sampling

appropriat

Select

Step

006

limits

Shewhart

the

that

Note

979

)

LSL

LCL

Modified

021

)

USL

UCL

Modified

then

limits.

control

modified

the

Compute

Step

Cpk

What

size.

subgroup

restrictio

)

(2/

Cpk

here,

Cpk

minimum

required

Cpk

that

size

subgroup

miminum

Compute

Step

0.0013),

both

(here

and

values

Select the

Step

002

centered

mean

0.03,

spec.

Here

index.

Cpk

Calculate

Step

estimated)

precisely

can

satisfctor

equal

control,

chart

range

cost,

(correctio

assumption

basic

Ensure

Step

all

for

Cpk

charts

compared

factor

narrower

become

its

control

the

Essentiall

for

chart

for

its

Control

lim

;

lim

)

(

)

(

...

for

span

with

chart

)

(

577

time

over

fixed

)

(

)

(

product

ith

for the

)

(

)

(

Limits

Control

)

ln(

)

ln(

)

ln(

UCL

Exercise

UCL

Lec 15 Statistical Process Control--CI and PI.ppt

SYSEN 5300 (5310, 5320) - Systems Engineering and Six-Sigma for Systems Reliability and Quality

Introduction System Reliability (FMEA, Fault Tree) Six-sigma & Stat. Control Six-sigma & Systems Improvement (DOE) Six-sigma & Systems Improvement (RSM)

Lecture 15 Statistical Process Control: Process Capability

H. Oliver Gao *

Process Capability (PC) vs. Stability (PS)

PC: ability to meet product specifications
A capable process: all product predicted to be within specifications
Capability can not be determined without knowing product specifications
PS: a process is only influenced by common causes
PS: product specifications not necessary for judging PS

H. Oliver Gao *

Stability and Capability

A stable process: a constant and predictable distribution over time
Capable (prediction within specifications)
Not capable
Unstable process: impossible to predict
Stability is a prerequisite for defining capability.
A process
Stable and capable
Stable and incapable
Unstable but potentially capable
Unstable and incapable

Improvement action

H. Oliver Gao *

Goal and Outline for PC

Quantification of process capability for both stable and unstable processes, in terms of capability and performance indices

Methods for capability indices (CI) and confidence intervals
Connection between a CI and tolerance interval
Six sigma goal (meaning and rationale)
Application of CI: setting goals, assessing process, identifying improvement actions

H. Oliver Gao *

Capability and performance indices

Capability indices: measure what a stable process would be capable of. Two capability indices (Cp, and Cpk)
Process indices: measure the current performance of the process regardless of whether it is stable or not. Two performance indices (Pp, Ppk)

H. Oliver Gao *

Cp Index

Basic assumptions

Specification is two-sided
Process is perfectly centered in the middle of the specification
Process is stable
Process is normally distributed

H. Oliver Gao *

Cp Index (cont.)

Examples: car garage, lane width

The Cp index can be improved by widening specification width or reducing short-term variability

H. Oliver Gao *

Cpk Index

Practical modification of Cp (relax the two-sided and center assumptions)

Negative to positive infinity. Improve by: widening specification; reducing short-term variability; and changing the process mean

H. Oliver Gao *

Pp Index

Pp Index measures the performance of the process without assuming it to be stable.

Basic assumptions

Specification is two-sided
Process is perfectly centered in the middle of the specification
Process is normally distributed

H. Oliver Gao *

Ppk Index

Measure the current performance of the process without assuming a two-sided specification or stable centered process

Negative to positive infinity. Improve by: widening specification; reducing special cause variability; reducing common cause variability; and changing the process mean

H. Oliver Gao *

Relationships between Cp, Cpk, Pp, and Ppk

CI and PI assumptions

One-sided spec: only Cpk and Ppk, Cpk > Ppk
Two-sided spec:
Cp>Cpk > Ppk and Cp>Pp > Ppk
PpCpk=CpPpk
Pp>Cpk if off-centering is more severe than instability; Pp<Cpk if instability is more severe than off-centering
Ppk=Cpk and Pp=Cp for stable process, Ppk=Pp and Cpk=Cp for centered process

H. Oliver Gao *

Relationships between Cp, Cpk, Pp, and Ppk (cont.)

Example: process unstable and not centered, with current Ppk=1
Stabilization improves the process to Cpk=?
Further, if centered, process improved to Cp=?
Alternatively, if the process first gets centered, Pp=?; then further stabilized, Cp=?
In this example: Cpk=Pp

H. Oliver Gao *

Estimating CI and PI

Point Estimates
Process mean
Short-term standard deviation
Long-term standard deviation
Subgrouped data

Pooled within-subgroup std.

H. Oliver Gao *

Estimating CI and PI (cont.)

Individual data: data are available as n individual values collected over time
Data sources: X-mR chart, short-term capability studies, and process validation

H. Oliver Gao *

Example 1 (Cpk and Ppk)

Declared weight: 250 grams, single-sided lower specification limit
Estimate

Calculate Cpk and Ppk (the process has a single-sided lower spec. limit: 250 grams)

H. Oliver Gao *

Example 1 (Cpk and Ppk, continued)

H. Oliver Gao *

Example 2 (process capability and performance indices)

The x-bar and R chart of the data in the table on the right indicates that the process is in control. Now we are interested in calculating the capability and performance indices:
Specification limits: Lower: 0.500; Upper: 0.900
Calculate the process capability and performance indices

H. Oliver Gao *

Example 2 (process capability and performance indices)

The following should be noted:

Process capability and process performance metrics are noted to be almost identical.
Calculations for short-term variability were slightly larger than long-term variability, which is not reasonable because short-term variability is a component of long-term variability. Using range is not as statistically powerful as using the following

H. Oliver Gao *

Confidence Intervals for CI and PI (1)

A process capability index calculated from sample data is an estimate of the population process capability index. It is highly unlikely that the true population index is the exact value calculated.
A confidence interval adds a probability range for the true population value given the results of the sample, including sample size.

H. Oliver Gao *

Confidence Intervals for CI and PI (2)

Uncertainties in estimated CI and PI:
Cp and Pp: short-term variability is not known precisely
Cpk and Ppk: added uncertainty from unknown process mean
Uncertainties quantified by confidence intervals (a function of the DOF available to estimate the index, wider for Cpk and Ppk)

H. Oliver Gao *

Confidence Intervals for CI and PI (3)

Approximate confidence intervals for Cp, Pp, and for Cpk or Ppk close to 1

We need more than 100 to 200 observations to get a reasonable estimate of CI and PI. Even then, a 10% uncertainty

H. Oliver Gao *

Confidence Intervals for CI and PI (example)

Product weight example: 22 subgroups of size 5, calculate the confidence interval (one-sided and two sided) of Cpk=1.14. How do we interpret them?

H. Oliver Gao *

Connection with Tolerance Intervals

Process validation: a tolerance interval can be constructed to contain 100(1-p)% of the population with 100(1-a)% confidence, based on n observations over a relatively short time.

k depends on a, p, and n

Validation passes if tolerance interval is within specification limits.

Example: It is a common practice in some industries to construct a 95/95 tolerance interval, meaning that we are 95% sure that 95% of the population is within the constructed tolerance interval.

Process is validated if this tolerance interval is within spec. limits

In the context of process validation, there is connection btn the tolerance interval and th lower bound on the Cpk or the lower bound on the Ppk

H. Oliver Gao *

Connection with Tolerance Intervals (continued)

Two-sided specification:

two-sided tolerance interval is inside the specification interval. The limiting case: when one tolerance limit exactly matches the corresponding specification limit or when the tolerance interval exactly matches the specification interval. In both cases, we are 100(1-α)% sure that no more than 100p% of the product is outside the specification. This means that the distance between the estimated process mean and the nearest specification limit must be at least σZp/2. Thus, we are 100(1-α)% sure that

One-sided specification:

H. Oliver Gao *

Validation Acceptance and Minimum Cpk

Process validation: The process is validated with 100(1-a)% confidence provided that 100(1-p)% of the pop is enclosed inside the specification interval:

For process validation, calculating min Cpk is better than using tolerance intervals:

Min Cpk allows for an assessment of the goodness of the process on a continuous scale
Max fraction defective can be predicted
Allows for examination of process stability and centering

H. Oliver Gao *

Six Sigma Goal

A stable process: Cp=Pp; Cpk=Ppk.

What should be the targets for the Cp and Cpk indices?

What does a Cpk of one mean? Is this acceptable?

The consequence of a characteristic being outside spec. (e.g., for safety characteristic, the risk of 0.3% is not acceptable; for a minor degradation characteristic, Cpk=1 is reasonable)
The # of key product characteristics that control the total performance of the product. (e.g., for 10 and 100 independent characteristics, each with Cpk of one, the probabilities that the system will perform well are 97.3% and 76.3%, not acceptable)
The closeness of the estimated Cpk to the population Cpk (confidence limits)
Deviation from the continuous stability assumption (the process needs to be designed to a Cpk greater than one in order to achieve a Cpk of one in practice)

Better than 99.73% of the individual characteristic values are within specification or no more than three individual values out of 1000 are expected to be outside specifications.

H. Oliver Gao *

Six Sigma Goal (cont.)

A six sigma goal: design Cp>2 and manufacturing Cpk>1.5

H. Oliver Gao *

Planning for improvement (if and how)

What is the current performance?
What should be the process capability targets?
What improvement actions are necessary?

The Cp, Cpk, Pp, and Ppk indices permit an assessment of the process stability, centering, and capability.

H. Oliver Gao *

Six categories of processes

Based on CI and PI (assume capability targets to be six-sigma, i.e., Cp>2; Ppk>1.5

Stable and capable

Stable and potentially capable

Stable and incapable

Unstable but capable

Unstable but potentially capable

Unstable and incapable

Pr6

short

SpecificationwidthUSLLSL

ocesswidth

/,/,

()/(1)

shortw

totaltotali

Rdscors

sxxnkalldata

éù

»=--

êú

ëû

shorttotal

mss

total

USLLSL

total

USLMean

MeanLSL

orP

/1.128,(1)

()/(1),(1)

short

totaltotali

mRnDOF

sxxnalldatanDOF

éù

»=---

ëû

xks

098

074

253

total

short

)

098

(

250

253

)

074

(

250

253

confidence

95%

For

estimate

(nk

and

estimate

k(n

and

size

subgroups

for

e.g.,

ns.

observatio

number

total

the

and

for

)

and

(for

estimate

freedom

degrees

where

index

For the

index

For the

index

For the

index

For the

total

short

total

short

0.99.

greater th

index

performanc

true

that the

sure

95%

are

)

)(

110

(

)

(

)

)(

(

1.14

for

bound

lower

95%

The

1.32.

and

0.96

between

index

performanc

true

that the

sure

95%

are

)

)(

110

(

)

(

)

)(

(

1.14

for

95%

sided

two

The

index

For the

)

CI,

95%

(e.g.,

with Z

replace

interval

confidence

sided

one

For

confidence

95%

For

estimate

(nk

and

estimate

k(n

and

size

subgroups

for

e.g.,

ns.

observatio

number

total

the

and

for

)

and

(for

estimate

freedom

degrees

where

index

For the

index

For the

index

For the

index

For the

total

short

total

short

0811

07627613

[

]

6629

9688

6629

min

8159

)

081716

(

determine

can

estimate,

deviation

standard

this

Using

0817

)

7375

(

)

(

deviation

standard

term

long

The

8136

0819

326

1906

deviation

standard

short term

0.1906

0.7375

size

sample

Subgroup

yields

chart

control

process

The

sample

LSL

USL

LSL

USL

LSL

USL

LSL

USL

SYSEN5300 Lec 14 Statistical Process Control--control Charts.ppt

SYSEN 5300 (5310, 5320) - Systems Engineering and Six-Sigma for Systems Reliability and Quality

Introduction System Reliability (FMEA, Fault Tree) Six-sigma & Stat. Control Six-sigma & Systems Improvement (DOE) Six-sigma & Systems Improvement (RSM)

Lecture 14 Six Sigma and Statistical Process Control: Control Charts

H. Oliver Gao *

Motivation for Control Charts

Data are often collected over time
General descriptive data summaries (e.g., mean, Std., histogram) don’t preserve the time dimension such as a time trend
Control chart: one way to plot data over time

H. Oliver Gao *

Outline

Role of control charts
Basic principles behind determining control limits
Formulae to design the most commonly used variable and attribute control charts
Out-of-control rules to detect special causes
Key success factors for implementing effective charts

H. Oliver Gao *

Role of Control Charts

No two products are exactly alike  variability in the process
Normal causes stable and predictable variation
Special causes unstable and unpredictable variation
To improve product uniformity: reduce the special and common causes of variation or reduce their effects
Redesign product and process
Ensure the operation of the process

Confusion between common and special causes of variation is expensive and leads to counterproductive corrective actions.

H. Oliver Gao *

Control Charts (Shewhart)

A graphical method to distinguish between common and special causes of variation
Old way of quality control: quality defined as meeting specifications, inspection after produced; no effort of improving toward meeting the ideal product targets.
A better way: prevention strategy based on an understanding of the process, the causes of variability, and the nature of actions necessary to reduce variation

H. Oliver Gao *

Process and process quality

All product and services are a result of some process.

Process quality   the degree to which product and service performance characteristics are consistently on target.

H. Oliver Gao *

Cause of variation in a process

People (operators, training, and experience)
Machines (machine-to-machine difference, wear, maintenance)
Methods (temperature control)
Materials (lot-to-lot and within-lot differences)
Environment (ambient temp., humidity)
Time to place an order; the measurement systems

Common and Special causes

H. Oliver Gao *

Common causes of variation

A part of the normal operation of the process and are constantly present
Short-term variability, large in number and small in effect

Process in statistical control, output predictable within limits.
A stable process: constant mean, std, and distribution over time

H. Oliver Gao *

Special causes of variation

Not always present or not always present to the same degree
Large change in the output, long-term (because of a new cause, or larger than usual change in a key common cause)

Process out of control or unstable: unpredictable change in the mean, variance, or shape of the distribution of output attributes.  prediction impossible

H. Oliver Gao *

Improvement Actions

Two ways:

reduce common causes or their effects  change the process itself

Reduce special causes or their effects  identification and removal of special causes so that the process is executed as designed.

Two mistakes: (confusion between common and special causes is costly)

Ascribe variation to a special cause when it is the result of a common cause overadjustment (reacting to noise)
Ascribe variation to a common cause when it is the result of a special cause  ignoring a signal

H. Oliver Gao *

Examples of mistakes

A stable process with no adjustment

A stable process with adjustment

An unstable process with no adjustment

It is therefore necessary to be able to distinguish between the two types of variation.

A control chart is a graphical method used to distinguish between common cause variation and special cause variation.

The role of a control chart is to help us identify the presence and nature of special causes.

H. Oliver Gao *

Logic of control limit

Control Limit distinguishes a control chart from a simple plot of data over time
Control Limit: a bound between common and special cause variation
Logic of control limit (example)

H. Oliver Gao *

Process Mean Control Chart

Is the process mean constant over time?

Center line depends on the purpose of control chart
Whether the process is stable?  grand mean of all data
Whether the mean is on target?  targeted value

H. Oliver Gao *

Control Limit

Symmetry between UCL and LCL

3 sigma distance from center line (sample mean will has less than 0.3% chance of falling outside of 3 sigma control limit)

3 Sigma of sample mean=

Computing sigma (only reflect the common cause variability of the process): within subgroup variance

Control Limit=

Discussion: mean control limit vs. specification limit (for individual values)

H. Oliver Gao *

Variable control charts

Apply when the characteristic of interest is measured on a continuous scale.
The average and range
The average and standard deviation
The individual and moving range

Two types of control charts: variable control charts and attribute control charts.

H. Oliver Gao *

Average and range chart

Product weight example (weight control)

Two types of control charts: variable control charts and attribute control charts.

H. Oliver Gao *

Average and standard deviation chart

Std. is a better estimate of within-group variability than range. Mean and std. chart is preferred over mean and range.

Two types of control charts: variable control charts and attribute control charts.

H. Oliver Gao *

Comparison

H. Oliver Gao *

Individual and moving range chart

Only one measurement per subgroup (within-subgroup variance, i.e., short-term variation, is presented by the difference between successive values (mR))

Two types of control charts: variable control charts and attribute control charts.

H. Oliver Gao *

Individual and moving range chart

H. Oliver Gao *

Design Variable Control Chart

summary

Comparison for Product Weight Data

H. Oliver Gao *

Attribute (count data) control chart

Characteristic is measured on a discrete scale (e.g., # occurrences, # defects): loss in discrimination, increase in subgroup size, reduction in ability for continuous improvement

Selecting Attribute Control Chart

H. Oliver Gao *

Fraction defective (p) chart

# of defectives has a binomial distribution (i.e., 1. each of the n items being tested is being classified into only two categories: defective and not defective; 2. the probability p of a defective item is constant for every item)
If X represent the number of defectives in n items, then the probability of finding x defective in n items is

H. Oliver Gao *

Inventory accuracy example

How do we design a p chart for this?

In order to assess inventory accuracy, a simple data collection scheme: check 100 items each week and record the number of misplaced items.

H. Oliver Gao *

Procedure to design a p chart

Obtain # misplaced (xi) and number checked (ni)

Calculate the fraction misplaced, pi=xi/ni

Calculate centerline:

Under binomial assumptions, the standard deviation of pi is

3-sigma control limits for p chart=

H. Oliver Gao *

Control limits for attribute charts

p chart for inventory data

H. Oliver Gao *

Defect per product (u) chart

Assumption: # of defects per product follows Poisson distribution

E.g., # of accidents per month at a manufacturing facility

H. Oliver Gao *

Procedure to design a u chart

Obtain # defects (accidents) xi and corresponding sample size ni

Calculate ui=xi/ni. For this example, ui=xi, ni=1

Calculate the centerline,

For Poisson dist., the std. is

H. Oliver Gao *

Interpreting control charts

Control charts provide more information regarding process instability: identify and provide information about the nature of special causes
Pattern of points  test for a special cause

H. Oliver Gao *

Tests for the chart of averages

Sporadic shift or beginning of sustained shift

Early warning of a shift

Early warning of sustained shift

Upward or downward trend

Subgroups from different distri.

Sustained shift

Gauge is faulty, variability decreased, wrong control limits

A systematic factor: alternate machines, suppliers, or operators

H. Oliver Gao *

Key factors for successful control charts

Key characteristics to control (e.g., clarity of TV picture, time for 0-60 acceleration)
Rational subgroup (e.g., sources of variation only include common cause variability)
Proper control chart (good understanding of the process), control limits, subgroup size, and sampling interval
Control chart redesign (review and update)
Corrective actions (in case of rule violation)

...

)

(

for

UCL

LCL

Range

for

Limits

Control

one

Simpler

mean

for

Limits

Control

short

;

chart

for

limits

Control

)

(

and

)

(

lim

(

long

how

period

sampling

many

how

size

subgroup

from

far

how

its

control

lower

and

upper

where

line

center

chart

individual

std

mean

grand

short

)

(

and

)

(

and

iability

process

monitor

chart

the

mean

process

monitor

chart

NID

stability

Under

var

)

(

UCL

LCL

Std

for

Limits

Control

mean

for

Limits

Control

;

Limits

Control

Chart

short

128

)

(

)

(

)

(

)

(

)

(

checked

total

misplaced

total

size

sample

Total

accidents

defects

observed

Total

)

(

Factors	Levels	Coding
l: length of specimen	250, 300, 350 mm	x1=(l-300)/50
A: amplitude of loading cycle	8, 9, 10 mm	x2=A-9
L: load	40, 45, 50 g	x3=(L-45)/5