buisness data analysis
Significance Levels. 0.05, 0.01, or?
Author(s): Lester V. Manderscheid
Source: Journal of Farm Economics , Dec., 1965, Vol. 47, No. 5, Proceedings Number (Dec., 1965), pp. 1381-1385
Published by: Oxford University Press on behalf of the Agricultural & Applied Economics Association
Stable URL: https://www.jstor.org/stable/1236396
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org. Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at https://about.jstor.org/terms
and Oxford University Press are collaborating with JSTOR to digitize, preserve and extend access to Journal of Farm Economics
This content downloaded from �������������80.233.34.48 on Sun, 03 Jul 2022 13:29:31 UTC��������������
All use subject to https://about.jstor.org/terms
CONTRIBUTED PAPERS: MARKETING, PRICES, AND CONSUMPTION
CHAIRMAN: MARGUERITE BuRK, UNIVERSITY OF MINNESOTA
Significance Levels-0.05, 0.01, or ? LESTER V. MANDERSCHEID
M OST statistically oriented research published in the JOURNAL OF FARM ECONOMICS includes tests of statistical hypotheses. In most
cases a significance level of either 5 or 1 percent is cited. But a few use 10 or even 20 percent. Why the difference? Is a 1-percent level "better" than a 5-percent level?
I will argue that choice of statistical-significance levels is not arbitrary but rather is, or at least should be, a deliberate choice. The basic purpose of this paper is to integrate decision theory-management if you prefer- with statistical hypothesis testing. The discussion will be restricted to rel- atively simple cases so as to minimize mathematical confusion. Once the concepts are clarified, the mathematically sophisticated reader may pursue the more realistic cases. Let me begin with a review of some ele- mentary ideas to insure that we are all thinking in the same terms.
The basic problem in hypothesis testing involves choosing which of two hypotheses to use as a basis for action. Hypotheses may be simple or very complex. In a simple case, we might hypothesize that two populations have equal means (H) or alternatively that the mean of one population is two units larger than the mean of the second (HA). More formally:
H: : Al == A
HA: 1I1- / 12 = 2.
A t test can be applied to a set of data to determine whether we accept H, or reject H and accept HA. What significance level should we use? Is a 1-percent significance level better than a 5-percent significance level?
Why would we consider the 1-percent significance level better? Be- cause the probability of rejecting H when it is true is reduced to 1 per- cent. And this is obviously better than using a test which permits a 5-per- cent probability of rejecting H when it is true. Or is it? The probability of accepting H when it is false must also be considered.
Thus, we have two types of errors: Type I: Rejecting H when it is true. Type II: Accepting H when it is false.
LESTER V. MANDERSCHEID is associate professor of agricultural economics, Michi- gan State University.
1381
This content downloaded from �������������80.233.34.48 on Sun, 03 Jul 2022 13:29:31 UTC��������������
All use subject to https://about.jstor.org/terms
1382 LESTER V. MANDERSCim
In a statistician's Utopia one can simultaneously minimize the proba- bility of Type I error (a) and the probability of Type II error (P). A statistician in Utopia would obviously set a = 0= . But as a is de- creased (the significance level moved from 5 percent to 1 percent) P is increased. The exact relationship between a and P depends on the underlying probability distributions for the test statistic and on the hypothesis and alternative hypothesis. This relationship is illustrated in most introductory statistics textbooks. For any particular test, we know the test statistic and the hypotheses.
From this information we can calculate the P associated wtih any par- ticular a.: The statistical tests recommended in standard textbooks or
reference books are suggested because P is minimized for given a by these tests. For example, the t test is recommended for testing H:?, = ~2 against HA:,1 - '2 = 2 under rather general conditions be- cause for any value of a (any significance level) the probability of a Type II error, P, is as small as possible. In some cases power functions or operating-characteristic curves are exhibited to illustrate this fact. Unfortunately, very few standard statistics books go much further in helping us select a significance level. Two quotations from the more helpful books will suffice to make the point:
The choice of a level of significance a will usually be somewhat arbi- trary since in most situations there is no precise limit to the probability of an error of the first kind that can be tolerated. It has become cus- tomary to choose for a one of a number of standard values such as .005, .01 or .05. There is some convenience in such standardization since it permits a reduction in certain tables needed for carrying out various tests. Otherwise there appears to be no particular reason for selecting these values. In fact, when choosing a level of significance one should also consider the power that the test will achieve against various alternatives.2
In practice, the final choice of the value for the critical probability represents some compromise between these two risks. It must be arrived at by balancing the consequences of a Type I error against the possible consequences of a Type II error.3
Both quotations emphasize balancing the two types of error. Can we formalize this balancing by use of economic- and/or decision-theory criteria? One might consider minimizing a weighted average of the
1This statement is literally true for simple hypotheses. For complex hypotheses, we can calculate the P3 for given a for various alternative values of the parameters.
'E. L. Lehmann, Testing Statistical Hypotheses, New York, John Wiley & Sons, Inc., 1959, p. 61.
' W. A. Spurr, L. S. Kellogg, and J. H. Smith, Business and Economic Statistics, Homewood, Ill., Richard D. Irwin, 1961, p. 253.
This content downloaded from �������������80.233.34.48 on Sun, 03 Jul 2022 13:29:31 UTC��������������
All use subject to https://about.jstor.org/terms
SIGNIFICANCE LEVELS-0.05, 0.01, OR ? 1383
costs, using a and P as weights. Defining the cost of a Type I error as C, and the cost of a Type II error as Cu, this might be stated as:
Minimize L' = aCI + #CII.
We thus consider L' as a 'loss function" and minimize it by choosing appropriate a and 3. More properly L' should be labeled as expected- loss function but simplicity suggests the term 'loss function."
Unfortunately, the loss function involves mathematical difficulties: a is calculated on the basis that H is true while P is calculated on the basis that HA is true. We are thus adding together "unlike" items. But there is also another consideration. Hodges and Lehmann phrase it thus:
. . . the reasonable compromise in choosing the critical value will depend on the consequences of the two errors. However, it also depends on the circumstances of the problem in another way. If the null hypothesis is very firmly believed, on the basis of much past experience or of a well- verified theory, one would not lightly reject it and hence would tend to use a very small a. On the other hand, a larger a would be appropriate for testing a null hypothesis about which one is highly doubtful prior to the experiment.4
Fortunately this suggestion provides a solution to some of the mathe- matical difficulties-at least to the person willing to accept some of the "Bayesian" approach to statistics. Define as follows:
PI: Prior probability that H is true,
PII: Prior probability that HA is true.
These prior probabilities reflect the investigator's beliefs prior to looking at the data. If we accept the idea that prior probabilities exist, they provide a link for putting a and P probabilities on a common basis.
The resulting loss function is as follows:
L = PIaCI + PIfCII. Choosing a and P so as to minimize L, given the values of PI, P1I, CI, and CII, leads to an "optimum" or "best" significance level for the person whose decision rule is to minimize expected loss.
Note that this discussion assumes a fixed sample size. Permitting sample size to vary allows calculation of the sample size needed to achieve given levels of the loss function rather than minimizing it for given sample size.
A similar analysis can be pursued by the person who prefers a mini-max or some other decision rule. The significance level will depend on the de- cision rule but the conceptual arguments are the same.
" J. L. Hodges, Jr., and E. L. Lehmann, Basic Concepts of Probability and Sta- tistics, San Francisco, Holden-Day, Inc., 1964, p. 326.
This content downloaded from �������������80.233.34.48 on Sun, 03 Jul 2022 13:29:31 UTC��������������
All use subject to https://about.jstor.org/terms
1384 LESTEn V. MANEERSCHEID
What are the implications of the loss-function approach for the simple case suggested on page 1, where
H: A -A /2
and
HA:: 1- /2 = 2 ?
Suppose that these refer to yields for two varieties of wheat. Suppose all data other than average yield indicate no difference in the two varieties. Then a farmer might well say Pr = PI = 3 and also say that Cr = 0, since if the yields are equal he loses no profit by choosing either variety. However, C11 is 2 bushels per acre and this can be translated into a dollar amount by using price and acreage. Obviously, the farmer wants to minimize P and doesn't care about a-a significant level of 5 percent is obviously wrong! In fact, he should always choose HA and plant variety 1.
Suppose that the decision maker is the head of a seed company and that development costs for variety 1, a new variety, would be high. The manager then needs some idea of how much we will lose if he develops variety 1 and it is no better (CI) compared to the loss if he fails to de- velop it and variety 1 is better (CT). Further, he will want to consult geneticists and other agronomists to evaluate the prior probabilities rather than assuming PI = PII = 3. In spite of the extra complications, the manager may still find the loss function a useful device for selecting an appropriate significance level.
Others involved-the plant breeder, a rival seed company, etc.-might arrive at still different a levels either because they begin with different prior probabilities or because their estimated costs are different. But this should not worry us. Don't we argue that decision makers need to evaluate their environment, talents, etc. to arrive at a "best" decision?
Some will argue that this approach is interesting in theory but im- possible in practice because we cannot estimate Cr and CII. But if we cannot estimate the costs of an error, should we be testing? A basic purpose of testing is to choose between two acts. If we are choosing acts, we should be able to specify the costs which can be measured in either monetary or nonmonetary units.
There remains, however, real difficulty in actually calculating the minimum for the loss function in most cases of practical importance. For example, if we test
H: U1 - 2, HA: /11 /2,
This content downloaded from �������������80.233.34.48 on Sun, 03 Jul 2022 13:29:31 UTC��������������
All use subject to https://about.jstor.org/terms
SIGNIFICANCE LEVELS-0.05, 0.01, OR ? 1385
then the value of P depends on the difference between p, and pt2; C I also depends on this difference.5 Thus, L is a complicated mathematical function. Approximate results can be obtained by using "representative values" for I, - p2. The basic conceptual framework is still a valid reasoning device whether one actually carries out the minimization or only approximates it.
Birnbaum6 has argued that researchers test too many hypotheses and fail to specify the likelihood of various parameter values often enough. Short of publishing the likelihood function, one could publish the maxi- mum value of a that would permit rejection of a relevant hypothesis (or maximum a for several hypotheses). One could go further and publish the P associated with several possible values of a. This would permit the de- cision maker to test, or approximate a test, using his optimum values for a and (3.
Summary
Choosing a significance level is not an arbitrary choice between a 5- percent and a 1-percent level. Rather, a conscious choice can be made-a choice grounded in the principles of management and statistical theory. One must consider
(1) the costs associated with each type of error, (2) the prior probabilities of the hypothesis and the alternative, and (3) the size of the Type II error associated with each significance level.
Incorporating these facts into a decision model yields a "best" significance level. This approach clarifies the relation between testing hypotheses and following actions and helps explain why several decision makers faced with exactly the same observations may reach different decisions.
5P1 must be interpreted carefully, since the probability of exact equality is undoubtedly near 0. We usually have in mind equality up to some small difference.
6 A. Birnbaum, "On the Foundations of Statistical Inference," J. Am. Stat. Assn. 57:269ff, June 1962.
This content downloaded from �������������80.233.34.48 on Sun, 03 Jul 2022 13:29:31 UTC��������������
All use subject to https://about.jstor.org/terms
- Contents
- 1381
- 1382
- 1383
- 1384
- 1385
- Issue Table of Contents
- Journal of Farm Economics, Vol. 47, No. 5, Proceedings Number (Dec., 1965), pp. 1079-1647
- Volume Information [pp. 1635-1647]
- American Farm Economic Association Fellows
- Presidential Address
- Agricultural Economics and Technical Aid in Foreign Development [pp. 1079-1090]
- Agricultural Policy
- Present and Prospective Policy Problems of U.S. Agriculture
- Present and Prospective Policy Problems of U.S. Agriculture: As Viewed by a Political Scientist [pp. 1091-1115]
- Present and Prospective Policy Problems of U.S. Agriculture: As Viewed by an Economist [pp. 1116-1129]
- Comparative Resource, Productivity, and Income Effects of Canadian and United States Farm Policies
- Resources and Productivity [pp. 1130-1139]
- The Costs of and Returns to Farm Policy: A United States-Canadian Comparison [pp. 1140-1151]
- Discussion: Comparing Effects of U.S. and Canadian Farm Policies [pp. 1152-1159]
- The Southern Cotton Economy: Perspectives and Prospects
- The Effects of Government Programs on Southern Cotton Production since 1945 [pp. 1160-1171]
- Discussion: The Effects of Government Programs on Southern Cotton Production since 1945 [pp. 1172-1174]
- Prospects for Southern Cotton Production and Its Place in the Regional Economy, 1965-1975 [pp. 1175-1185]
- Discussion: Cotton and the Southern Economy, 1965-1975 [pp. 1186-1189]
- Contributed Papers: Policy and Resource Development
- Farmers' Perception of the Farm Problem [pp. 1190-1196]
- Leakages Reducing Employment and Income Multipliers in Labor-Surplus Rural Areas [pp. 1197-1202]
- Implications of Alternative Cotton Acreage Allotment Levels [pp. 1203-1207]
- Rates of Growth in Food Requirements during Economic Development [pp. 1208-1212]
- Income Distribution and Resource Allocation
- Income Distribution and Welfare in Agriculture
- Changes in the Income Distribution in Agriculture [pp. 1213-1224]
- The Impact of Government Price and Income Programs on Income Distribution in Agriculture [pp. 1225-1234]
- The Impact of Outmigration and Multiple Jobholding upon Income Distribution in Agriculture [pp. 1235-1244]
- Political Pressures and Income Distribution in Agriculture [pp. 1245-1251]
- Factor Price and Farm Income Inter-Relationships with Implications for Resource Allocation and Income Distribution
- The Benefits of Farm Programs: Incidence, Shifting, and Dissipation [pp. 1252-1263]
- Discussion: The Benefits of Farm Programs: Incidence, Shifting, and Dissipation [pp. 1263-1264]
- Land Values and Agricultural Income: A Paradox? [pp. 1265-1273]
- Discussion: Land Values and Agricultural Income: A Paradox? [pp. 1273-1277]
- Marketing
- Commodity Price Analysis
- An Analysis of Bartlett Pear Prices [pp. 1278-1287]
- A Dynamic Long-Run Model of the Livestock-Feed Sector [pp. 1288-1305]
- On the Distribution of Price Changes at a Bull Auction [pp. 1306-1320]
- Discussion: Commodity Price Analysis [pp. 1320-1322]
- Competition and Concentration in Food Retailing
- Competition in Food Retailing [pp. 1323-1331]
- Concentration in Food Retailing [pp. 1332-1346]
- Discussion: Competition and Concentration in Food Retailing [pp. 1347-1351]
- Marketing Costs and Margins
- Marketing Costs and Margins: Current Use in Agribusiness Market-Structure Analysis [pp. 1352-1365]
- Marketing Costs and Margins: New Perspectives in a Changing Economy [pp. 1366-1376]
- Discussion: Marketing Costs and Margins [pp. 1376-1380]
- Contributed Papers: Marketing, Prices, and Consumption
- Significance Levels. 0.05, 0.01, or? [pp. 1381-1385]
- Negotiated Pricing of Fluid Milk: Role of Federal Orders and Influence on Pricing Policies [pp. 1386-1390]
- Convenience Foods: The Relationship between Sales Volume and Factors Influencing Demand [pp. 1391-1394]
- An Analysis of Adjustments Made by Local Livestock Marketing Agencies to Packers-and-Stockyards Regulation Prohibiting Service Charges [pp. 1395-1399]
- Farm Management
- Operations Research Techniques in Farm Management
- Operations Research Techniques in Farm Management: Survey and Appraisal [pp. 1400-1414]
- Discussion: Operations Research Techniques in Farm Management: Survey and Appraisal [pp. 1415-1417]
- Operations Research Techniques in Farm Management: Potential Contribution [pp. 1418-1426]
- Discussion: Operations Research Techniques in Farm Management: Potential Contribution [pp. 1427-1432]
- The Management Factor in Commercial Agriculture
- The Management Factor in Commercial Agriculture: Is It an Art or a Science? [pp. 1433-1436]
- The Management Factor in Commercial Agriculture: How Can It Be Recognized? [pp. 1437-1439]
- The Management Factor in Commercial Agriculture: The Influence of Values and Goals [pp. 1440-1442]
- The Management Factor in Commercial Agriculture: The Effect of Off-Farm Management Inputs [pp. 1443-1445]
- The Management Factor in Commercial Agriculture: How Can It Be Improved? [pp. 1446-1448]
- The Management Factor in Commercial Agriculture: How Can It Be Taught? [pp. 1449-1451]
- The Management Factor in Commercial Agriculture: New Tools for the Manager [pp. 1452-1456]
- Contributed Papers: Farm Management and Production Economics
- Labor Productivity in Beef Slaughter Plants [pp. 1457-1461]
- A Methodology for Estimating Production Parameters [pp. 1462-1467]
- Factors Related to Farmers' Use of Credit: Least-Squares Analysis of Sample Survey Data [pp. 1468-1473]
- Economic Effects of Taking Parts of Farms for Flood-Control Reservoirs [pp. 1474-1478]
- Agricultural Economics Education
- Training in Mathematics and Statistics for Agricultural Economists
- Desirable Requirements for Graduate Students Specializing in Quantitative Techniques [pp. 1479-1485]
- Desirable Requirements for Graduate Students Not Specializing in Quantitative Techniques [pp. 1486-1496]
- Have We Gone Too Far? [pp. 1497-1503]
- Contributed Papers: Curricula, Collegiate and Adult Education
- Teaching Problems in "Growing" Departments of Agricultural Economics Abroad. A Neglected Research Area [pp. 1504-1509]
- A Look at the Future of Agricultural Economics [pp. 1510-1514]
- Extension Education with Marketing Firms: An Analysis of the Future [pp. 1515-1519]
- Nonparametric Statistics in Marketing Research [pp. 1520-1523]
- Research Methods
- Sampling Frames for Collection of Agricultural Statistics
- A New Area Sampling Frame and Its Uses [pp. 1524-1533]
- Joint Use of Different Sampling Frames [pp. 1534-1539]
- Discussion: Sampling Frames for Collection of Agricultural Statistics [pp. 1539-1541]
- Discussion: Sampling Frames for Collection of Agricultural Statistics [pp. 1542-1544]
- Economic Classification of Farms
- The Contribution of the Economic Classification of Farms to the Understanding of American Agriculture [pp. 1545-1554]
- Problems and Limitations Due to Criteria Used for Economic Classification of Farms [pp. 1555-1564]
- Problems Relating to Criteria for Classification of Farms [pp. 1565-1571]
- Input-Output Analysis in Water Resource Economics
- Input-Output Models and Benefit-Cost Analysis in Water Resources Research [pp. 1572-1579]
- Discussion: Input-Output Models and Benefit-Cost Analysis in Water Resources Research [pp. 1580-1582]
- The Input-Output Model and Regional Water Management [pp. 1583-1591]
- Discussion: The Input-Output Model and Regional Water Management [pp. 1591-1593]
- First Award Paper in the Undergraduate Student Essay Contest
- Impact of Beef Imports on the U.S. Beef Industry [pp. 1594-1600]
- American Farm Economic Association: Reports and Minutes 1964-65 [pp. 1601-1634]