Week 6
Reliability and Validity
In: Reliability and Validity in Qualitative Research
By: Jerome Kirk & Marc L. Miller
Pub. Date: 2011
Access Date: July 15, 2019
Publishing Company: SAGE Publications, Inc.
City: Newbury Park
Print ISBN: 9780803924703
Online ISBN: 9781412985659
DOI: https://dx.doi.org/10.4135/9781412985659
Print pages: 14-21
© 1986 SAGE Publications, Inc. All Rights Reserved.
This PDF has been generated from SAGE Research Methods. Please note that the pagination of the
online version will vary from the pagination of the print book.
Reliability and Validity
Despite the prestige and success of natural science in recent years, application of science as a model for
social “science” is not inevitable. Many have argued that social science has an intrinsically different set of
goals that call for an altogether separate collection of methods. Others (nonscientists, it should be noted)
contend that recent developments in the natural sciences entirely discredit the fundamental notions (such as
objectivity) of an earlier and outdated science.
Yet, whatever their detailed goals, the natural and social sciences share an aspiration to cumulative collective
knowledge that is of interest on its own merits to those other than the friends and admirers of its creators.
This goal is exactly objectivity. In the natural sciences, objectivity is obtained in two ways. First, experience is
reported in such a way that it is accessible to others, for example, when reporting an experiment every effort
is made to describe the way the experiment was carried out, just in case somebody else would like to try the
same thing. Second, the results of the experiment are reported in terms of theoretically meaningful variables,
measured in ways that are themselves justifiable in terms of the relevant theories.
Since Wilhelm Dilthey and George Herbert Mead, the vast majority of social scientists have agreed that
objectivity, in this sense, is an admirable goal. Yet, the description of reliability and validity ordinarily provided
by nonqualitative social scientists rarely seems appropriate or relevant to the way in which qualitative
researchers conduct their work.
It is the purpose of this book to reconcile the means-ends discrepancy. The remainder of this chapter will
pursue the argument that, subject to clearly specifiable differences in goals and practice, social science is in
every sense of the word fully as “scientific” as physics, and has fully as much need for reliability and validity
as any other science.
The “Positivist” View
In recent decades, the social science literature has incorporated a great deal of discussion of an epistemology
called “positivism.” (The term is generally employed by those advocating some alternative view of knowledge,
and often amounts to a straw man.) In its strongest form, positivism denies objectivity as defined here by
assuming not only that there is an external world, but that the external world itself determines absolutely the
one and only correct view that can be taken of it, independent of the process or circumstances of viewing. No
one seriously defends such an ontology, but scholars attentive to the social and cultural construction of social
things (including social science) point out that much research (particularly nonqualitative research) makes
sense only in terms of a set of unexamined positivist assumptions.
Most often, these assumptions pertain to the “naturalness” of the measurement procedure employed. Thus
a survey researcher may interview a large number of people about their political attitudes, and conclude that
SAGE
1986 SAGE Publications, Ltd. All Rights Reserved.
SAGE Research Methods
Page 2 of 7 Reliability and Validity in Qualitative Research
“public opinion” says something.
Such an assertion obviously concerns the investigator's theoretical view of the world as much as it does
the psychic organization of the interviewees. The investigator's theory contains categories not imposed by
the structure of empirical reality. Elements such as “attitudes” and “public opinion” serve rather to organize
understanding of the world. Certainly, political and psychological theories that do not use these constructs (or
even deny their meaningfulness) are possible, and treating analytic devices as though they are facts is the
well-known fallacy of reification.
In response to the propensity of so many nonqualitative research traditions to use such hidden positivist
assumptions, some social scientists have tended to overreact by stressing the possibility of alternative
interpretations of everything to the exclusion of any effort to choose among them. This extreme relativism
ignores the other side of objectivity—that there is an external world at all. It ignores the important distinction
between knowledge and opinion, and results in everyone having a separate insight that cannot be reconciled
with anyone else's.
Metaphysical polemics, often directed against caricatures of the opposing views, largely miss the point. As is
shown in the next chapter, the problem is not so much one of metaphysics as it is a pragmatic question of
the validity of measurements. The survey researcher who discusses attitudes is not wrong to do so. Rather,
the researcher is wrong if he or she fails to acknowledge the theoretical basis on which it is meaningful to
make measurements of such entities and to do so with survey questions addressed to a probability sample of
voters.
For any observation (or measurement) to yield discovery, it must generate data that is (a) not already
known and (b) identifiable as “new” by the theory already in place.2 Most of the technology of “confirmatory”
nonqualitative research in both the social and natural sciences is aimed at preventing discovery. When
confirmatory research goes smoothly, everything comes out precisely as expected. Received theory is
supported by one more example of its usefulness, and requires no change. As in everyday social life,
confirmation is exactly the absence of insight.
In science, as in life, dramatic new discoveries must almost by definition be accidental (“serendipitous”).
Indeed, they occur only in consequence of some kind of mistake.
The Discovery of the New
Henri Becquerel was studying the phenomenon of phosphorescence by exposing metal salts first to the
sun and then to photographic plates. When the sky clouded over for an extended period, he tossed the
uranium salts into a drawer with his photographic materials and knocked off work for a while (Badash, 1965).
“Merde! Je me suis plantér!” he must have muttered when he discovered that the film was ruined, but he was
sufficiently prepared and alert to realize that he had discovered radioactivity.
SAGE
1986 SAGE Publications, Ltd. All Rights Reserved.
SAGE Research Methods
Page 3 of 7 Reliability and Validity in Qualitative Research
More recently, the first men to hear the echo of the origin of the universe thought they were listening to guano.
In 1964, Arno Penzias and Robert Wilson set out to measure the radio waves emitted from different latitudes
in our disk-shaped galaxy. First, they had to identify what portions of the signal they received originated in
the instrument itself. When they received a strong signal in microwave frequencies (where galaxies emit
virtually no radiation), their first move was to devote considerable time and expense to cleaning out the “white
dielectric material” deposited by pigeons in the antenna throat. This produced only a negligible decrease in
signal strength (Penzias and Wilson, 1965). While Penzias and Wilson were in the antenna throat, Dicke et
al. (1965) were proposing the hypothesis that traces of the high temperatures that occurred shortly after the
(or a) “big bang” should still be observable, and predicting that they should sound very much like the signal
heard by Penzias and Wilson. This “cosmic microwave radiation” is now considered the basic evidence for
the truth of the “standard model” of the universe.
The history of the biomedical sciences, too, is full of examples of this particular kind of serendipity. Fleming
(1946) discusses the irritation he felt when some kind of mold got into his staphylococcus culture and ruined
the bacteria. He named the mold penicillin. Miller et al. (1955) inadvertently used a four-year-old bottle of
DNA and discovered the hormonal element that provokes cell division in plants. Paul Ehrlich discovered the
acid-fast method of staining tubercle bacilli only because he accidentally lit the stove on which his culture
was resting; somewhat later, Hans Christian Joachim Gram accidentally grabbed the bottle of Lugol's iodine
instead of the gentian violet, and only some of the bacteria (the “Gram-negative” ones) yielded up their purple
color when he washed them off (Beveridge, 1950). And so on.
These historical examples illustrate how one feature of the hypothetico-deductive model of scientific progress
is misleading: Hypothesis testing is not the only research activity in any scientific discipline. Indeed, the most
dramatic discoveries necessarily come about some other way, because in order to test a hypothesis, the
investigator must already know what it is he or she is going to discover.
The majority of nonqualitative methods in the social sciences are designed primarily for the logical testing of
hypotheses.3 Testing hypotheses is a useful, often essential element of research. It is also a useful model for
the training of researchers, for it accustoms the novice to subject his or her predictions to the risk of empirical
refutation.
As social scientists have come to recognize in recent decades, however, hypothesis testing is appropriate to
only a small proportion of the questions they ask. Qualitative research has always retained the proper ideals
of hypothesis-testing research—sound reasoning and the empirical risking of theory. But, in being intrinsically
exploratory, it explicitly departs from certain strictures of the hypothetico-deductive model.
Formal logic, for instance, is not the only kind of sound reasoning. In fact, formal logic possesses certain
flaws, such as its perverse insistence on the analytic “truth” of such statements as “everybody over twelve
feet tall is named Fred,” and “if Durkheim lives, then he is a rock star.” (Formal logic is merely an arbitrary
set of conventions. One of these conventions is that any false statement implies every other statement.)
The prior explicit statement of hypotheses and null hypotheses is not the only way to subject predictions
SAGE
1986 SAGE Publications, Ltd. All Rights Reserved.
SAGE Research Methods
Page 4 of 7 Reliability and Validity in Qualitative Research
to empirical test. Each time Chauncey greets his old friend Ricky, he does expose himself to the unlikely
possibility that he has mistaken a perfect stranger for Ricky. Much social research deliberately seeks out such
“embarrassing” interaction; Agar (1982) has applied the hermeneutic term “breakdown” to these informative
gaffes. The general commitment of qualitative researchers to interacting with their objects of study on the
latter's home ground strongly encourages the discovery that what the researcher takes for granted at his
home does not apply in the new situation. The anthropologist who returns alive from some exotic place must
know something nontrivial about it.
Relaxing certain of the narrow definitions of the hypothetico-deductive model, then, facilitates discovery of the
new and unexpected. It would be an error, however, to drop the scientific concern for objectivity. The scientific
credo is one good way to permit the resolution of a conflict of opinion. It is not the only way; the scholastic
solution, still prevalent in many disciplines called “humanities,” relies on argument and rhetoric rather than
on argument and demonstration. Another alternative is argumentum ad imperium—“might makes right.” One
attractive feature of the scientific solution is that it is an extension of the ordinary processes of inference that
people use in everyday life (Piaget, 1954). As Wilhelm Dilthey pointed out, it is impossible to account for the
observed reality of human interaction without acknowledging that human beings have an innate capacity to
understand one another. Thus striving forever-greater objectivity is as much a part of people's everyday social
inference as it is of their everyday physical inference.
Components of Objectivity
The analogy between qualitative research and other scientific methods and traditions has its limitations. Yet
the ability of practitioners of certain kinds of scientific endeavor to talk about what it is they do is much
more advanced than that of qualitative researchers (Van Maanen, 1979). Indeed, a primary purpose of this
monograph is to remedy that situation. It is often useful to examine methodological formulations from other
traditions to assess their adaptability to qualitative research.
One appropriate and useful device first used in psychometrics (the field of tests and measurements) is the
partitioning of objectivity into two components: reliability and validity. Loosely speaking, “reliability” is the
extent to which a measurement procedure yields the same answer however and whenever it is carried out;
“validity” is the extent to which it gives the correct answer.4 These concepts apply equally well to qualitative
observations.5
A standard physical example of reliability and validity involves the use of thermometers to measure
temperature. A thermometer that shows the same reading of 82 degrees each time it is plunged into
boiling water gives a reliable measurement. A second thermometer might give readings over a series
of measurements that vary from around 100 degrees. The second thermometer would be unreliable but
relatively valid, whereas the first would be invalid but perfectly reliable.
The standard example of the thermometer is neither very qualitative nor very familiar to social scientists. A
SAGE
1986 SAGE Publications, Ltd. All Rights Reserved.
SAGE Research Methods
Page 5 of 7 Reliability and Validity in Qualitative Research
rather homier (if artificial) example occurs when Chauncey sees a blond man across the room at a large
cocktail party, and has the uncertain feeling that he knows him from somewhere. He looks again, sees the
same thing, and continues to have the feelings of uncertainty. Chauncey has perfectly reliable data, and it is
of no use. Is his feeling valid? (As in ordinary language, the technical use of the term “valid” is as a properly
hedged weak synonym for “true.”)
Chauncey might ask himself whether it seems he would know a person who looks like that, moves like that,
dresses like that, and so on. Is the blond, in other words, apparently the sort of person Chauncey would
know? Or he might ask himself subtler questions, such as whether people who look like that frequent the
places he does. At a cocktail party, such a search for validity will probably fail because the guest list is
deliberately socially homogeneous, and any two members are likely to have been in the same other places.
So Chauncey must resort to empirical research if he is to discover whether his feeling is useful.
Perhaps Chauncey's least costly pilot project would be to ask the host what the blond man's name is, or
whether in fact the host has relevant information (e.g., that the blond has just arrived in the country for the
first time from a place Chauncey has never been, or that the three of them had a conversation last week).
Another strategy would be to make ambiguous eye contact with the blond, in such a way as to assign to the
other responsibility for acknowledging the acquaintance. Ultimately, it may prove necessary to confront him
and ask, “Don't I know you?”
If Chauncey devotes as much time to worrying about his problem as it requires to read about it, we would
conclude that he is socially inept, or at least painfully shy. This is one of the problems of methodological
discussion: detailing the inferential steps in getting the job done looks picky and absurd. If we suppose this
computation passes very quickly through Chauncey's mind as he gives the blond a second glance, we might
better empathize with him. When discussing the validity checks of social research, it is useful to remember
that a careful description of what is done generally tends to suggest an obsessive preoccupation with detail on
the part of the researcher. This is an artifact of the fact of description, not a recommendation for compulsive
behavior.
Objectivity, though the term has been taken by some to suggest a naive and inhumane version of vulgar
positivism, is the essential basis of all good research. Without it, the only reason the reader of the research
might have for accepting the conclusions of the investigator would be an authoritarian respect for the person
of the author. Objectivity is the simultaneous realization of as much reliability and validity as possible.
Reliability is the degree to which the finding is independent of accidental circumstances of the research, and
validity is the degree to which the finding is interpreted in a correct way.
Reliability and validity are by no means symmetrical. It is easy to obtain perfect reliability with no validity at all
(if, say, the thermometer is broken, or it is plunged into the wrong flask). Perfect validity, on the other hand,
would assure perfect reliability, for every observation would yield the complete and exact truth.
As a means to the truth, social science has relied almost entirely on techniques for assuring reliability, in part
SAGE
1986 SAGE Publications, Ltd. All Rights Reserved.
SAGE Research Methods
Page 6 of 7 Reliability and Validity in Qualitative Research
because “perfect validity” is not even theoretically attainable. Most nonqualitative research methodologies
come complete with a variety of checks on reliability, and none on validity.
http://dx.doi.org/10.4135/9781412985659.n2
SAGE
1986 SAGE Publications, Ltd. All Rights Reserved.
SAGE Research Methods
Page 7 of 7 Reliability and Validity in Qualitative Research
- Reliability and Validity
- In: Reliability and Validity in Qualitative Research