answer
Case-Control Studies
1
Overview of Lesson
Odds vs Probability
Overview of Case-Control Studies
Advantages
Disadvantages
Analysis
Calculating Odds Ratio (OR)
2
Probability vs Odds
Probability
Range of values: 0 – 1
Expressed in %
Probability is defined as the fraction of desired outcomes in the context of every possible outcome. Probabilities have a value between 0 and 1, where 0 is no chance of the desired outcome and 1 is guaranteed desired outcome. Probabilities are usually given as percentages.
Example: When flipping a coin, the coin will either land on heads or tails. Let’s say my desired outcome is heads. Each time I flip the coin, I will either get my desired outcome (heads) or an undesired outcome (tails). Because there are only two possible outcomes – heads or tails -- the probability of each flip is Heads/(Heads + Tails). Numerically, that would be 1/2, or 50%. Thus, each time I flip a coin, the probability of getting heads is 50%.
Let’s say I had a three-sided coin that has two heads and one tail. Thus, with every flip there are three possible outcomes – heads, heads, or tails. Two of those outcomes are my desired outcome. Thus I have a 2/3 -- or 67% probability – of getting heads each time I flip my three-sided coin.
Note: Probability calculation includes the numerator in the denominator of the calculation, because probability considers the context of the entire event.
4
Probability of getting a yellow M&M?
8 red
4 blue
3 orange
4 yellow
2 green
3 brown
If you were to close your eyes and pick one of the M&Ms from this spilled bag, what is the probability that you would get a yellow M&M?
5
Calculating Probability
The probability of getting a yellow m&m is calculated by taking the total number of yellow m&ms (4) and dividing it by the total number of m&ms (24) – which is all possible outcomes
If you said the probability of me getting a yellow m&m was 17% (or 16.7%), you’d be correct!
6
Odds
Range of values: 0 to ∞
Ratio: desired outcome to undesired outcome
Odds can have any value from zero to infinity and they represent a ratio of desired outcomes versus undesired outcomes. Odds can be expressed in two different ways: ‘odds in favor’ and ‘odds against’. ‘Odds in favor’ are the odds describing if an event will occur, while ‘odds against’ will describe if an event will not occur. With odds, the numerator is NOT a part of the denominator.
So, let’s go back to the coin example. For a regular coin, each flip will either be heads or tails – one desired outcome, one undesired outcome. Thus, the odds in favor of heads for a coin flip is 1:1. The odds against heads is also 1:1.
For my three-sided coin with two heads, the odds for heads is 2:1, whereas the odds against heads are 1:2
7
Odds of getting a yellow M&M?
8 red
4 blue
3 orange
4 yellow
2 green
3 brown
So, back to the m&ms… If I were to close my eyes and pick one, what are the odds that I’d get a yellow one? What are the odds that I wouldn’t?
8
Odds of getting yellow
The odds in favor of getting yellow are 4:20 or 1:5. Notice how the yellow M&Ms are in the numerator with the “in favor of” calculation.
9
Odds against getting yellow
The odds against getting yellow are 20:4 or 5:1. Notice how the yellow M&Ms are in the denominator for the against calculations.
10
M&Mmmmmmmm
Events that have a high probability, also have high odds. And, there is a mathematical relationship between the two, which goes beyond the scope of this mini-lesson and my understanding of math. But the reason for this information is to help you better understand the calculations used in this lesson on case-control studies.
11
Let’s learn about Case-Control Studies!
12
Starts in Present with D
In case-control studies, people get into the study based on their disease status and THEN information on past exposures is collected. People who are selected because they are D+ are called cases, and those who are selected because they are D- are called controls. Cases can be either incident (newly diagnosed) or prevalent (existing at a point in time) cases of the disease.
A case-control study looks at the odds of being exposed among cases and controls to tell us the likelihood of exposure based on disease status.
One famous epidemiologist calls ca-co studies “trohoc studies” because they are backwards cohort studies. The advantages of case-control studies are the disadvantages of f-up studies and the disadvantages of case-control studies are the advantages of f-up studies. Let’s take a look, shall we?
13
Advantages of Ca-Co
Suitable for rare and common diseases
Economical (Time and $$)
Allow testing of multiple hypotheses
Extremely efficient
Very valuable in controlling confounding and evaluating information
14
Case-control studies offer a solution to the difficulty of studying diseases with very long induction or latency periods, as investigators can identify affected (D+) and unaffected (D-) individuals and then look backward in time to assess their antecedent exposures, rather than having to wait a number of years for the disease to develop as with f-up designs. This also makes them ideally suited for investigating rare diseases, which would otherwise need to follow tremendously large numbers of individuals in order to accumulate a sufficient number of D+s. This process is also what makes case-control studies efficient, in terms of both time and costs, relative to other analytic approaches.
Ca-Co also allow for the evaluation of a wide range of potential etiologic exposures that might relate to a specific disease as well as the inter-relationships among these factors. Therefore case-control studies can be used to test specific hypothesis in the absence of an a priori hypothesis and explore a range of exposures among affected and non-affected individuals.
This study design is especially useful in the early stages of the development of knowledge about a particular disease or outcome of interest.
Disadvantages
Susceptible to bias
Study only one disease
Not good for rare exposures
Temporal relationship between E and D unknown
Cannot measure incidence
Sometimes difficult to find a control group
15
The major disadvantage of case-control studies is that both the exposure and the disease have already occurred at the time the participants enter into the study which makes this design especially susceptible to bias.
A fundamental problem with ca-co studies is that there is no way to prove to anyone that you have selected a proper control group. As a result, this design is particularly susceptible to bias in the selection of either the cases or the controls into the study on the basis of their exposure status, as well as from differential reporting or recording of exposure information between study group based on the disease status.
Another disadvantage in that you can only study one disease. It also is not good for rare exposures, and the time sequence between E and D may be uncertain
Ca-co studies also cannot measure incidence (or mortality) as the investigation begins with the selection of subjects based on D status.
Odds Ratio (OR)
16
A case-control study compares the odds of exposure among cases (D+) relative to the odds of controls (D-) who are E+. Let’s take a look at the 2x2 table to show how the each of these odds are calculated…
Odds of E+ among Cases
| Case (D+) | Control (D-) | |
| E+ | a | b |
| E- | c | d |
| Total | a+c | b + d |
So, what are the odds in favor of being exposed if you are a case? I’ll give you two choices:
a/(a+c)
a/c
Scroll down for the answer.
#2 is correct. If you said #1, think back to the M&M example, where the odds in favor of selecting a yellow M&M was the number of yellow M&Ms divided by all of the M&Ms EXCEPT the yellow ones….
17
Odds of E+ among Controls
| Case (D+) | Control (D-) | |
| E+ | a | b |
| E- | c | d |
| Total | a+c | b + d |
So, what are the odds in favor of being exposed if you are a control? If you said anything other than b/d, please review the odds information or email me directly for assistance.
18
Calculaing OR
OR = odds of E among Ca/odds of E among Co
= (a/c) / (b/d)
= (a x d) / (b x c)
= ad/bc
So to calculate OR, we could calculate the odds of E+ among cases and the odds of E- among controls, then calculate a ratio of the two odds. Or, we can do the math magic on the slide and boil it down to a very simple equation of ad/bc.
Note: It’s really not math magic. When dividing fractions by fractions, we flip the denominator and multiply the two fractions. Here’s a link to a refresher. https://www.wikihow.com/Divide-Fractions-by-Fractions
19
Concordant Cells
| Case (D+) | Control (D-) | |
| E+ | a | b |
| E- | c | d |
| Total | a+c | b + d |
a and d are called “concordant cells”
20
Discordant Cells
| Case (D+) | Control (D-) | |
| E+ | a | b |
| E- | c | d |
| Total | a+c | b + d |
B and C are called discordant cells
21
OR = Concordant/Discordant
| Case (D+) | Control (D-) | |
| E+ | a | b |
| E- | c | d |
| Total | a+c | b + d |
So, OR = concordant cells divided by discordant cells. Fun times!
22
Interpretation of OR
When OR = 1.00
Exposure does not affect odds of disease
This is the NULL
When OR < 1.00
Negative association
Odds of exposure lower among cases
When OR > 1.00
Positive association
Odds of exposure higher among cases
23
An odds ratio of
• 1.0 (or close to 1.0) indicates that the odds of exposure among cases are the same as, or similar to, the odds of exposure among controls. Thus, the exposure is not associated with the disease.
• Greater than 1.0 indicates that the odds of exposure among cases are greater than the odds of exposure among controls. The exposure might be a risk factor for the disease.
• Less than 1.0 indicates that the odds of exposure among cases are lower than the odds of exposure among controls. The exposure might be a protective factor against the disease.
As with interpreting RR, the further away from 1, the stronger the association, be it positive or negative.
Example
We want to determine how much more common occupational benzene exposure is among persons with leukemia. We conducted a case-control study in which 85 of the 125 workers with leukemia were exposed to benzene on the job. Conversely, among the 125 controls, only 40 had been exposed.
24
Step 1: Set up data table
| Case (D+) | Control (D-) | Total | |
| E+ | 85 | 40 | 125 |
| E- | 40 | 84 | 125 |
| Total | 125 | 125 | 250 |
25
Note: among the cases, we knew that 85 had been exposed. That is our cell a. To get our cell c, we subtract that number from the total number of cases, which we were given. We were given the value for cell b (exposed among controls), and determined cell d by subtracting that number from the total number of controls, which we were given.
Step 2: Calculate OR (ad/bc)
26
OR = ad/bc
= (85 x 85) / (40 x 40)
= 7225/1600
= 4.5
Step 3: Interpret
The odds of benzene exposure among cases is 4.5 times as great as the odds of exposure among controls.
27
The odds of benzene exposure among cases is 4.5 times as great as the odds of exposure among controls.
Word of Caution
Very common
Very commonly wrong
Case-Control studies make up about 80 – 85% of all published studies. Unfortunately, many of these studies really are not case-control studies, despite being touted and published as such. This is because the case-control design is difficult to grasp and perform. Most commonly we see data from cross-sectional studies being used as a case-control study.
28