Biostatistics Assignments
IHP 525 Milestone Two Template
READ ENTIRE TEMPLATE DIRECTIONS BEFORE FILLING IN TEMPLATE
Do NOT delete anything from this template.
1) Student Name:
State the question you will pursue. (This should be copied and pasted from the list of questions and should be the same question submitted in week 2 unless you have changed your question.)
2) Question of Interest (paste):
Directions for following page(s):
· Fill out the table below for EACH variable of interest for YOUR QUESTION. (Your possible variables of interest are gender, length of stay, age, BMI, follow-up time, and survival.)
· Include ONLY the variables that are relevant to your question of interest. (If it’s not mentioned in your question directly, it’s not relevant.)
· Each variable should take up ONE row.
· Only use the rows you need; you should NOT fill every row of the table.
· You will have more than one variable; it will be quantitative or categorical. Choose an appropriate row for your variable type.
· Data set comes from the file uploaded to StatCrunch in Module One.
|
Variable type (categorical or quantitative) Do NOT change anything in this column. |
Variable name (one variable per row, don’t use all rows) Your choices for names are gender, length of stay, age, BMI, follow-up time, and survival. |
Descriptive statistics *For THIS part of the final project, do not subdivide the data for any quantitative variable in each row based on any other variable. For example, do NOT find the mean length of stay separately for males and females for this part of the final project. |
Key features
Creating a histogram or bar chart may help, but do NOT include either of these in this table. |
|
|
Categorical |
Variable name: |
Frequency of each value of the variable:
Relative Frequencies (percentages):
|
Frequency Definition/formula:
Relative Frequency Definition/formula: |
Is there an equal number of data points for each value of the variable (e.g., is there an equal number of males and females)?
|
|
Quantitative |
Variable name:
|
Mean value:
Median value:
Range value:
Standard Deviation value:
|
Mean Definition/formula:
Median Definition/formula:
Range Definition/formula:
Standard Deviation Definition/formula: |
Is this variable symmetric or skewed?
Describe any outliers:
|
|
Categorical |
Variable name: |
Frequency of each value of the variable:
Relative Frequencies (percentages):
|
Frequency Definition/formula:
Relative Frequency Definition/formula: |
Is there an equal number of data points for each value of the variable (e.g., is there an equal number of males and females)? |
|
Quantitative |
Variable name: |
Mean value:
Median value:
Range value:
Standard Deviation value:
|
Mean Definition/formula:
Median Definition/formula:
Range Definition/formula:
Standard Deviation Definition/formula: |
Is this variable symmetric or skewed? (Check histogram but do not include histogram.)
Describe any outliers:
|
|
Categorical |
Variable name: |
Frequency of each value of the variable:
Relative Frequencies (percentages):
|
Frequency Definition/formula:
Relative Frequency Definition/formula: |
Is there an equal number of data points for each value of the variable (e.g., is there an equal number of males and females)? |
|
Quantitative |
Variable name: |
Mean value:
Median value:
Range value:
Standard Deviation value:
|
Mean Definition/formula:
Median Definition/formula:
Range Definition/formula:
Standard Deviation Definition/formula: |
Is this variable symmetric or skewed? (Check histogram but do not include histogram.)
Describe any outliers:
|
DO NOT list or discuss descriptive statistics in this space. Use the table above, as directed.
DATA COLLECTION:
Examine the statistical report description (found in module 3 milestone rubric or announcements) to find the source of the data. The data we are using is a smaller subset of the original data. Do you think this smaller data set (100 observations) was randomly selected from the larger data set in the original study? Please give a reason for your answer.
Answer here:
Analyze the limitations of the data set you were provided and how those limitations might affect your findings.
· Limit your response to the data relevant to your question of interest.
· For example, only using two variables in the data set is NOT a limitation of the data in your question of interest. It may be a limitation of the study or question of interest, but it is NOT a limitation of the data you have been provided for your question of interest. Don’t discuss this here.
· Comorbidities or missing patient information are NOT appropriate to discuss here. Not including these elements might be a limitation of the study or question of interest, but it is NOT a limitation of the data you have been provided for your question of interest.
· Other treatment information is NOT appropriate to discuss here. Not including these elements might be a limitation of the study or question of interest, but it is NOT a limitation of the data you have been provided for your question of interest.
· Possible limitations:
· Consider any outliers and how they could affect the analysis (if there are any).
· Consider the scope of data collection (time frame, location) and the question of interest. Is the sample adequate to answer the general question of interest?
· ONE clear limitation is sufficient.
Write limitation discussion here: