Biostatistics
COH 602: Biostatistics SAS Data Analysis Project: Descriptive & Inferential Data Analysis Final Paper
Instructions & Grading Criteria
Dr. Kemi A. Wosu
Purpose & Content of Analysis Paper
This final paper is the culmination of all of your SAS data analysis work in this course. The purpose of
this paper is to show your expertise in performing descriptive and inferential SAS analyses.
Additionally, writing your paper will give you some experience of how to correctly discuss your work
and results in a clear and concise manner.
Your completed final analysis project will be graded based on the following criteria:
1. Introduction (10 points)
2. Methods (20 points)
3. Discussion (45 points)
4. Conclusion (10 points)
5. Proper grammar and sentence structure used for clear and concise writing (5 points)
6. SAS results printed out in RTF format and results being reported in paper highlighted (5 points)
7. SAS code copied and pasted into the same Word document as your paper (5 points)
Your paper should be in APA format – that is, double-spaced, with a title page, header with page
number and 1 inch margin on all sides. You can find a good reference to APA style here.
https://owl.english.purdue.edu/owl/resource/560/01/
Parts 1-4 of your paper should be a minimum of 6 pages, double-spaced (not including tables). After 10
pages I will stop reading (seriously). If it is much less than 6 pages, you probably left something
important out. If it is over 10 pages, you are probably going off-topic.
Part 6 should include about 2 pages of SAS code copied and pasted into the Appendix of your paper. If
it isn’t that long, you probably left something out. The output may be much longer, depending on your
research question and the data set used.
Part 7 should include approximately 25 pages of SAS results printed out in RTF format and the specific
sections where you obtained the results you are reporting in the paper highlighted, so that it is very clear
where your answers are coming from. This part is essential, so that I can double-check and confirm that
you have run your analyses in the correct manner and that you are reporting the appropriate results. If
your results are not highlighted, and I have to hunt for your answers, you will lose points on this section.
Descriptive Analysis
Make sure your paper has four distinct parts (shown in the table below) that flow well and transition as
seamlessly as possible into one another.
The main point of this final paper is to state the conclusions you’ve reached from all the various
analyses you have performed, and use these conclusions to answer your overall research question. You
should write the paper as if your reader does not have an extensive background in biostatistics. When
you write as if your reader is a novice, this will force you to be very detailed in your explanations, and
will help you to write in a much clearer and well-thought-out manner.
Inferential Analysis
After you have written up the findings for your descriptive data analysis, your next step is to conduct
inferential data analysis and report those findings. Now you are moving from just describing the
individual variables of interest to investigating whether there are statistically significant relationships
among your variables of interest. These inference tests will help you answer your main research
question.
The purpose of this assignment is to provide a second rough draft of another main section of your
analysis paper (the inferential statistics), so that you can receive feedback on your methods. Your
inferential data analysis should include more than one inferential test. This could just be an ANOVA
with an examination of the overall F-test and discussion of the post-hoc tests. It could also be a chi-
square and a logistic regression. Any of the tests of inference discussed in Chapters 7 and 9 of your
textbook are fair game for you to use.
Please use the following as a guideline for what you are required to include in the paper and how your
paper will be graded:
Criteria Maximum Points Possible
Introduction:
a) Short description of your research question and its relevance to
public health – it is recommended to do a quick literature
review and include information from other peer-reviewed
journal articles to support the statements you are making
regarding this paper [Remember that if you use any outside
sources for this paper, that you cite them, APA-style, in the
Reference section at the end of the paper (before the
Appendix)]
b) Statement of the dataset you will use to answer the question
and a short description of the parent study that this dataset
originated from
10 points
a) 5 points
b) 5 points
Methods:
a) Names of the variables you will use to answer the research
question
b) Descriptions of the variables you will use to answer the
research question
c) Description of the descriptive statistics methodology [(1)
Describe the SAS analyses you did to produce descriptive
statistics that summarize the study sample based upon the
characteristics (variables) you are interested in; (2) Discuss
why you chose these methods (should be based upon the
variables you are using and the question you are answering)]
d) Description of the inferential statistics methodology [(1)
Describe the SAS analyses you did to produce the inferential
statistics that assess the relationships between/among the
characteristics (variables) that you are investigating in order to
answer your research question for the study sample; (2)
Discuss why you chose these methods (this should be based
20 points
a) 5 points
b) 5 points
c) 5 points
d) 5 points
upon the variables you are using and the question you are
answering)]
Discussion:
a) Discussion of the results of the descriptive statistics analysis
done using SAS [(1) Report the appropriate statistics for
numeric variables, based upon whether the variable is normal
or skewed; (2) Report the appropriate statistics for categorical
variables; (3) Present graphical (visual) representations of each
variable of interest, AND make sure to discuss/explain what
the graph means within the discussion section. You can copy
and paste this graph or chart directly from SAS into the body
of your paper; (4) Present a table of results within the body of
your paper that summarizes all the descriptive statistics you
have discussed in your results section. Make sure to create
your own table; DO NOT just copy and paste a table from
SAS into your paper]
b) Discussion of the results of the inferential statistics analysis
done using SAS [This must be done for EACH test you run
in SAS: (1) State the null and alternate hypotheses for each test
of inference; (2) State the appropriate test statistic SAS will
calculate for that particular test (i.e., t-values, z-values, chi-
square values, F-values, r-values, or β-values); (3) State the
decision rule – when working in SAS you should be using
either the p-value decision rule OR the confidence interval
decision rule; (4) Report the test-statistic value produced by
SAS; (5) State the conclusion regarding the null hypothesis
and how you reached this conclusion (the statistical evidence);
then, based on your conclusion, answer the original research
question]
45 points
a) 20 points
b) 25 points
Conclusion: 10 points
a) 5 points
a) State the final conclusions of what you’ve learned about your
study sample, based upon the findings from your descriptive
statistics analysis; discuss how these findings relate to the
original research question
b) State the final conclusions of what you’ve learned about your
study sample from your inferential statistics analysis, and
answer the original research question based upon these
findings
b) 5 points
Proper grammar and sentence structure used for clear and concise
writing
5 points
SAS Results that show where the results you are actually reporting
came from should be printed out into RTF format and the sections
where you directly obtained results should be highlighted, so that it is
clear to see where you obtained your final answers
5 points
SAS Codes used for your analyses (copied and pasted into the
Appendix of your paper)
5 points
TOTAL SCORE 100 points
PLEASE REMEMBER:
Your paper should be in APA format – that is, double-spaced, with a title page, header with
page number, and 1 inch margins on all sides. You can find a good reference to APA style here:
https://owl.english.purdue.edu/owl/resource/560/01/
A data dictionary has been created for these three datasets, which will give you detailed
information about the variables available for you to use in these datasets for your analysis
project. The data dictionary can be found on the class website.
Additional resources to help you with the data analysis (SAS coding) AND writing process for
your analysis project can be found on Blackboard. You have many resources to help you with
this project. It is up to you to use them.
Paper Length
The length of your paper should be a minimum of 5 pages, double-spaced (not including tables, graphs,
or charts). After 10 pages I will stop reading (seriously). If it is much less than 5 pages, you probably left
something important out. If it is over 10 pages, you are probably going off-topic.
Descriptive Tables
You can, and should, add tables (that you create) to your paper, but make sure you have achieved your
minimum of 6 pages of text. In other words, do not use visual representations of your findings as filler
for your paper. I will be able to notice any attempts to pad your paper. Tables are a good way to illustrate
what you are writing about in your Discussion section, and you must make sure to clearly discuss any
tables presented in your paper.
Tables are a good way to illustrate what you are writing about in your Discussion section. Any results
or findings you present in a table, within the body of your paper, must be explained and discussed.
SAS Codes & Output Submission
It is important that you submit your SAS Codes and Output, so that I can make sure you have run the
proper analyses. This is a way to double-check your work, and it is necessary. Your output will take up
many pages, but your code should just take a few pages.
Copying Codes
To copy your SAS Codes in order to submit them, highlight the codes you want and copy them by
pressing and holding the “Ctrl” key and depressing the ”C” key simultaneously on your keyboard if
using Windows Operating System.
If you are using the Mac Operating System, highlight the codes you want and copy them by pressing and
holding the “Command” key and depressing the ”C” key simultaneously on your keyboard if using
Windows Operating System.
Pasting Codes
Open a Word document and position the mouse cursor where you want to paste the text you previously
copied. Press the “Ctrl” (or “Command”) key and the “V” key simultaneously to paste the text.
Once you have copied and pasted all your codes to the document, you can save it and submit it in the
Dropbox.
Printing SAS Results in SAS Studio
These steps for printing can be carried out for SAS Results and should be used when submitting
your Results and Code with the final paper.
How to Insert Symbols (Greek Letters) into Your Document (Microsoft Word
Only)
How to Create Equations in Your Document (Microsoft Word Only)