Statistical Data Analysis Project.
Page 1 of 4
required to submit a PowerPoint presentation, and every member is required to participate in
the presentation and discussion.
There will be three tasks in this project. The first task is to select a domain of application and a dataset
to analyze for the purpose of solving a real-life problem. The second and third tasks are described below.
Project learning outcomes
● Perform statistical analyses that show the depth of student understanding of the
statistical methods relevant to the topic.
● Present complex analyses and draw appropriate conclusions with clarity and accuracy.
● Demonstrate understanding of theoretical and practical aspects of analyzing statistical
data and solving real-life problems.
● Demonstrate an ability to work independently and in a group.
● Work with relatively little guidance and be able to exercise initiative.
Task 1:
1. Select one “clean” dataset and propose an analysis of its available variables.
Example of an available dataset: Medical Cost Personal Datasets | Kaggle
2. Propose at least three (3) questions and answer them, or make decisions, using
statistics and quantitative techniques. Example questions: Can you accurately
predict insurance costs? Are there any relationships between variables? How are
the objects distributed in the dataset for a given variable?
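To make the example questions more concrete, the sketch below shows one possible way to approach "Are there any relationships between variables?" and "Can you accurately predict insurance costs?" using a Pearson correlation and a least-squares line. The numbers are made-up toy values, not taken from the Kaggle dataset, and the names `ages`, `charges`, `pearson_r`, and `fit_line` are assumptions chosen for illustration only.

```python
from statistics import mean

# Hypothetical toy values for illustration; NOT taken from the Kaggle dataset.
ages = [19, 25, 33, 41, 52, 60]
charges = [1800.0, 2500.0, 4100.0, 6200.0, 9800.0, 12500.0]


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def fit_line(x, y):
    """Least-squares slope and intercept for the model y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum(
        (a - mx) ** 2 for a in x
    )
    intercept = my - slope * mx
    return slope, intercept


r = pearson_r(ages, charges)
slope, intercept = fit_line(ages, charges)
predicted = intercept + slope * 45  # predicted charge for a hypothetical 45-year-old

print(f"correlation r = {r:.3f}")
print(f"charges ~ {intercept:.1f} + {slope:.1f} * age")
print(f"predicted charge at age 45: {predicted:.1f}")
```

A single-variable line is only a starting point. With a real dataset, a group would likely consider more predictors and check how well the model fits before claiming that costs can be predicted accurately.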
Task 2:
1. Explain the data collected: context, source of data…
2. Describe the data statistically (central tendency, dispersion, distribution, correlation). Use visual
representations of the data.
3. Identify the target population.
4. Perform data reduction or sampling if needed, and justify your method.
5. Identify the methods that you will use to answer the three (3) questions or decisions that you
defined previously (hypothesis testing, regression, forecasting…)
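As an illustration of steps 2 and 4 above, here is a minimal sketch that computes central tendency and dispersion measures for one numeric variable and then draws a simple random sample. The values are made-up toy data, and the helper names `describe` and `simple_random_sample` are hypothetical; simple random sampling is only one of several methods a group could choose and justify.

```python
import random
from statistics import mean, median, stdev

# Hypothetical toy values standing in for one numeric column of a dataset.
charges = [1800.0, 2500.0, 4100.0, 6200.0, 9800.0, 12500.0, 3300.0, 5100.0]


def describe(values):
    """Return central tendency and dispersion measures for a numeric column."""
    return {
        "n": len(values),
        "mean": mean(values),
        "median": median(values),
        "stdev": stdev(values),  # sample standard deviation (divides by n - 1)
        "range": max(values) - min(values),
    }


def simple_random_sample(values, k, seed=0):
    """Draw k items without replacement; the fixed seed makes the draw repeatable."""
    rng = random.Random(seed)
    return rng.sample(values, k)


summary = describe(charges)
sample = simple_random_sample(charges, k=4)

print("full data:", summary)
print("sample   :", describe(sample))
```

Comparing the summary of the sample with the summary of the full data is one simple way to support a claim that the sample is representative.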
Task 3:
Analyze the findings of Task 2 and build your conclusion.
Final Presentation Outline
Introduction
Goal of the project
Problem or Decision-Making statement
Methodology
Analysis
Conclusion
Report and presentation Grading Rubrics
Criteria \ level of assessment: D- Poor (25%), C- Fair (50%), B- Good (75%), A- Excellent (100%)
Report [CLO 3.1-V2]: Total Mark out of 30
Goal of Project (Max: 2 marks)
D- Poor (25%): The goal and/or motivation is missing from the report.
C- Fair (50%): The goal and/or motivation for the survey or research project is unclear or
inaccurate and is missing pertinent information.
B- Good (75%): The goal and motivation for conducting the survey or research project is
stated but ambiguous. Contains some, but not all, pertinent information.
A- Excellent (100%): The goal and motivation for conducting the project is clearly and
accurately stated and contains all pertinent information.
Target Population (Max: 2 marks)
D- Poor (25%): The target population is not identified.
C- Fair (50%): The target population is identified but is inaccurate.
B- Good (75%): The target population is identified but is unclear.
A- Excellent (100%): The target population is clearly and accurately identified.
Sample and Method of Sampling (Max: 4 marks)
D- Poor (25%): The Sample, method of sampling and the analysis of the appropriateness of both
the sample and method of sampling are missing.
C- Fair (50%): Either the Sample or method of sampling is missing from the report. The analysis
of the appropriateness of either the sample or sampling method is missing or shows little to no
understanding of the terms and procedures.
B- Good (75%): The Sample and method of sampling is identified and accurate. The analysis of the
appropriateness of both the sample and method of sampling lacks clarity and/or shows limited
understanding of the terms and procedures.
A- Excellent (100%): The Sample and method of sampling is clearly identified and accurate. An
analysis of the appropriateness of both the sample and method is clearly written and based on a
full understanding of the terms and procedures.
Data Collection (Max: 4 marks)
D- Poor (25%): The data collected and method of collecting data are missing from the report.
Analysis is missing.
C- Fair (50%): The data collected and method of collecting data are stated but unclear and/or
inaccurate. Visual representations of the data are not noted, and analysis is missing.
B- Good (75%): The data collected and method of collecting data are stated but unclear and/or
inaccurate. Visual representations of the data are noted but, the analysis of the
appropriateness of the method of presentation is missing or lacks support
A- Excellent (100%): The data collected and method of collecting data are clearly stated and
accurate. Visual representations of the data are noted and, analysis of the appropriateness of
the method of presentation is clear and supported.
Personal analysis (12pts)
D- Poor (25%): Evaluation of the methods, organization, statistics and presentation of data,
along with concerns or questions are missing from the report.
C- Fair (50%): Evaluations of the methods, organization, statistics, and presentation of data
are included but views are not supported. No reference is made to concerns or questions
B- Good (75%): Evaluations of the methods, organization, statistics, and presentation of data
are included but are not specific and poorly supported by criteria outlined in textbook and in
class. Concerns and/or questions are not specific and/or not supported.
A- Excellent (100%): Evaluation of the methods, organization, statistics, and presentation of
data are clear and supported by criteria outlined in textbook and in class. Concerns and/or
questions are clearly stated and supported.
Conclusion (Max: 6 marks)
D- Poor (25%): The conclusion of those conducting the study is missing.
C- Fair (50%): The conclusion of those conducting the study is inaccurate.
B- Good (75%): The conclusion of those conducting the study is not clearly stated.
A- Excellent (100%): The conclusion of those conducting the study is clearly stated and accurate.
Report [CLO 3.1-V2] Total Mark out of 30
Oral presentation (Max: 5 marks) [CLO 3.1-V2]
Group grading (Max: 3 marks)
D- Poor (25%): Slides/Poster seem to have been cut and pasted together haphazardly at the last
minute; numerous mistakes; speaker not always sure what is coming next
C- Fair (50%): Boring slides/Poster; no glaring mistakes but no real effort made into creating
truly effective slides;
B- Good (75%): Generally good set of slides/Poster elements; conveys the main points well
A- Excellent (100%): Very creative slides carefully thought out to bring out the main points of
the statistical analysis. Main points well stated and argued.
Individual grading (Max: 2 marks)
D- Poor (25%): Mumbles the words, audience members in the back can't hear anything; too many
filler words; distracting gestures. Reads most of the presentation from the slides or notes with
no eye contact with audience members; seems unaware of audience reactions and team members.
C- Fair (50%): Low voice, occasionally inaudible; some distracting filler words and gestures.
Occasional eye contact with audience but mostly reads the presentation; only brief responses to
audience questions; Mainly focused on his/her portion of the presentation;
B- Good (75%): Clear voice, generally effective delivery; minimal distracting gestures, etc.,
but somewhat monotone. Generally aware of the audience reactions. Good team player. Is
interested in the presentations by the other team members;
A- Excellent (100%): Natural, confident delivery that does not just convey the message but
enhances it. Keeps the audience engaged throughout the presentation, keenly aware of audience
reactions. Superb team player. Goes out of his way to help the rest of the team
Presentation [CLO 3.1-V2] Total Mark out of 5
CLO 3.1-V2 Final Mark out of 35