Reflection Paper
A Practical Approach to Analyzing Healthcare Data, Fourth Edition Chapter 3, Tools for Data Organization, Analysis, and Presentation
Susan White, PhD, RHIA, CHDA
ahima.org
© 2019 AHIMA
ahima.org
Learning Objectives
Compare and contrast database structures
Categorize types of statistical software
Illustrate commonly used data visualization methods
© 2019 AHIMA
ahima.org
Data Organization Using Databases
Healthcare data is complex and often multi-dimensional
Provider
Patients
Insurance companies
Services
Providing an organizational structure for the data can facilitate more efficient analysis and reporting
Database – self-describing collection of integrated records.
Self-describing – contains a description of its own structure
Integrated – data elements are related to each other
© 2019 AHIMA
ahima.org
Database Vocabulary
Tables- two dimensional arrays of data
Rows = records
Columns = variables or attributes
RDMS – Relational Database Management System
Software that is used to hold and maintain data tables and their relationships
SQL – Structured Query Language
Programming language used to communicate with a relational database
ERD – Entity Relationship Diagram
Diagram that shows how tables in an RDMS relate
© 2019 AHIMA
ahima.org
Hierarchy of a Relational Database
Tables are rows and columns of values
Envision a tab in a spreadsheet
Fields are the columns in a spreadsheet
In a patient database, fields may be age, gender, admission date, etc.
Data elements or records are the rows in a spreadsheet
In a patient database, row may represent patients or services provided to patients
A unique row identifier in a table is called the primary key
Cannot be duplicated within the same table
Used to link tables together
© 2019 AHIMA
ahima.org
Data Dictionary
Details roadmap of the database
Should include
Name of computer or software program that contains the data element
Type of data in the field
Length of data in the field
Edits placed on the data field
Values allowed to be placed in the data field
A clear definition of each value
© 2019 AHIMA
ahima.org
Structured Query Language
SQL
Tool to use and maintain databases
Select data
Update data
Insert rows into a table
Delete rows from a table
© 2019 AHIMA
ahima.org
SQL Example
Retrieve the records for all patients from Milwaukee
SELECT PATIENT_LNAME, PATIENT_FNAME FROM PATIENT WHERE PATIENT_CITY = ‘Milwaukee’
Key words in the query are in red font
© 2019 AHIMA
ahima.org
Statistical Software Packages
R
Command line with a menu driven add in (R Commander)
Open source with a large user base on-line
May open Excel files for analysis
Statistical Analysis System (SAS)
Command line program
Excellent for manipulating large datasets
SPSS
Menu driven statistical software
Most common in academic settings
Microsoft Excel
Spreadsheet software with extensive statistical function
Excellent for summarizing data quickly
Commonly found in business setting
© 2019 AHIMA
ahima.org
R
Open source program that may be installed on both windows and Mac based computers
Used to demonstrate examples throughout the text
Many on-line tutorials and video demonstrations of the capabilities
Open source allows the users to expand the functionality of the software
Free to use
© 2019 AHIMA
ahima.org
SAS Syntax
SAS is a programming language much like SQL
Key words:
Data – used to name and create a dataset
Proc – declare which analytic procedure will be used
Set – declare which dataset will be the subject of the analysis
Run – designates the end of the command and starts the calculation
Syntax: always end commands with a ‘;’
© 2019 AHIMA
ahima.org
Graphical Displays of Data
Types of graphical comparisons
Group summary
Trends or changes over time
Relative size of groups
Relationships between variables
© 2019 AHIMA
ahima.org
Bar Graph or Chart
Group summary
Comparison of counts or averages across groups
Two variables: admissions, age category.
One bar for each gender
© 2019 AHIMA
ahima.org
Line Graphs or Chart
Trends or changes over time
Look for trends/patterns
Should not be used for connecting unrelated points
© 2019 AHIMA
ahima.org
Pie Chart
Compares relative size of groups
Used to represent relative proportions of a total
Note that this is different than a bar chart – in a pie chart categories must be part of a bigger set or population
© 2019 AHIMA
ahima.org
Scatter Diagrams
Used to display the relationship between two continuous variables
Should not be used if either variable is categorical
© 2019 AHIMA
ahima.org
Infographics
Conveys a message or story using a combination of graphs and text
Primary types:
Cause and effect
Chronological
Quantitative
Directional
Product
© 2019 AHIMA
ahima.org
Tables versus Graphs
Tables have several advantages over graphs such as:
Present more information than a graph
Display the exact values
Require less work to create
Graphs also have advantages over tables such as:
Catch the attention of the reader
Show trends easily
Bring out facts or relationships that stimulate thinking
© 2019 AHIMA
ahima.org