Assignment- W2

profilewealthprocess1
RES-850-RS-LuckettQuantDissertation-W2.pdf

The Relationship between Musical Aptitude and Academic Achievement among

Beginning Band Students

Submitted by

Theodus Luckett III

A Dissertation Presented in Partial Fulfillment

of the Requirements for the Degree

Doctorate of Education

Grand Canyon University

Phoenix, Arizona

December 11, 2015

© by Theodus Luckett III, 2015

All rights reserved.

GRAND CANYON UNIVERSITY

The Relationship between Musical Aptitude and Academic Achievement among

Beginning Band Students

I verify that my dissertation represents original research, is not falsified or plagiarized,

and that I have accurately reported, cited, and referenced all sources within this

manuscript in strict compliance with APA and Grand Canyon University (GCU)

guidelines. I also verify my dissertation complies with the approval(s) granted for this

research investigation by GCU Institutional Review Board (IRB).

__________________________________ ______________________

Theodus Luckett III Date

Abstract

The purpose of this quantitative correlational study was to examine if and to what degree

a correlation existed between the musical aptitude measured by Intermediate Measures of

Music Audiation (IMMA), and reading and mathematics scores on the State of Texas

Assessment of Academic Readiness (STAAR) among sixth grade beginning band students.

Two research questions guided the study: (1) was there a correlation between the level of

musical aptitude on the IMMA and the composite and sub-categorical performance levels

of the mathematics scores on the STAAR among beginning band students? and (2) was

there a correlation between the level of musical aptitude and the composite and sub-

categorical performance levels of the reading scores on the STAAR among beginning band

students? The theoretical foundation was Gardner’s theory of multiple intelligences

supported by Gordon’s music learning theory. Results from regression analyses indicated

there was a statistically significant relationship between students’ musical aptitude and

mathematics achievement composite scores, R = .673, R2 = .453, F (1, 63) = 52.132, p <

.001, between students’ musical aptitude scores and five mathematics achievement

subscales, R = .709, R2 = .502, F (5, 59) = 11.915, p < .001, between students’ musical

aptitude and reading achievement composite scores, R = .848, R2 = .718, F(1, 63) =

160.722, p < .001. Results from the multiple regression analysis indicated a significant

relationship between students’ musical aptitude scores and three reading achievement

subscales, R = .861, R2 = .740, F(3, 61) = 58.022, p < .001. The implications suggest that

music programs designed to increase musical aptitude may have a positive effect on

reading and mathematics achievement among middle school students.

Keywords: Musical aptitude, academic achievement, reading, mathematics

vi

Dedication

This dissertation is dedicated to the wonderful people who helped me along this

doctoral journey. First, I would like to dedicate this dissertation to the Lord, who is my

helper, friend, and Savior! Without Him, I am nothing. Second, I would like to thank my

lovely wife and wish her the best as she embarks on her doctoral journey. I couldn’t have

done it without her support and advice. Third, I would like to thank my sister (Dr. Pamela

Luckett) for paving the way to my dream. Fourth, I would like thank my parents (Emma

Lee Luckett and Theodus Luckett Jr.) for their sacrifices. They have made me into the

man I am today. Lastly, I would like to dedicate this dissertation to my son (Theodus

Luckett IV) who has pushed me to complete this degree! I love you, son!

-If I am for you, who can be against you!

vii

Acknowledgements

I must acknowledge and thank my entire dissertation team: Dr. Dolores Kelly,

Chair, my friend and colleague Dr. Richard Holsomback, Dr. James Lehmann, and Dr.

Dorina Miron. You made this journey not only possible, but also enjoyable. I cannot

thank you enough for all of your time and dedication thought this process. Thank you!

viii

Table of Contents

List of Tables .................................................................................................................... xii

List of Figures .................................................................................................................. xiii

Chapter 1: Introduction to the Study ....................................................................................1

Introduction ....................................................................................................................1

Background of the Study ...............................................................................................3

Problem Statement .........................................................................................................5

Purpose of the Study ......................................................................................................6

Research Questions and Hypotheses .............................................................................7

Advancing Scientific Knowledge ..................................................................................9

Significance of the Study .............................................................................................11

Rationale for Methodology ..........................................................................................13

Nature of the Research Design for the Study ...............................................................14

Definitions of Terms ....................................................................................................16

Assumptions, Limitations, Delimitations ....................................................................18

Assumptions ........................................................................................................18

Limitations ..........................................................................................................19

Delimitations .......................................................................................................19

Summary and Organization of the Remainder of the Study ........................................19

Chapter 2: Literature Review .............................................................................................22

Introduction and Background ......................................................................................22

Theoretical Framework ................................................................................................26

Theory of multiple intelligences .........................................................................26

ix

Music learning theory .........................................................................................27

Review of the Literature ..............................................................................................28

Overview of musical aptitude .............................................................................29

Academic achievement .......................................................................................33

Music and academic achievement.......................................................................39

Musical aptitude and academic achievement ......................................................48

Methodology. ......................................................................................................52

Instrumentation. ..................................................................................................54

Summary ......................................................................................................................55

Chapter 3: Methodology ....................................................................................................58

Introduction ..................................................................................................................58

Statement of the Problem .............................................................................................59

Research Questions and Hypotheses ...........................................................................60

Research Methodology ................................................................................................62

Research Design...........................................................................................................64

Population and Sample Selection.................................................................................66

Instrumentation ............................................................................................................67

IMMA. ................................................................................................................68

STAAR................................................................................................................68

Validity ........................................................................................................................69

IMMA validity ....................................................................................................69

STAAR validity ..................................................................................................69

Reliability .....................................................................................................................70

x

IMMA reliability .................................................................................................70

STAAR reliability ...............................................................................................71

Data Collection Procedures ..........................................................................................72

Data Analysis Procedures ............................................................................................74

Ethical Considerations .................................................................................................76

Limitations ...................................................................................................................77

Summary ......................................................................................................................78

Chapter 4: Data Analysis and Results ................................................................................81

Introduction ..................................................................................................................81

Descriptive Data...........................................................................................................82

Data Analysis Procedure ..............................................................................................85

Research Question 1: data evaluation .................................................................87

Research Question 2: data evaluation .................................................................90

Results ..........................................................................................................................93

Results of Research Question 1 ..........................................................................93

Results of Research Question 2 ..........................................................................97

Summary ....................................................................................................................102

Chapter 5: Summary, Conclusions, and Recommendations ............................................104

Introduction ................................................................................................................104

Summary of the Study ...............................................................................................105

Summary of Findings and Conclusion .......................................................................108

Implications................................................................................................................113

Theoretical implications ....................................................................................113

xi

Practical implications ........................................................................................114

Future implications ...........................................................................................115

Strengths and weaknesses .................................................................................116

Recommendations ......................................................................................................116

Recommendations for future research ..............................................................116

Recommendations for future practice ...............................................................117

References ........................................................................................................................119

Appendix A. Site Authorization Letter ............................................................................136

Appendix B. Permission to Use Intrument ......................................................................137

Appendix C. IMMA Instrument ......................................................................................138

Appendix D. Informed Consent .......................................................................................147

Appendix E. Computation of Minimum Sample Size .....................................................149

Appendix F. IRB Approval Letter ...................................................................................150

xii

List of Tables

Table 1. Intermediate Measures of Music Audiation Reliabilities, Standard

Errors of Measurement, and Standard Errors of a Difference ........................... 71

Table 2. STAAR Grade 6 Total Group Descriptive Data ................................................ 72

Table 3. The Study Population: Gender and Ethnicity .................................................... 84

Table 4. Summary of Variables and Statistical Tests used to Evaluate

Research Questions 1 and 2 ............................................................................... 87

Table 5. Descriptive Statistics of the Criterion and Predictor Variables ......................... 88

Table 6. Skewness and Kurtosis Statistics of the Criterion and Predictor Variables

Used to Evaluate Research Question 1 .............................................................. 89

Table 7. Summary of Correlations between Predictor Variables used in Research

Question 1 .......................................................................................................... 90

Table 8. Descriptive Statistics of the Criterion and Predictor Variables used to

Evaluate Research Question 2 ........................................................................... 91

Table 9. Skewness and Kurtosis Statistics of the Criterion and Predictor Variables

Used to Evaluate Research Question 2 .............................................................. 92

Table 10. Summary of Correlations between Predictor Variables used in Research

Question 2 .......................................................................................................... 93

Table 11. Model Summary of Regression for Research Question 1 ................................ 95

Table 12. Model Summary of Multiple Regression for Research Question 1 ................. 97

Table 13. Model Summary of Regression for Research Question 2 ................................ 99

Table 14. Model Summary of Multiple Regression for Research Question 2 ............... 101

Table 15. Summary of Results for Hypotheses 1 and 2 ................................................. 103

xiii

List of Figures

Figure 1. District enrollment percentages by ethnicity for 2013-2014 school year.......... 83

Figure 2. Scatterplot depicting the relationship between mathematics achievement

and musical aptitude. ......................................................................................... 96

Figure 3. Scatterplot depicting the relationship between reading achievement

and musical aptitude. ....................................................................................... 100

1

Chapter 1: Introduction to the Study

Introduction

Since the beginning of the 21st century, public education in the United States

could be characterized as a decade of increased emphasis on school accountability

(Duncan, 2011; William, 2010). The accountability standards mandated by federal and

state legislature require public school districts to acquire passing scores on standardized

tests. Yet, standardized assessments across the nation suggest that more than one-third of

American students are not proficient in reading and mathematics (Campbell & Malkus,

2011; Hemmings, Grootenboer, & Kay, 2011; Martin, 2012; National Assessment of

Educational Progress, 2009). In addition, the level of academic proficiency for minority

students is even lower (Arbona & Jimenez, 2014; Morales, 2010; Olszewski-Kubilius &

Thomson, 2010; Williams, 2011).

These alarming statistical data prompted music educators across the United States

to examine musical aptitude as it relates to academic achievement. Similar empirical

studies have examined the relationship between musical aptitude and academic

achievement in Texas. Holsomback (2001) conducted a quantitative correlational study

that examined the relationship between musical aptitude and the Texas Assessment of

Knowledge and Skills (TAKS) for beginning band students. The study was conducted

with 104 sixth grade band students in an east Texas school district. The researcher found

that a strong statistical relationship existed between musical aptitude and academic

achievement. The following year, Holsomback (2002) conducted a similar study that

examined the relationship between the musical aptitude and academic achievement of 74

seventh grade band students. That second study identified a positive correlation between

2

musical aptitude, as measured by the Selmer Music Guidance Survey, and academic

achievement, as measured by the results of the Texas Assessment of Academic Skills

(TAAS). In addition, Holsomback (2002) suggested that further research should

investigate the relationship between musical aptitude and other standardized assessments.

Although compelling empirical evidence correlates musical aptitude to earlier

standardized assessments in Texas (Holsomback, 2002; 2001), as of 2015 there has been

no research concerning the correlation between musical aptitude and the State of Texas

Assessments of Academic Readiness (STAAR) among sixth grade beginning band

students. This gap in the literature justifies the need for this study. By investigating such

relationships, school administrators, teachers, and policy holders are capable of

identifying tactics that may improve test results. In addition, the data extended the

literature relating music to cognitive abilities by examining the correlation of musical

aptitude to specific areas of academic performance. Therefore, the purpose of this

quantitative correlational study was to examine if, and to what degree, a correlation exists

between musical aptitude and the reading and mathematics sections of the STAAR

among beginning band students.

Chapter 1 includes the background to the study, the problem statement, the

purpose of the study, and the research questions and corresponding hypotheses. A

discussion of how it will advance scientific knowledge and the significance of the study

is also presented. The researcher provides the rationale for the selected methodology and

research design. The chapter concludes with definitions of research terms, assumptions,

limitations, and the study’s delimitations.

3

Background of the Study

Over the past decade, the educational reform movement has been a major societal

and political debate in the United States (Knoeppel & Brewer, 2011; McGuinn, 2006).

Although many attempts have been made at school reform, minimal evidence of

academic advancement has been presented (Good, 2010; Ravitch & Cortese, 2009). In

1983, a highly controversial reform, A Nation at Risk: The Imperative for Education

Reform, suggested that more homework, extended school days, and extended school

years would increase academic achievement (U.S. Department of Education, 1983).

However, researchers suggested that the reform advocated by A Nation at Risk was

partially right and partially wrong (Burdick, 2012; Good, 2010; Palmer, Davis, Moore, &

Hilton, 2010). Kitsantas, Cheema, and Ware (2011) conducted a mixed method study that

examined the amount of homework spent, self-efficacy, and academic achievement. The

study consisted of 3,776 students and 221 schools. The findings suggested that more

homework does not yield higher academic achievement.

Supportively, Good (2010) suggested that if learning achievements were low,

perhaps something different was needed—not more of the same. Yet, Johanningmeier

(2010) suggested that A Nation at Risk brought national awareness for academic

excellence in the United States. Additionally, Burdick (2012) suggested that various

accountability measurements may provide impetus for increased academic achievement.

Although various educational reform acts have attempted to improve public

education in the United States, in 2001, federally mandated policies such as the No Child

Left Behind Act (NCLB) required school districts to meet Adequate Yearly Progress

(AYP). NCLB urged schools to “establish challenging educational standards, to develop

4

aligned assessments, and to build accountability systems for districts and schools” (U.S.

Department of Education, 2010, p.1). Therefore, school districts have scrutinized every

educational program to determine their usefulness in helping educators meet NCLB

standards (Circle, 2005; Grey, 2010; Johnson, 2010; Morse, 2010).

Nevertheless, music programs are often ignored. Abril and Gault (2006)

suggested that the NCLB mandate has pejoratively affected arts programs in schools. Yet,

the arts programs are defined as a core academic subject under the NCLB mandate. West

(2012) suggested that the NCLB act is adversely affecting school music programs,

particularly schools that have made AYP. Additionally, West argued that many music

education programs are being reduced or eliminated. Moreover, Spohn (2008)

incorporated a mixed method case study that investigated teachers’ perspectives of the

NCLB policy and its effect on arts programs, particularly music programs. The sample

population consisted of 20 elementary and secondary visual arts teachers and 26

elementary and secondary music teachers. The data collected revealed that administrative

decisions made to improve standardized tests and the accommodations of regulations

mandated by NCLB have threatened arts education. Although researchers (Abril & Gault,

2006; Major, 2013; Proctor Duax, 2013) suggested that music is subjective in nature and

lacks quantifiable evidence of performance, other empirical evidence correlates students

involved in music programs to overall academic performance (Gadberry, 2010; Hall,

2013).

While Gordon (2003) and Kuhlman (2005) suggested that students involved in

music programs increases musical aptitude, other researchers suggested that a

relationship exists between musical aptitude and academic achievement (Holsomback,

5

2001; Kuhlman, 2005; Rubinson, 2010). Holsomback (2001) conducted a quantitative

correlational study that examined the relationship between musical aptitude and Texas

Assessment of Knowledge and Skills (TAKS) for beginning band students. The study

consisted of 104 sixth grade band students in an east Texas school district and suggested

that a strong statistical relationship existed between musical aptitude and academic

achievement. Additionally, the study indicated that further research should be conducted

to investigate the relationship between musical aptitude and other standardized

assessments.

Problem Statement

It was not known if, and to what degree a correlation existed between the level of

musical aptitude and the level of reading and mathematics scores on the STAAR among

beginning band students. Empirical research has examined the relationship between

musical aptitude and various academic assessments (Cavanagh, 2009; Holsomback,

2004). E. A. Geist, K. Geist, and Kuznik (2012) suggested that musical elements such as

tempo, rhythm, and steady beat enhance mathematical concepts such as spatial properties,

counting, and sequencing. Supportively, Oare and Bernstorf (2010) suggested that music

instruction enhances phonological processes that assist in developing good readers and

writers.

Although empirical evidence correlates musical elements to academic

achievement, minimum evidence has been presented on the relationship between musical

aptitude and the reading and mathematics sections of the STAAR. Holsomback (2001,

2002) indicated that a strong statistical relationship existed between musical aptitude and

academic achievement. Yet, he suggested that further research should examine the

6

relationship between musical aptitude and other standardized assessments. For this

reason, it was necessary to determine whether a relationship exists between musical

aptitude and the current academic standardized assessment in Texas.

Understanding the relationship between musical aptitude and academic

achievement is essential in finding effective ways to utilize music instruction to enhance

reading and mathematics achievement of middle school students. In addition, this study

contributed to solving the problem by providing a quantitative analysis on the

relationship between the level of musical aptitude and the level of reading and

mathematics scores on the STAAR among beginning band students in Texas. Therefore,

this study investigated the composite and sub-categorical performance levels of the

Intermediate Measures of Musical Audiation (IMMA) and the STAAR. The 2013-2014

sixth grade class consists of 65 students, all of whom are required to enroll in a beginning

band course and take a musical aptitude assessment. Therefore, the target population and

the sample consisted of 65 sixth grade band students.

Purpose of the Study

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the level of musical aptitude assessed through

IMMA and the reading and mathematics scores on the STAAR among sixth grade

beginning band students in northeast Texas. The study utilized archival data deriving

from the results of the 2013-2014 IMMA assessment and the 2013-2014 reading and

mathematics sections of the STAAR. The independent variable, musical aptitude, was

defined as the potential for musical achievement (Gordon, 2007). The dependent variable,

STAAR, was defined as sequences of state mandated standardized assessments currently

7

used in Texas public schools to evaluate student achievement and knowledge in each

grade level. Specifically, the STAAR includes annual assessments for grades 3-8 in

reading and mathematics; assessments in writing at grades 4 and 7; in science at grades 5

and 8; and in social studies at grade 8. In addition, the STAAR includes end-of-course

assessments for English I, English II, Algebra I, Biology, and U.S. History (Texas

Education Agency, 2014). The study then evaluated the results to determine if a

statistically significant relationship existed between musical aptitude and the reading and

mathematics sections of the STAAR among sixth grade beginning band students. The

study was conducted with 65 students from a middle school band program in northeast

Texas.

The findings of this study advanced the understanding of the relationship between

musical aptitude and academic achievement among beginning band students. The

collection of data from this study added to the literature in this area by broadening the

knowledge surrounding the problem statement. The data extended the literature relating

music to cognitive abilities by examining the correlation of musical aptitude to specific

areas of academic performance. The findings contributed to the existing research by

providing a quantitative analysis on musical aptitude and the current academic

standardized assessment in northeast Texas. In addition, this study contributed to the field

by providing new information and resources relevant to musical aptitude and academic

achievement in reading and mathematics among middle school students.

Research Questions and Hypotheses

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the level of musical aptitude and the reading and

8

mathematics scores on the STAAR among beginning band students in Texas. This

research was framed in the theoretical context that learning, and more specifically

musical learning, is a person's ability to understand and process sound, rhythm, patterns

in sound, relationships between sounds, and ability to process rhymes and other auditory

information. This theoretical context was based on Gardner’s (2006) theory of multiple

intelligences and Gordon’s (1986) music learning theory. In order to understand various

relationships between musical aptitude and academic achievement among beginning band

students, appropriate research questions are essential. In addition, the research questions

and hypotheses were related to the problem statement by examining the relationship

between the level of musical aptitude and the reading and mathematics scores on the

STAAR. The following research questions and hypotheses guided this study:

R1: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the mathematics scores on the STAAR

among beginning band students?

H1a: A statistically significant correlation exists between musical aptitude and the

composite and sub-categorical performance levels of the mathematics section of

the STAAR among beginning band students.

H10: There is no statistically significant correlation between musical aptitude and the

composite and sub-categorical performance levels of the mathematics section of

the STAAR among beginning band students.

R2: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the reading scores on the STAAR among

beginning band students?

9

H2a: A statistically significant correlation exists between musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

H20: There is no statistically significant correlation between musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

Advancing Scientific Knowledge

This study advanced scientific knowledge by providing a quantitative

correlational analysis on musical aptitude and academic achievement. Compelling

empirical evidence has examined musical aptitude (Gordon, 2007; Karma, 2007;

Rutkowski, 1996; Ukkola-Vuoti et al., 2013) and academic achievement (Cheema &

Galluzzo; 2013; Maltese, Tai, & Xitao, 2012; Musa, 2013; Rowe, Miller, Ebenstein, &

Thompson, 2012; Schutz, Simon, & Musgrave, 2013; Stanley & Stanley, 2011; Talley &

Scherer, 2013; Toldson, 2012; Young, Hyuck, Sunyoung, & You Kyung, 2012).

However, few studies have examined the relationship between musical aptitude and

academic achievement (Holsomback; 2002; Holsomback, 2004; Kuhlman, 2005;

Rubinson, 2010). Holsomback (2001) conducted a quantitative correlational study that

examined the relationship between musical aptitude and Texas Assessment of Knowledge

and Skills (TAKS) of beginning band students. The study consisted of 104 sixth grade

band students in an east Texas school district. The study suggested that a strong statistical

relationship existed between musical aptitude and academic achievement. The following

year, Holsomback (2002) conducted a similar study that examined the relationship

between musical aptitude and academic achievement of 74 seventh grade band students.

10

The study suggested that a positive correlation existed between musical aptitude, as

measured by the Selmer Music Guidance Survey, and academic achievement, as

measured by the results of the Texas Assessment of Academic Skills (TAAS). In

addition, Holsomback (2002) suggested that further research should investigate the

relationship between musical aptitude and other standardized assessments.

While empirical evidence has examined relationships between musical aptitude

and academic achievement, previous studies identified a gap in the literature and allowed

this study to analyze the relationship between musical aptitude and the composite and

sub-categorical performance levels of the STAAR among sixth grade beginning band

students. The researcher evaluated the gaps in the literature and added to the existing

research by providing a quantitative analysis on musical aptitude and academic

achievement in reading and mathematics. Although similar studies have examined the

relationship between musical aptitude and previous state academic assessments in Texas

(Holsomback, 2001; 2004), a gap existed in the literature concerning the relationship

between musical aptitude, as measured by the IMMA, and the current standardized

assessment, as measured by the reading and mathematics sections of the STAAR

(Holsomback, 2002).

This research was framed in the theoretical context that learning in general, and

more specifically musical learning, is a person's ability to understand and process sound,

rhythm, patterns in sound, relationships between sounds, and process rhymes and other

auditory information. Gardner’s theory of multiple intelligence classified intelligence into

seven distinct categories (musical intelligence, bodily-kinesthetic intelligence, logical-

mathematical intelligence, linguistic intelligence, spatial intelligence, interpersonal

11

intelligence, and intrapersonal intelligence). Although each intelligence is relatively

independent (Feldman, 2010), Gardner suggested these separate intelligences operate

together and not in isolation, depending on the type of activity in which individuals are

engaged (Gardner, 2006; Gardner & Moran, 2006). Since Gardner identified musical

intelligence as one component, it was advantageous to examine musical learning and its

contribution to overall intellectual capacity.

Gordon’s music learning theory is a stage specific theoretical model that

introduces musical learning processes and presents effective teaching methods (Gordon,

1986). music learning theory employs the following three basic learning sequences: skill

learning, tonal content, and rhythm content. As a method of instruction, the learning

sequences are combined in various learning sequence activities which can be combined

with classroom activities. In this method, a skill level cannot be achieved except in

combination with a tonal or rhythm content level. Both Gordon’s music learning theory

and Gardner’s theory of multiple intelligence guided this study by serving as foundations

in the development of the research questions and hypotheses. This study advanced each

theory by providing a quantitative analysis that supports the theoretical foundation on

which the study is built. Specifically, this study advanced each theory by examining the

relationship between musical aptitude and the composite and sub-categorical

performance levels of the STAAR among sixth grade beginning band students.

Significance of the Study

The significance of this quantitative correlational study was to understand the role

of musical aptitude better as it relates to reading and mathematics achievement among

middle school students. Recent empirical evidence indicated that arts programs,

12

particularly music, are being eradicated at an exponential rate (Major, 2013; Slaton,

2012; Spohn, 2008). Although this study does not seek to demonstrate causation,

investigating possible relationships may help educators find effective ways of enhancing

reading and mathematics achievement of middle school students. Prior research generally

shows a positive relationship between musical aptitude and academic achievement

(Holsomback; 2002; Holsomback, 2004; Kuhlman, 2005; Rubinson, 2010). Although

Holsomback (2001; 2002) examined the relationship between musical aptitude and the

previous academic assessment in Texas, a defined need or gap exists in the literature

concerning the relationship between the level of musical aptitude and the composite and

sub-categorical performance levels of the reading and mathematical section of the

STAAR among beginning band students. By investigating such relationships, school

administrators, teachers, and policy holders are capable of identifying tactics that may

improve test results.

The collection of data from this study added to the literature in this area by

broadening the knowledge surrounding the problem statement. In addition, the data

extended the literature relating music to cognitive abilities by examining the correlation

of musical aptitude to specific areas of academic performance. The findings contributed

to the existing research by providing a quantitative analysis on musical aptitude and the

current academic standardized assessment in Texas. By addressing the problem, this

study provided educators with new detailed information and resources relevant to musical

aptitude and academic achievement in reading and mathematics among middle school

students. Ultimately, this study could also impact educators by providing valuable data

that may help district leaders identify tactics that may improve test results.

13

Rationale for Methodology

This study utilized a quantitative approach to determine if, and to what degree, a

correlation existed between musical aptitude and the STAAR among sixth grade

beginning band students. Previous research has employed a quantitative methodology to

determine the relationship between musical aptitude and standardized assessments

(Holsomback, 2001; Kuhlman, 2005; Rubinson, 2010). A quantitative methodology

involves empirical analysis of data that has been collected from a sample of individuals

from specific populations to make a generalizable observation based on the measure of

relationships. Additionally, quantitative research seeks to establish relationships between

variables and attempts to clarify a phenomenon by performing a statistical analysis of a

body of numerical data (Fraenkel, Wallen, & Hyun, 2012). A quantitative methodology

was appropriate for this study since quantitative research involves statistical analysis of

quantitative data. Proper selection of methodology was imperative in comprehending and

interpreting the results based on the research questions and hypotheses (Yin, 2009).

Fraenkel, Wallen, and Hyun (2012) noted that quantitative methodology

comprises of explicit hypotheses. Additionally, the quantitative approach utilizes

objective instruments such as multiple choice standardized assessments, questionnaires,

personality scales, and aptitude assessments. Qualitative research expands the range of

knowledge and understanding of the world beyond the researchers themselves. It often

helps one see why a situation is the way it is, rather than just presenting a phenomenon

(Fraenkel, Wallen, Hyun, 2012). In addition, qualitative research is “intuitive in nature

and expands the scope of research to finding out the why and how of things that happen

in addition to the what, where, and when things happen” (p. 43). Since qualitative

14

research attempts to investigate naturally occurring phenomena in all their complexity, a

qualitative methodology would be inappropriate for evaluating the research questions and

hypotheses in the current study (Yin, 2009).

Nature of the Research Design for the Study

A correlational research design was employed in this study. According to

Fraenkel, Wallen, and Hyun (2012), correlational research design “seeks to investigate

the extent to which one or more relationships of some type exist” (p. 11). A correlational

study was the most appropriate design to identify the degree to which there is a

relationship between the level of musical aptitude and the mathematics and reading

scores on the STAAR among sixth grade beginning band students. As the study did not

“seek to determine reasons or causes for preexisting differences in groups of individuals”

(Fraenkel, Wallen, & Hyun, 2012, p. 365), a causal-comparative research design was

inappropriate for this study. In experimental research, variables are manipulated, and the

effects of this manipulation are measured upon the dependent variable (Fraenkel, Wallen,

& Hyun, 2012). Additionally, in experimental research, a treatment is deliberately

imposed on a group of objects or participants (Fraenkel, Wallen, Hyun, 2012). As this

study did not attempt to manipulate the variables, an experimental research design was

inappropriate for the study.

The sample selection for a correlational study, as in any type of study, should be

carefully planned (Yin, 2009). The minimum acceptable sample size for a correlational

study is considered by most researchers to be no less than 30 (Fraenkel, Wallen, & Hyun,

2012). In addition, a statistical power analysis was conducted in order to determine

adequate sample size. The parameter statistical power was set at 0.8, the significance

15

level at 0.05 and the effect size at 0.5. The analysis suggested that a minimum of 85

students was required for this study (Appendix E).

A structured process was used to collect the data. According to Fraenkel, Wallen,

and Hyun (2012), quantitative research is prevalent in developing procedures relating to

the comparison of variables, groups, or relating factors about individuals or groups in

experiments through correlational studies and surveys. Collecting data and analyzing

numbers that measure distinct attributes of individuals and groups is a trend that is

prevalent in today’s studies (Fraenkel, Wallen, & Hyun, 2012). Prior to collecting the

data, the researcher obtained written permission from the school district. Additionally,

Grand Canyon University Institutional Review Board (IRB) approval was required prior

to the data collection process (See Appendix F). The 2013-2014 sixth grade class

consisted of 65 students, all of whom were required to enroll in a beginning band course

and take a musical aptitude assessment. This study was conducted with the entire

population of 65 sixth grade band students available in the school district selected for

convenience as a research site for this study. A statistical power analysis was conducted

in order to determine adequate sample size. The analysis suggested that a minimum of 85

students were required for this study. Archival data, results from the 2013-2014 IMMA

assessment, and the results from the 2013-2014 reading and mathematics sections of the

STAAR, was analyzed in an attempt to answer two overarching research questions: (1) Is

there a correlation between musical aptitude and the composite and sub-categorical

performance levels of the mathematics section of the STAAR among sixth grade

beginning band students? (2) Is there a correlation between musical aptitude and the

16

composite and sub-categorical performance levels of the reading section of the STAAR

among sixth grade beginning band students?

The data collection procedures consisted of several steps to ensure proper

collection and storage of the data. The researcher obtained each student’s performance

data once IRB approval was received. Identifiers such as student names, identification

numbers, dates of birth, and addresses were removed by the school district and assigned

an alphabet to minimize a breach of confidentiality. All information regarding the

participants remained in the possession of the researcher and kept in a lock box located in

the counselor’s office for three years. Data collected was exported into the Statistical

Package for Social Science (SPSS) 23.0 software.

Definitions of Terms

The terms listed within this section are terms that are frequently used within the

study. This section defined the study construct and provided a common understanding of

technical terminology, variables, and concepts utilized within the scope of the study. The

following terms were used operationally in this study:

Academic achievement. A student’s ability to excel in an academic subject and

gain the necessary skills and knowledge required to contribute in a global society. It is

often measured by a student’s grade point average (GPA) and/or a standardized

assessment (U.S. Department of Education, 2010). This term was also identified as the

dependent variable in the current study.

Adequate Yearly Progress (AYP). A measure of a school or school system’s

ability to meet federal benchmarks with specified performance standards (U.S.

Department of Education, 2010).

17

Composite musical aptitude. A composite musical aptitude is the combination of

tonal and rhythmic aptitude scores (Gordon, 2003).

Intermediate Measures of Musical Audiation (IMMA). Developed by Edwin E.

Gordon and based on an extensive body of research and practical field-testing, the IMMA

is a musical aptitude test for grades 4-6. It was designed to act as objective aids to

teachers by providing students with appropriate opportunities and instruction. The IMMA

was developed because of the need for a more advanced version of the Primary Measures

of Music Audiation (PMMI) test. Administration of the test requires approximately

twenty minutes for the rhythm and tonal subtests (Gordon, 2007).

Middle school. Schools that are typically configured to begin with six grade and

end with eighth grade (Texas Education Agency, 2014).

Musical aptitude. Musical aptitude is the potential for musical achievement in

which one can learn music (Gordon, 2007). This term was also identified as the

independent variable in the current study.

No Child Left Behind (NCLB). NCLB is a law passed in 2001 which proposes to

raise student achievement and eliminate the achievement gap between students from

different backgrounds (U.S. Department of Education, 2010).

Rhythmic aptitude. Rhythmic aptitude is the ease in which one learns rhythm

(Gordon, 1986).

Standardized assessment. An assessment that is identical for each individual and

are usually made to correlate to specific state academic standards. They are often multiple

choice and have identical testing conditions (time limits, instructions, and scoring) for all

students (U.S. Department of Education, 2010).

18

The State of Texas Assessment of Academic Readiness (STAAR). The STAAR

are sequences of state mandated standardized assessments currently used in Texas public

schools to evaluate student achievement and knowledge in each grade level. Specifically,

the STAAR includes annual assessments for grades 3-8 in reading and mathematics;

assessments in writing at grades 4 and 7; in science at grades 5 and 8; and in social

studies at grade 8. In addition, the STAAR includes end-of-course assessments for

English I, English II, Algebra I, Biology, and U.S. History (Texas Education Agency,

2014).

Tonal aptitude. Tonal aptitude is the ease with which one learns melody (Gordon,

1986).

Assumptions, Limitations, Delimitations

Assumptions. Assumptions are any important assertion presumed to be true but

not actually verified; major assumptions should be described in any research proposal or

report (Fraenkel, Wallen, & Hyun, 2012). The following assumptions are present in this

study:

1. It was assumed that the musical components of tonal and rhythmic aptitude were accurately measured by the musical aptitude assessment. The rationale behind this

assumption is that the musical aptitude assessment was administered properly

according to the musical aptitude assessment manual.

2. It was assumed that the participants answered the assessment honestly, and to the best of their ability.

3. It was assumed that the reading and mathematics achievement of study participants was accurately measured by the reading and mathematics section of

the STAAR. The rationale behind this assumption is that only certified educators

are allowed to administer the STAAR according to the rules and regulations set

forth by Texas Education Agency (TEA). A month prior to test administration, all

test examiners are mandated to complete training in testing procedures. In

addition, all examiners and examinees are required to sign an oath of

confidentiality and test security prior to test administration (Texas Education

Agency, 2014).

19

Limitations. According to Fraenkel, Wallen, and Hyun (2012), knowledge

concerning the limitations of a study may assist other researchers in evaluating the degree

to which the findings can be generalized. The researcher identified the following

limitations:

1. The researcher’s access to secondary data was limited to one school district in northeast Texas that has only one middle school. As a result, the researcher

identified the following consequences:

a. Population size for the grade of interests (6th grade) was N = 65, which did not meet the minimal sample size needed to capture medium-size correlations.

The study can be identified as significant only if a large correlation exists.

b. Population may not be typical for the entire state of Texas (low external validity).

c. Since there are significant educational differences among states, the results of this study cannot be generalized to other states.

2. The scope of this study was limited by the scope of the instrument used to measure musical aptitude and the scope of the mathematics and reading tests.

While the study presented several unavoidable limitations, it did not negatively

affect the results of the study.

Delimitations. Delimitations are simply defined as boundaries set by the

researcher(s) to control the study (Fraenkel, Wallen, & Hyun, 2012). The researcher

made the following delimitations:

1. A longitudinal study may reveal different or similar relationship patterns, however, this study was limited to the 2013-2014 school year.

2. Use of Pearson correlation analysis: Correlational studies do not investigate cause and effect (Yin, 2009), causal conclusions cannot be drawn from the study.

Summary and Organization of the Remainder of the Study

This chapter introduced the study, which focuses on the relationship between

musical aptitude and academic achievement among sixth grade students in northeast

20

Texas. Previous empirical research has examined the relationship between musical

aptitude and various academic assessments (Cavangh, 2009; Holsomback, 2001; 2004;

Rubinson, 2010). Yet, a defined need or gap exists in the literature concerning the

relationship between musical aptitude, as measured by the IMMA, and academic

achievement, as measured by the reading and mathematics sections of the STAAR

(Holsomback, 2002).

The findings of this study may help school administrators, music specialists, and

reading and mathematics instructors find effective ways to utilize music instruction to

enhance reading and mathematics achievement of middle school students. In addition,

this study contributed to the field by providing new information and resources relevant to

musical aptitude and academic achievement among middle school students. Thus, the

purpose of this quantitative correlational study was to examine if, and to what degree, a

correlation existed between the level of musical aptitude and the reading and mathematics

scores on the STAAR among sixth grade beginning band students in Texas.

Chapter 2 will present an organized literature review covering the background to

the problem, the theoretical framework providing the foundation to the study, and various

topics and themes within those topics related to the study. Those topics will include: (a)

the history of musical aptitude, (b) academic achievement, (c) music in relation to

academic achievement, (d) musical aptitude and academic achievement, and (e)

methodological strengths and weaknesses followed by a summary. Chapter 3 will present

a detailed description of the methodology of the study beginning with the restatement of

research questions and hypotheses, followed by the research design, population and

sample selection, instrumentation, validity, reliability, data collection procedures, data

21

analysis procedures, and ethical considerations. Chapter 4 will present the data summary

and statistical analysis of the study data. Finally, Chapter 5 will present the summary of

the study, findings, implications, and recommendations from the study.

22

Chapter 2: Literature Review

Introduction and Background

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation exists between musical aptitude and the reading and mathematics

sections of the State of Texas Assessment of Academic Readiness (STAAR) among sixth

grade beginning band students. Specifically, this quantitative correlational study

examined the relationship between musical aptitude, as measured by the Intermediate

Measures of Music Audiation (IMMA Gordon, 2002), and the reading and mathematics

performance levels of the STAAR. Administrators and policy makers have investigated

pedagogical methods for improving academic achievement in schools. Thus, the intent of

the literature review was to provide a comprehensive review of peer-reviewed journal

articles, books, professional publications, and internet sources relevant to the study.

The literature review was conducted utilizing various sources and approaches.

Empirical journal articles on relevant subjects were discovered by searching available

academic databases to include: ABI/INFORM Complete, EBSCOhost, JSTOR, ERIC,

and ProQuest. Doctoral dissertations were accessed by using the ProQuest Dissertations

& Thesis: The Humanities and Social Sciences Collection. Additional supporting

resources were found on the internet to include: the Office of Personnel Management

website, Google Scholar, and a number of educational websites. Keywords associated

with these websites include: musical aptitude, academic achievement, reading

achievement, and mathematics achievement. In addition, the review of the literature

included a snowballing technique (Marshall, 1998) which compiled references from each

23

document. This technique led to the identification of over 300 scholarly journal articles,

books, and dissertations relevant to the study.

This chapter addresses the Background to the Problem, which provides the

justification for this study based on the literature. Additionally, the historical background

behind the study is presented. Second, this chapter presents the theoretical framework,

which serves as the foundation of this research study and is utilized to help develop the

research question and data collection approach. This section presents the models to be

used behind the study variables. Next, this chapter discusses the Review of Literature and

other topics relevant to the study. These topics include: (a) the history of musical

aptitude, (b) academic achievement, (c) music in relation to academic achievement, (d)

musical aptitude and academic achievement, and (e) methodological strengths and

weaknesses in similar studies. Lastly, a summary of the literature review further

synthesizes the literature review and identifies the problem statement from the

Background to the Problem, the research questions based on the Theoretical Foundation,

the design from the review of similar designs, and data collection approaches and

instruments.

Throughout the twentieth century, music psychologists have designed tests to

measure music constructs such as talents, ability, and aptitude (Gordon, 1986; Seashore;

1938; Stanton, 1922). By using their tests, educators were capable of predicting musical

ability and possibly achievement. While some music educators were skeptical about the

value of objective information yielded by test scores, other educators appreciated and

utilized such tests. However, in the early 1970’s, researchers questioned whether the

results from musical aptitude assessments correlated with academic achievement

24

(Rainbow, 1963; Young, 1971). Young (1971) conducted a quantitative correlational

study that examined the relationship between musical aptitude and academic achievement

of elementary students in Chicago. The researcher employed the Musical Aptitude Profile

of Gordon, the Lorge-Thorndike Intelligence Test, and the Iowa Test of Basic Skills

(1962) to determine the extent of the relationship between musical aptitude and academic

achievement. The study suggested that a strong correlation existed between musical

aptitude and academic achievement. The researcher suggested that future studies should

explore the relationship between musical aptitude and academic achievement among

various demographics and socioeconomic backgrounds.

Similarly, Hobbs (1985) conducted a correlational study that examined the

relationship between musical aptitude, scholastic achievement, and academic

achievement. The sample consisted of 72 first, second, and third graders. The study found

low correlations (r = .33) between musical aptitude and scholastic aptitude, indicating

that musical aptitude and scholastic aptitude tests measure different aspects of cognition.

However, higher correlations were found between music aptitude and academic

achievement (r = .56). The study indicated that further research should examine the

relationship between musical aptitude and standardized academic assessments in other

states.

In the late 1990s to early 2000s, researchers began to examine the relationship

between musical aptitude and standard assessments in various states (Barrett, 1993;

Holsomback; 2002; 2004). Barrett (1993) conducted a quantitative, non-experimental

study that investigated the relationship between musical aptitude and academic

standardized assessment in Florida. The study consisted of 76 sixth, seventh, and eighth

25

graders. The study determined that no correlation existed between musical aptitude and

academic achievement. Conversely, Holsomback (2001) conducted a similar study that

examined the relationship between musical aptitude and Texas Assessment of Knowledge

and Skills (TAKS) of beginning band students. The study consisted of 104 sixth grade

band students in an east Texas school district. The study suggested that a strong statistical

relationship existed between musical aptitude and academic achievement. Additionally,

the study indicated that further research should be conducted to investigate the

relationship between musical aptitude and other standardized assessments.

From 2003 to 2014, one study, to the researcher’s knowledge, has examined the

relationship between musical aptitude and academic standardized assessments. Rubinson

(2010) conducted a quantitative correlational study that investigated the relationship

between musical aptitude and developing reading abilities of kindergarten students. The

researcher employed the tonal and rhythmic components of the Primary Measures of

Music Audiation (PMMA) to determine the amount of musical aptitude of kindergarten

students. Reading achievement was measured by various subtests of Dynamic Indicators

of Basic Early Literacy Skills (DIBELS), a standardized assessment of early literacy

development that evaluates the alphabetic abilities, and phonological awareness of

kindergarten students. The study consisted of 80 kindergarten students from an

elementary school in central Connecticut. The study suggested that a strong correlation

existed between musical aptitude and the phonological awareness of kindergarten

students.

While research on the relationship between musical aptitude and academic

achievement has declined significantly within the last decade, Holsomback (2002)

26

suggested that future research should investigate the relationship between musical

aptitude and current academic standardized assessment in Texas. Based on this gap, the

study will explore whether musical aptitude is closely related to reading and mathematics

achievement and could serve as a predictor of reading and mathematics achievement in

middle school students. The findings from this study may assist teachers, administrators,

and educational policy makers in developing effective methods for increasing reading

and mathematics achievement of middle school students.

Theoretical Framework

Previous research conducted on musical aptitude and academic achievement has

utilized Gardner’s theory of multiple intelligences (Gardner, 2006) and Gordon’s music

learning theory (Gordon, 1986) as a theoretical foundation. This research was framed in

the theoretical context that learning in general, and more specifically musical learning, is

a person's ability to understand and process sound, rhythm, patterns in sound,

relationships between sounds, and ability to process rhymes and other auditory

information. This framework was based on Gardner’s (2006) theory of multiple

intelligences and Gordon’s (1971) music learning theory. Multiple theoretical models

influenced the research questions in this study as they provided a rationale on the

relationship between musical aptitude and academic achievement. Each theoretical model

provided the foundation for the study.

Theory of multiple intelligences. American neuropsychologist and educator,

Howard Gardner, defined intelligences as “the ability to create an effective product that is

valued in a culture” (Gardner, 2006, p. 6). Rather than viewing intelligence as dominated

by a single general ability, Gardner pluralized the theoretical concept of intelligence.

27

Gardner’s theory of multiple intelligence classified intelligence into seven distinct

categories (musical intelligence, bodily-kinesthetic intelligence, logical-mathematical

intelligence, linguistic intelligence, spatial intelligence, interpersonal intelligence, and

intrapersonal intelligence). Although each intelligence is relatively independent

(Feldman, 2010), Gardner suggested these separate intelligences operate together and not

in isolation, depending on the type of activity in which we are engaged (Gardner, 2006;

Gardner & Moran, 2006). Since Gardner identified musical intelligence as one

component, it would be advantageous to examine musical learning and its contribution to

overall intellectual capacity. In addition, Music participation and learning, which

incorporates instruction geared toward the plurality of various intelligences, may

ultimately have a positive effect on reading and mathematical achievement. Gardner’s

(2006) multiple intelligences describes music and movement as equal to and unique from

reading and mathematical learning. This theory motivated educators to investigate the

relationship between musical aptitude and academic achievement.

Music learning theory. This theory was developed by Edwin E. Gordon and

based on an extensive body of research and practical field-testing. music learning theory

is a stage-specific theoretical model that introduces musical learning processes and

presents effective teaching methods. In 1971, the theoretical model was introduced in his

book, The Psychology of Music Teaching, and has been revised in his subsequent

publications. The teaching model is sequential and utilizes the concept of audiation—

hearing music in the mind with understanding—music learning theory employs the

following three sequential learning activities: (a) skill learning, (b) tonal content, and (c)

rhythm content. In this theoretical context, musical aptitude is considered to be normally

28

distributed within the general population, with relatively few people having high or low

musical aptitude and the majority having average musical aptitude (Gordon, 2002).

Specifically, Gordon (1999) suggested that:

Music Learning Theory is unique among music teaching methods in

accounting directly for students’ differing potentials to achieve in music.

Students of average aptitude are taught more tonal content and rhythm

content than low aptitude students, and high aptitude students learn more

content than average aptitude students. By teaching to students’ individual

differences, teachers lessen the risk of boring students with high potential

and frustrating students with lower potential. (p. 2)

Although Gordon’s music learning theory analyzes musical learning

processes, it is ultimately designed to enhance musical intelligence. Based on

these theoretical models, this study adds to the existing research by providing a

quantitative analysis on musical aptitude and the current standardized assessment

in northeast Texas.

Review of the Literature

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between musical aptitude and the reading and mathematics

sections of the STAAR among sixth grade beginning band students. The review of

literature provides an overview of various thematic topics relevant to the study. These

topics include: (a) an overview of musical aptitude, (b) academic achievement, (c) music

in relation to academic achievement, and (d) musical aptitude and academic achievement.

29

Overall, the review of literature provides the foundation of the study to examine the

relationship between musical aptitude and academic achievement.

Overview of musical aptitude. The overview of musical aptitude examination is

a component of the historical nature of music education in the United States (Gordon,

2007; Seashore, 1938). Some of the early musical aptitude assessments included the

Seashore Measures of Musical Talent, the Kwalwasser-Dykema Music Tests, and the

Music Aptitude Profile of Gordon. Motivated by the standardization movement, musical

aptitude has been defined as the potential for music achievement. Music achievement is

the actual attainment of musical knowledge and ability.

Early in the twentieth century, musicologist Carl E. Seashore indicated that

“When the proximate physiological threshold has been obtained, practice is of no avail”

(Seashore, 1919, p. 60). Musically, Seashore (1919) suggested that once an individual has

attained their maximum physiological ability, musical training was impractical. Yet,

Seashore believed that musical aptitude was a God-given gift that one was either born

with or without. Seashore supported the notion that practice and training in music would

not affect one’s physiological ability, but may affect one’s cognitive abilities. Seashore

then realized that musical aptitude was not only an innate ability, but was also determined

genetically.

Many researchers believed that musical aptitude was inherited and that it

stabilizes at birth (Stanton, 1922; Seashore, 1919). However, James Mursell, a prolific

author in the field of music, psychology, and education, became suspicious of the genetic

disposition theory of musical aptitude. Mursell popularized his view in an exchange of

letters with Seashore that was published in the Music Educators Journal during the early

30

1930’s. Over the next several years, the nature vs. nurture disposition eventually evolved

into a major controversy among music educators across the nation and debated by

Mursell in The Psychology of Music of 1937. Mursell suggested that “if indeed musical

aptitude is innate and if it cannot be altered with practice and training, why are the

majority of students in American public schools required to receive instruction in general

music” (Mursell, 1937, p.6). He also suggested that “If Seashore is correct, training in

music is all but useless for those born without a high level of musical aptitude” (p.5).

Mursell (1937) suggested that Seashore was largely involved in the identification and

instruction of the musically talented and to a much lesser extent and pedagogical

practices regarding students’ individual musical abilities. Yet, Mursell’s views became a

controversial topic concerning musical aptitude in the late 1930s.

Intermediate Measures of Musical Audiation (IMMA). Developed by Edwin E.

Gordon and based on an extensive body of research and practical field-testing, the IMMA

was designed to act as objective aids to teachers by providing students with appropriate

opportunities and instruction. The IMMA was developed due to the need for a more

advanced version of Gordon’s earlier musical aptitude assessment, the Primary Measures

of Music Audiation (PMMA, Gordon, 2002). The IMMA ultimately serves two purposes:

(a) to identify children with overall musical aptitude so that they can be encouraged to

participate in special musical activities and (b) to diagnose individual musical strengths

and weaknesses. The assessment may be administered to children from ages six through

nine as well as from 10 through 11, though children’s musical aptitudes generally

stabilize around age nine (Gordon, 2002). The level of musical aptitude a child attains by

the age of nine remains ostensibly the same throughout life. Although a child’s aptitude

31

results increases from year to year, his or her percentile ranking remain relatively stable.

Additionally, Gordon (2003) suggested:

The IMMA is intended to be used in a group where 50% or more of the children

scored 80% or above on the Tonal subtest, Rhythm subtest, or both on the

PMMA. The IMMA is most appropriate for children with above average and high

developmental aptitudes, whereas PMMA is most appropriate for children with

average and low developmental musical aptitudes. (p. 10)

It would be erroneous for researchers to evaluate variations of developmental

music aptitude by comparing the students’ scores on the PMMA to those of the IMMA.

Rather, researchers should compare the dissimilarities of the students’ scores on two or

more of the same assessment to ensure accurate interpretation.

The Seashore Measures of Musical Talents. Developed by Carl E. Seashore, the

Seashore Measures of Musical Talents is a musical aptitude assessment that initially

appeared in 1919 and was revised extensively in 1939. The assessment provided a

reference against which to compare other approaches to assess musical abilities.

Although contemporary constructs and outlooks contributed to the decline of the

Seashore Measures of Musical Talents, the assessment exemplified the belief that musical

ability, particularly in the aptitude aspect, is based on psychoacoustical discriminations

(Radocy & Boyle, 2003).

The Seashore Measures of Musical Talents (1919) consists of two aural

discrimination rhythm tasks. In the rhythm test, the respondents indicate whether paired

tapped patterns are the same or different. The time test requires an indication of whether

the second of the paired tones are longer or shorter than the first. Although Mursell and

32

other theorists question the tests’ validity as predictors of musical talent, they are

unquestionably valid measures of both discrimination tasks (Radocy & Boyle, 2003).

Kwalwasser-Dykema Music Tests. While Seashore was between the publication

of the first and second edition of Seashore Measures of Musical Talents, in 1930, Jacob

Kwalwasser and Peter Dykema developed and published the Kwalwasser-Dykema Music

Tests, commonly known as the K-D tests. Both Kwalwasser-Dykema Music Tests and

the Seashore Measures of Musical Talents assessments display similarities (Radocy &

Boyle, 2003). Perhaps it is because the ultimate level of attainment of musically trained

students is best predicted by past musical achievement that Kwalwasser and Dykema,

unlike Seashore, included musical achievement measures in their assessment. Clearly,

they were more interested in the practical aspects of Seashore’s assessment than in the

search for a viable description of musical aptitude. Kwalwasser-Dykema Music Tests

were comprised of ten subtests. Six of the ten subtests were created to measure similar

factors found in the Seashore Measures of Musical Talents assessment, although they

were developed and titled differently. However, Kwalwasser-Dykema employed

orchestral instruments and the Duo-Art Reproducing Piano as stimuli for many of the

subtests.

The phenomenon of musical aptitude assessment is a component of the historical

nature of music education in the United States (Gordon, 2007; Seashore, 1938).

Furthermore, musical aptitude assessments are designed to assist music teachers in

providing students with appropriate instruction and learning opportunities. Many music

educators employ tests such as the Intermediate Measures of Musical Audiation (1986),

Kwalwasser-Dykema Music Tests (1930), and The Seashore Measures of Musical

33

Talents (1919) to measure musical aptitude. Yet, Gordon’s Intermediate Measures of

Musical Audiation (1986) is the only brief, longitudinally valid music aptitude test for

Grades 1 through 6. Based on this information, the IMMA would be the most appropriate

assessment to measure musical aptitude among middle school band students.

Academic achievement. Since the beginning of the 21st century, the educational

reform movement has been a major societal and political debate in the United States

(Davies, 2007; McGuinn, 2006). Although many attempts have been made at school

reform, minimal evidence of academic advancement has been presented (Good, 2010;

Ravitch & Cortese, 2009). In 1983, a highly controversial reform, A Nation at Risk: The

Imperative for Education Reform, suggested that more homework, extended school days,

and extended school years would increase academic achievement (U.S. Department of

Education, 1983). However, researchers suggested that the reform advocated by A Nation

at Risk was partially right and partially wrong (Burdick, 2012; Good, 2010; Palmer,

Davis, Moore, & Hilton, 2010). Cheema, Kitsantas, and Ware (2011) conducted a mixed

method study that examined the amount of homework spent, self-efficacy, and academic

achievement. The study consisted of 3,776 students and 221 schools. The findings

suggested that more homework does not yield higher academic achievement. In addition,

Good (2010) suggested that if learning achievements were low, perhaps something

different was needed, not more of the same. Moreover, he suggested that differentiated

instruction might yield higher learning achievements. Yet, Johanningmeier (2010) noted

that A Nation at Risk brought national awareness for academic excellence in the United

States and various accountability measurements may provide the impetus for increased

academic achievement.

34

Measurement of academic achievement. Recent empirical studies have

investigated effective methods of measuring students’ academic performance in schools

(Burns, 2010; Stubnisky, Perry, Hall, & Guay, 2012; Weaver, 2011). Although the

investigation of academic advancement has become a critical method for measuring the

effectiveness of educational organizations, federal and state authorities continue to

develop assessments that attempt to measure academic achievement. Weaver (2011)

suggested that standardized testing has become a key factor in measuring academic

achievement. However, Jorgenson (2012) affirmed that standardized testing does not

adequately measure academic achievement. While some researchers contend that

standardized testing inadequately assesses academic performance, various states have

developed standardized assessments that examine academic achievement.

In 2003, the Texas Education Agency (TEA), in conjunction with Pearson and

various Texas educators collaborated to develop TAKS, the Texas Assessment of

Knowledge and Skills (TEA, 2003). While the assessment evaluated reading, writing,

mathematics, science, and social studies, it failed to provide sufficient academic readiness

for Texas students. Thus, the TEA, in collaboration with the Texas Higher Education

Coordinating Board (THECB), developed a new appraisal system that focuses on

enhancing the postsecondary readiness of high school students. Additionally, the new

appraisal system ensures that Texas students are academically competitive both

nationally and internationally (TEA, 2009). On May 5, 2011, the TEA systematically

adopted a new method of measuring academic achievement. The State of Texas

Assessment of Academic Readiness (STAAR) was developed to enhance the rigidity of

the assessment as well as to evaluate knowledge and skills effectively. Some of the

35

components of the STAAR assessment include the transition of grade-based assessments

to course-based assessments, the establishment of academic readiness standards for

Algebra II and English III, and the utilization of validation studies to ensure that

performance standards are linked appropriately (TEA, 2012). Although the TEA contends

that the STAAR assessment will adequately measure academic achievement, minimal

empirical research has been conducted validating such assessment. Thus, further research

is necessary to investigate various relationships between the results from the TAKS and

STAAR.

School accountability. School accountability has become synonymous with

federal, state, and local efforts to measure and monitor academic achievement adequately

(Elmore & Furhman, 2001). The United States Department of Education mandated that

each elementary and secondary school meet Adequate Yearly Progress (AYP).

Undoubtedly, AYP, a mandate of the No Child Left Behind Act (NCLB), and the more

recent "Race to the Top" proposal (United States Department of Education, 2009), are

based on accountability frameworks. Although such accountability frameworks were

designed to improve academic achievement in the long term, researchers suggested that

the accountability measurement had a negative effect on public schools.

Mathis (2004) suggested that AYP and NCLB are statistically impossible. He

suggested that “high” standards are considered high because a limited number of people

achieve them. Obviously, if everyone achieved such standards, they would become low

standards. Powell, Higgins, Aram, and Freed (2009) conducted a qualitative study that

investigated the NCLB act as it relates to the decision-making of rural teachers and

administrators. Statistics were collected from 101 rural elementary school administrators

36

in Missouri and 76 rural elementary teachers in Maine. The results indicated that there

were significant changes in the use of instructional time for teaching reading and some

subjects and non-instructional time for recess and kindergarten naptime. Administrators

affirmed that professional development hours were used exclusively for maintaining

AYP. Maine teachers reported that NCLB benefits some groups of students more than

NCLB benefits others and that it has a harmful effect on student motivation. Similarly,

educationalists are discovering that the strategy is imperfect, developmentally

inappropriate, ill-funded, and ultimately is leaving more students, teachers, and schools

behind than ever before (Eisele-Dyrli, 2010; Krumenaker, 2009).

In the State of Texas, school achievement is evaluated through the state-mandated

accountability system in which ratings are given to individual school campuses and to

school districts based on academic performance. To earn the highest rating in the state,

Exemplary, schools must meet the following guidelines: no more than a 0.2% dropout

ratio, at least 90% of students passing the TAKS, as well as subgroups, and 95% of the

completion standard met (Texas Education Agency, 2008). To earn the second highest

rating, Academically Recognized, these criteria must be met: no greater than a 0.7%

dropout ratio, at least 75% of students passing the TAKS, as well as various subgroups,

and 85% of the completion standard (TEA, 2006, 2007, 2008).

Receiving the ranking of Academically Acceptable means having data indicating:

no greater than a 1.0% dropout rate, at least 65% of students passing the English

Language Arts (ELA), Writing, and Social Studies sections of the TAKS, as well as

subgroups, at least 45% of students passing the TAKS Math, including subgroups, at least

40% of students pass the TAKS Science measure, including subgroups, and 75% of the

37

completion standard met (TEA, 2008). To earn the lowest accountability ranking,

Academically Unacceptable, Individual school campuses and districts that do not meet

the basic requirement stated above are ranked as academically unacceptable.

Additionally, the state of Texas has mandated consequences for schools and school

districts that are categorized as academically unacceptable. Although the state of Texas is

currently replacing the TAKS with the STAAR, the accountability measurements are

identical.

No Child Left Behind and music education. In 2001, federally mandated policies

such as the NCLB act have negatively affected music education in public schools (Spohn,

2008; West, 2012; Abril & Gault, 2006). West (2012) suggested that the NCLB act is

adversely affecting school music programs, particularly schools that have made AYP.

Additionally, West argued that many music education programs are being reduced or

eliminated. Spohn (2008) incorporated a mixed method case study that investigated

teachers’ perspectives of the NCLB policy and its effect on arts programs, particularly

music programs. The sample population consisted of 20 elementary and secondary visual

arts teachers and 26 elementary and secondary music teachers. The data collected

revealed that administrative decisions made to improve standardized tests and the

accommodations of regulations mandated by NCLB have threatened arts education.

Similarly, Abril and Gault (2006) affirmed that NCLB budgets, standardized tests,

and scheduling presented adverse changes amongst music programs as stated by school

administrators. Circle (2005) suggested that music curricula across the United States have

been reduced due to increased time allotment in reading and math. Controversially,

examinations of NCLB have provided limited quantitative data suggesting the decline or

38

elimination of music education programs (Ashford, 2004; Colwell, 2005; Mishook &

Kornhaber, 2006).

Consequently, Chapman (2004) and Meyer (2005) projected that minimal time

would be available for electives as limited states fail to incorporate various electives such

as art, music, theatre, and physical education into their accountability systems. Anecdotal

evidence suggested that high-stakes accountability has affected scheduling practices of

public schools (Beveridge, 2010; West, 2012). Hence, Beveridge (2010) argued that

adverse scheduling practices caused by NCLB mandate are more far-reaching than mere

student class placement. Oftentimes, school districts would eliminate a student’s elective

if he or she did not perform adequately on state assessments. The elective course is often

replaced with remedial academic core subjects intended to increase standardized test

scores. Additionally, remedial teachers, as well as administrators, utilize enrichment

subjects such as music, art, and theatre to entice students to perform well on standardized

assessments.

While other methods could be adopted, such as after school tutoring, which

minimizes the least amount of disruptions to the school day (Beveridge, 2010),

administrators argue that pullouts from non-core subjects are the logical solution (Grey,

2010). Although empirical research has suggested that controversial connections exist

between arts education and NCLB mandates, further research could investigate budgets

and time allotment before and after the NCLB mandate.

Since its inception in 2011, Texas public schools have administered the STAAR

to measure academic achievement in students. While prior academic assessments failed

to provide sufficient academic readiness for Texas students, the STAAR was designed to

39

enhance the rigidity of the previous assessment. Additionally, this standardized academic

assessment meets the criteria mandated by the NCLB and is the most current and valid

academic standardized assessment for the state of Texas (TEA, 2012). Based on this

information, the STAAR was the most appropriate assessment to measure academic

achievement among middle school students in Texas.

Music and academic achievement. Empirical research suggested that an

inextricable relationship exists between music education and academic achievement

(Cavangh, 2009; Cox & Stephens, 2006). Argabright (2005) proposed that participation

in music education programs increase academic achievement. Additionally, Argabright

suggested that students who participated in private instrumental instruction significantly

enhanced academic learning. Moreover, Gadberry (2010) conducted a qualitative study

that examined music and the academic success of students. Surveys were distributed to

adults and children who were choir members and another was conducted with 500

parents and 300 educators. The findings implied that students who were choir participants

had higher grades and some of their parents and teachers suggested that their academic

achievement increased considerably after joining the choir.

Kelly (2012) conducted a quantitative correlational study that examined academic

achievement between 12th grade students who participated in fine arts courses,

particularly music, and students who did not enroll in fine arts courses. The data

represented all 12th grade students as provided by the Florida Department of Education,

consisting of the Florida Comprehensive Assessment Test (FCAT), grade point averages,

and the SAT scores of 12th grade students between 2007-2008 and 2010-2011. The

findings indicated that a strong and reliable relationship exists between individuals who

40

participated in fine arts programs and higher academic achievement. Furthermore, the

findings suggested that students participating in fine arts courses for eight or more

semesters demonstrated higher academic success. Scores on the FCAT for reading,

mathematics, and writing of individuals enrolled in fine arts courses remained consistent

when comparing the results from the years of 2010-2011 to 2007-2008. Differentiation

between the years of 2007-2008 and 2010-2011 FCAT scores were not statistically

significant (p = .05); thus, demonstrating stability.

Conversely, Elpus (2012) conducted a quantitative correlational study that

examined the college entrance assessment results of music and non-music students across

the United States. The researcher extracted data from the Educational Longitudinal Study

of 2002 (ELS), a nationally representative educational study facilitated by the National

Center of Education Statistics. Data collected from the 2004 graduates in the United

States indicated that approximately 36% of high school graduates earned at least one

credit in music-related courses. Elpus (2012) employed fixed-effect regression

procedures to compare standardized test results of music students who were defined as

being enrolled in at least one music-related course to non-music students while

controlling extraneous variables such as prior academic achievement, disposition towards

school, and demographic nature. The findings revealed that music students did not

academically surpass non-music students on the SAT. While Kelly’s (2012) findings

support the significance of music programs in schools, Elpus (2012) argued that musical

participation does not correlate to higher academic achievement.

After examining the methodological approaches from both studies, Kelly (2012)

and Elpus (2012) employed a quantitative correlational research design while Gadberry

41

(2010) employed a qualitative case study. Both Kelly (2012) and Elpus (2012) utilized

archival data to retrieve information regarding both variables in the studies. Elpus’ (2012)

sample consisted of students across the United States while the examples of Kelly (2012)

and Gadberry (2010) consisted of participants from their respective states. Gadberry’s

(2010) study lacked quantifiable evidence due to its scope. Elpus (2012) employed the

results from SAT scores to measure academic achievement, while Kelly (2012) employed

several instruments such as FCAT, grade point averages, and SAT scores to measure

academic achievement, which suggests a stronger analysis.

Music and the brain. Although compelling empirical research supports the value

of music education in schools, many music educators find themselves fighting for its

existence (Cole, 2011; Fujioka, 2006; Phillips-Silver, 2009). This timely new resource

has encouraged neuroscientists such as Howard Gardner, Erika Skoe, Nina Kraus, Diana

Deutsch, and Laurel Trainor to investigate the neurological components of the brain as

they relate to music.

Butler and Trainor (2012) indicated that musical training appears to modify the

brain’s auditory cortex, which processes sound. Additionally, the researchers suggested

that musical training leads to advanced levels of memorization, attention, and cognitive

development. However, music practitioners question whether those auditory and

cognitive processes are similar in academic learning. On May 5, 2009 at the Learning,

Arts, and the Brain Summit, various neuroscientists from Harvard, Princeton, and Yale

conducted a quantitative study that examined musical training as it relates to various

mathematical concepts. Although the neuroscientists emphasized that the study only

focused on relationships rather than cause and effect (as cited in Cole, 2011), the

42

researchers indicated that a strong relationship exists between prolonged musical training

and geometrical shapes. Similarly, Michael Posner (as cited in Cole, 2011) suggested that

the arts, particularly instrumental music, support continuous motivation that “provides the

cognitive benefit of strengthening executive attention networks in the brain, such as those

found in the midline and lateral frontal areas” (p.17). Additionally, the researcher

suggested that the arts programs can “help students to pay better attention in school due

to structural brain modifications developed when the students were engaged in practicing

their art forms” (as cited in Cole, 2011, p.21).

Furthermore, Francois, Chobert, Besson, and Schon (2013) indicated that musical

learning connects, develops, and refines the neurological and motor brain systems.

Posner, Rothbart, Sheese, and Kieras (2008) conducted a quantitative study that

examined arts participation influences cognitive processes through underlying

mechanism of attention, conflict resolution, and motivation. Posner et al. (2008)

theorized that:

a) there are specific brain networks for different art form; b) there is a

general interest for the arts; c) participants who display high interest for

the arts, training/support in those arts, develop high motivation; d)

motivation sustains attention; and e) high sustained motivation, while

engaging in conflict-related tasks, improve cognition. (Posner et al., 2008,

p.1)

The researchers tested their hypothesis by randomly assigning participants into a

control group, which performed a basic task, and an experimental group, in which

participants performed specific tasks under motivating conditions (a reward system). The

43

findings suggested that high levels of motivation yielded stronger improvements in tasks

performance, especially when motivation was sustained for extended periods. The

researchers indicated that the results ultimately support the idea that interest in the arts,

particularly music, allows for continuous motivation.

Hyde et al. (2009) conducted a longitudinal study that investigated the effects of

musical training on structural brain development. The study consisted of two control

groups. The first group consisted of 16 children with an average age of approximately

5.90 years old at the start of the study; these children did not receive any instrumental

music preparation during a 15-month period, but did partake in a bi-weekly group music

class in school. The second group consisted of 15 children with an average age of

approximately 6.32 years old at the start of the study; these children received private

piano instruction for 15 consecutive months. The findings suggested that children who

received private piano instruction exhibited greater behavioral improvements over 15

months on the finger motor, melody, and rhythmic tasks.

Neurological differentiations between students with prior musical training and

students with little or any musical training can be found in many parts of the brain.

Additionally, students with prior musical training tend to display higher mathematical

and phonemic abilities than students with little or any musical training (Butler & Trainor,

2012). Musacchia, Sams, Skoe, and Kraus (2007) argued that a musician’s basic sensory

mechanisms for coding visual and auditory processes might also be specialized in the

brain. The findings from their study provided empirical evidence that musicians display

stronger aural and visual brainstem reactions to speech than non-musicians. For this

44

reason, it is logical to assume that musical training may enhance phonological awareness

among students.

Music and reading. Empirical evidence suggested that a strong correlation exists

between music training and reading achievement (Cole, 2011; Michener & Fishoff, 2012;

Pane & Salmon, 2011; Piro, 2009). Pane and Salmon (2011) proclaimed that music is

interconnected with thought and aids children in drawing, talking, and reading, which are

all components of literacy development. While Waller (2010) suggested that music

teachers rarely provide opportunities for writing music, Oare and Bernstorf (2010)

suggested that music instruction enhances phonological processes that assist in

developing good readers and writers. Standley (2008) indicated that music education

curriculum enhances reading achievement among elementary students. Brian Wandell (as

cited in Cole, 2011) conducted a quantitative study to determine whether a correlation

existed between reading fluency and musical training. The study consisted of 49 children

from the ages of 7 to 12. The study indicated that the children who read rhythms and

recognized pitches would most likely demonstrate fluency when reading books or stories.

Piro and Ortiz (2009) conducted a quantitative quasi-experimental study that

examined the relationship between musical training and literacy skills amongst second

grade students. The study selected two elementary schools located in the same

geographic vicinity with similar ethnic and socioeconomic qualities. As a part of a

comprehensive instructional intervention program, one elementary school (n=46)

engaged in musical learning by studying piano formally for three consecutive years.

Conversely, children attending the control school (n=57) received no musical training

from their respective school. Both elementary schools were engaged in a comprehensive

45

balanced literacy programme that engaged children in reading, writing, speaking, and

listening. All children were tested to evaluate their literacy development at the beginning

and at the ending of a standard school year utilizing the Structure of Intellect (SOI). The

findings indicated that children who studied piano formally had significantly higher

vocabulary and phonological awareness than children with no formal musical training.

Supportively, Ho, Cheug, and Chan (2003) conducted a similar study that

examined the relationship between musical instruction and phonemic reasoning. The

study consisted of 90 males between the ages of 6 and 15 from the same geographical

location. The study suggested that the participants with prior musical training had

significantly higher phonological awareness and retention abilities than those with little

or no musical training. Additionally, a follow-up study determined that the effect was

causal. Butzlaff (2000) conducted a meta-analysis of 25 correlational studies that

examined reading achievement and music training. The sample sizes ranged from 50

students to 500,000 students. The study indicated that a strong and reliable relationship

existed between reading comprehension and music participation. While Ho, Cheug, and

Chan (2003) found that musical instruction enhances phonological awareness, the study

only involved male participants. If the researchers would have included female

participants, it may have yielded different outcomes.

Deutsch, Dougherty, Shachar, Tsang, and Wandell (2009) conducted a

longitudinal study that investigated the development of reading ability and the brain. The

study consisted of 49 participants from the ages of 7 to 12. Although questionnaires were

sent to the participant’s guardian concerning their involvement in the arts, only the

parents of 41 children responded to the survey. The amount of musical training was then

46

correlated with the participant’s assessment scores in literacy development and

phonological awareness over three consecutive years. The study suggested that a

statistically significant correlation existed between the amount of musical training and the

participants’ reading fluidity.

Music and mathematics. More than two-thirds of students in the United States

are not proficient in mathematics (Campbell, Malkus, 2011; Hemmings, Grootenboer,

Kay, 2011; National Assessment of Educational Progress, 2009). This overwhelming

evidence has prompted researchers to investigate current trends to promote awareness in

public schools.

Empirical research indicated that math and music are related in the brain from

very early in life (E.A. Geist, K. Geist, Kuznik, 2012; Linder, Powers-Costello, &

Stegelin 2011; Zenter, Eerola, 2010). Geist et al. (2012) suggested that musical elements

such as tempo, rhythm, and steady beat enhance mathematical concepts such as spatial

properties, counting, and sequencing. McLelland (2005) conducted a quantitative study

that investigated participation in music on academic achievement. The researcher

examined standardized assessment data, mathematics, and reading from multiple school

years (2001-2002 and 2003-2004) from approximately 356 fifth grade students. The

researcher identified a statistically significant difference in reading and mathematics

performance between the fifth grade music participants and non-participants.

Additionally, students who participated in instrumental music had a mean score that was

7.9191 points higher in reading and 8.590 points higher in math.

Deere (2010) conducted a mixed method study that examined the influence of

music education programs on reading and mathematics achievement. The study consisted

47

of 57 fourth grade students and 63 eighth grade students. Additionally, the study

investigated the perception of music education programs by surveying administrators,

school board members, and teachers. Quantitatively, the author analyzed the reading and

mathematics scores on the Tennessee Comprehensive Assessment Program (TCAP). The

findings suggested that 76% of the respondents agreed on the importance of music

education programs in schools. Additionally, the findings indicated that fourth grade

participants who had prior musical training scored significantly higher than participants

who had no musical training on the TCAP. Yet, eighth grade students who had prior

music training only scored significantly higher than their peers who had no musical

training on the reading section of the TCAP. Further research is needed to determine if

the relationship between music participation and academic achievement is causal.

Spelke (2009) conducted a quantitative experimental study that examined the

relationship between musical training and cognitive systems in mathematics. Two

experiments examined the participants’ mathematical ability on a district-standardized

assessment. The first experiment consisted of participants from the ages of 5 to 17 from a

Massachusetts community. All participants attended either a Saturday musical training or

an athletic training event. Both groups were examined utilizing a multiple regression.

Predictors such as musical and athletic training were found to be poor predictors for

mathematical ability. Age was found to be a more reliable predictor.

Furthermore, Spelke’s (2009) second experiment examined 61 students, ages 8 to

13, while receiving intense musical instruction from music schools in Boston to a group

of students with little or no musical background. Participants were assessed on six

different mathematical abilities, equivalent to the assessment incorporated in the first

48

experiment. A statistical analysis, two-way ANCOVA with age and verbal IQ, were

applied to determine its relationship. The study indicated that children with intense

musical instruction performed higher on all measures of geometric abilities than children

with little or no musical instruction. However, the experiment did reveal that a

statistically significant relationship did not exist with musical training and spatial ability.

Indeed, empirically researched data supports the notion that music improves

mathematical achievement. However, Cox and Stephens (2006) conducted a quantitative

study that compared high school students who acquired music credits to those who had

none. The researcher concluded that no statistically significant difference was found in

their mean math grade averages.

With ever-increasing demands on academic performance in schools, researchers

have examined the value of music education programs. Although Spelke (2009), Deere

(2010), McLelland (2005), and Deutsch, Dougherty, Shachar, Tsang, and Wandell (2009)

found positive correlations between students who participated in music programs and

higher academic achievement, Cox and Stephens (2006) found no differences between

music participation and academic achievement. Interestingly, most of the empirical

evidence that displayed strong correlations between music participation and academic

achievement consisted of samples from a diverse population which exhibited a higher

generalizability. However, the sample of Cox and Stephens (2006) was limited to

predominantly one ethnic population and socioeconomic status. In addition, the study

consisted of primarily female participants, which limits the scope of the study.

Musical aptitude and academic achievement. Compelling empirical evidence

has examined musical aptitude (Gordon, 2007; Karma, 2007; Rutkowski, 1996; Ukkola-

49

Vuoti et al., 2013) and academic achievement (Cheema & Galluzzo; 2013; Maltese, Tai,

& Xitao, 2012; Musa, 2013; Rowe, Miller, Ebenstein, & Thompson, 2012; Schutz,

Simon, & Musgrave, 2013; Stanley & Stanley, 2011; Talley & Scherer, 2013; Toldson,

2012; Young, Hyuck, Sunyoung, & You Kyung, 2012). However, few studies have

examined the relationship between musical aptitude and academic achievement

(Holsomback; 2002; 2004; Kuhlman, 2005; Rubinson, 2010).

Holsomback (2001) conducted a quantitative correlational study that examined

the relationship between musical aptitude and various academic achievement measures of

beginner instrumental music students. The standardized academic performance

assessments utilized in the study were the composite performance levels of the reading

and mathematics sections of the Iowa Test of Basic Skills Test (ITBS), The Otis-Lemon

School Abilities Test (OLSAT), the Metropolitan Achievement Test (MAT), and the

Texas Assessment of Academic Skills Test (TAAS). The musical aptitude assessment

utilized in the study was the Selmer Music Guidance Survey. Since no research was

found in the literature with regard to the Selmer Music Guidance Survey, the reliability

coefficient was established for the composite scores.

The study consisted of 104 sixth grade band students in an east Texas school

district. The study employed a purposive sampling technique. Students were administered

the Selmer Music Guidance Survey to serve as a guide to instrument assignment and

performance indicator of the individual needs of the selected band students at the

beginning of the school year. Additionally, students were interviewed for physical

characteristics to identify strengths and weaknesses that may enhance instrumental music

training. Although the researcher hypothesized that no relationship exists between the

50

variables, the study revealed that a strong statistical relationship existed between musical

aptitude and academic achievement. Additionally, the study indicated that further

research should be conducted to investigate the relationship between musical aptitude and

other standardized assessments.

Similarly, Rubinson (2010) conducted a quantitative correlational study that

investigated the relationship between musical aptitude and the developing reading

abilities of kindergarten students. The researcher employed the tonal and rhythmic

components of the PMMA to determine the amount of musical aptitude of kindergarten

students. Reading achievement was measured by various subtests of Dynamic Indicators

of Basic Early Literacy Skills (DIBELS), a standardized assessment of early literacy

development that evaluates the alphabetic abilities, and phonological awareness of

kindergarten students. The study consisted of 80 kindergarten students from an

elementary school in central Connecticut. The convenience sample consisted of

kindergarten students in the Fall Semester of the 2008-2009 school year.

The study employed both bivariate and multivariate correlational statistics to

determine the relationship between musical aptitude and reading achievement scores.

Multiple regression analyses, yielding multiple correlation coefficients (R), were

employed to investigate the correlation between phonological awareness and tonal and

rhythmic aptitude. The findings suggested that musical aptitude is reliably and strongly

associated with phonological awareness and early reading development of kindergarten

students. Additionally, further research is recommended to determine whether the

relationships established in this study are causal.

51

Peynircioglu, Durgunoglu, and Uney-Kusefoglu (2002) conducted a quantitative

study that investigated the relationship between musical aptitude and phonological

awareness among pre-school children. The study consisted of 31 Turkish children and 29

American children ranging from 4 to 6 years of age. In experiment one, Turkish children,

and in experiment two, American children, completed multiple phoneme tasks with

words in their respective languages and with pseudo-words. Additionally, the children

completed a tone deletion task by listening to snippets of melodies. Due to limited

reading skills, both assessments were evaluated largely on pure auditory abilities.

The reading assessment consisted of reading simple phrases and then sequentially

simpler words. Participants who were capable of identifying even the simplest words

were excluded from the study. The musical aptitude assessment focused on pitch and

rhythm. The assessment consisted of singing back various melodies including major and

minor intervals and reproducing several rhythms varying in length and complexity. The

participants’ responses were recorded and two independent judges with expertise in

music evaluated each participant’s musical aptitude. In both experiments, the findings

concluded that a statistically significant correlation existed between musical aptitude and

overall phonological awareness. Particularly, children who demonstrated high musical

aptitude performed better on the verbal phonological awareness than children with low

musical aptitude (Peynircioglu, Durgunoglu, & Uney-Kusefoglu, 2002).

Empirical studies suggested that a positive correlation exists between musical

aptitude and academic achievement (Holsomback, 2001; Peynircioglu, Durgunoglu, &

Uney-Kusefoglu, 2002; Rubinson, 2010). Each study employed various academic and

52

musical aptitude assessments relevant to their population. Additionally, each study

contributed vital methodological information that influenced this study.

Methodology. The majority of the methodology from the research described

throughout this literature review section has been quantitative. Holsomback (2001)

employed a quantitative methodology followed with a correlational design. The study

investigated the relationship between musical aptitude, as measured by the Selmer Music

Guidance Survey, and academic achievement, as measured by the composite scores of the

ITBS, OLSAT, MAT, and TAAS. The study consisted of 104 sixth grade band students.

The data from the study was analyzed with the use of four Pearson r correlation

coefficients to determine the linear relationship between both variables. The study

indicated that moderate relationship existed between musical aptitude and academic

achievement (correlations ranging from .393 to .472).The study could have demonstrated

a stronger analysis by analyzing both composite and sub dimensions of all variables

presented in the study. Additionally, a longitudinal study may have revealed different or

similar relationship patterns, but this study was limited to the 1999-2000 school year.

Rubinson (2010) employed a quantitative study that examined the relationship

between the musical aptitude and phonological awareness of kindergarten students. The

study consisted of 62 kindergarten students. The data from the study was analyzed with

the use of a Pearson r correlation coefficient to determine the linear relationship between

both variables. The study indicated that moderate-sized correlation existed between

musical aptitude and academic achievement (correlations ranging from .27 to .38).The

study produced robust analyses by examining composite and sub dimensions

performance levels of both variables in the study. However, the study was limited by a

53

small sample of participants from only one public elementary school. Additionally, the

research findings may not be applicable to different student populations, geographical

locations, and grade levels. If the study consisted of sample populations from different

ethnic groups and socioeconomic status, the study may have benefited schools with

diverse populations. As correlational studies do not investigate cause and effect, causal

conclusions cannot be drawn from the study (Yin, 2009).

Peynircioglu, Durgunoglu, and Uney-Kusefoglu (2002) employed a quantitative

methodology with a correlational design to determine the relationship between musical

aptitude and phonological awareness of preschool children. The study consisted of 31

Turkish children and 29 American children ranging from 4 to 6 years of age. The study

indicated that a large correlation (ranging from .45 to .52) existed between musical

aptitude and phonological awareness. The study produced strong analyses by correlating

sub-dimensions of phonemes, initial consonant, and initial vowel to musical aptitude.

However, the study did not utilize a standard musical aptitude assessment. Rather, two

independent judges with expertise in music evaluated each participant’s musical aptitude.

This ultimately questions the consistency of the scoring; thus, bringing into question the

validity of the study.

The methodological strengths and weaknesses presented in this section provided

important information regarding the relationship between musical aptitude and academic

achievement. Each researcher (Holsomback, 2001; Peynircioglu, Durgunoglu, and Uney-

Kusefoglu, 2002; Rubinson, 2010) employed a quantitative methodology using a

correlational design to determine the extent of the relationship between both variables.

However, each study employed different assessments to measure musical aptitude and

54

academic achievement. Ultimately, their research identified the gaps in the literature and

allowed this study to analyze the relationship between musical aptitude and the composite

and sub-categorical performance levels of the STAAR among beginning band students.

Instrumentation. For the purpose of this research study, two sources of data were

used. Both sources of data have been successfully used in other empirical studies to

measure musical aptitude and academic achievement. Many music educators employ

tests such as Intermediate Measures of Musical Audiation (1986), Kwalwasser-Dykema

Music Tests (1930), and The Seashore Measures of Musical Talents (1919) to measure

musical aptitude. Yet, Gordon’s Intermediate Measures of Musical Audiation (1986) is

the only brief, longitudinally valid music aptitude test for Grades 1 through 6. In addition,

several empirical studies (Brophy, 2005; Hodges & O’Connell, 2005; Hornbach &

Taggart, 2005; Tomei, 2010) have employed the IMMA assessment to measure the level

of musical aptitude. Based on this information, the IMMA was the most appropriate

assessment to measure musical aptitude among middle school band students.

Developed in 2011, Texas public schools have administered the STAAR to

measure academic achievement in students. While prior academic assessments failed to

provide sufficient academic readiness for Texas students, the STAAR was designed to

enhance the rigidity of the previous assessment. This standardized academic assessment

meets the criteria mandated by the NCLB and is the most current and valid academic

standardized assessment for the state of Texas (TEA, 2012). Although few studies

(Johnson, Johnson, & Johnson, 2012; Johnson, Wilson, & Rossi, 2013) have utilized the

STAAR to measure academic achievement, it is the most current academic standardized

55

assessment in Texas. Based on this information, the STAAR was the most appropriate

assessment to measure academic achievement among middle school students in Texas.

Several validation studies were conducted on both, the IMMA and STAAR

instruments and demonstrated acceptable internal consistency across the studies and

yielded an acceptable split-half reliability estimate and acceptable test-retest reliability

coefficient alpha ranging from .76 to .91. Both of the scales’ reliabilities were generally

high, exceeding the conventional standard of .70 for internal consistency recommended

in the literature (Fraenkel, Wallen, and Hyun, 2012). Utilizing the data from Gordon

(1986) and Texas Education Agency (2014) validity testing, both instruments were valid

and acceptable for use in this study.

Summary

This chapter provided a broad overview of the existing literature related to the

variables in the study. Additionally, this study was formed primarily by the theoretical

foundations of Gordon’s music learning theory and Gardner’s theory of multiple

intelligences. This research advanced these theories by providing empirical data on

musical aptitude and academic achievement. The research questions in this study aligned

with multiple theoretical models as they provided a rationale on the relationship between

musical aptitude and academic achievement. Each theoretical model provided the

foundation for this study. Five thematic ideas discussed in the literature review were (a)

overview of musical aptitude, (b) academic achievement, (c) music in relation to

academic achievement, (d) musical aptitude and academic achievement, and (e)

methodological strengths and weakness from similar studies. These themes are related to

the focus of this study as they contribute to the overall understanding of the research

56

topic. The study addressed if, and to what degree, a correlation existed between musical

aptitude and the composite and sub-categorical performance levels of the STAAR among

sixth grade beginning band students.

A quantitative approach followed with a correlational design was chosen for the

purpose of collecting and analyzing numerical data regarding the relationship between

musical aptitude and academic achievement of sixth grade beginning band students.

Many of the previous studies on this topic employed quantitative methods and provided a

basis for the continued use of this approach (Holsomback, 2002; 2004; Rubinson, 2010;

Tomei, 2010). Additionally, these studies have utilized the IMMA assessment in order to

measure the level of musical aptitude.

Empirical evidence has identified strong relationships between musical learning,

musical aptitude and academic achievement (Deutsch et al., 2009; Gadberry, 2010; Ho et

al., 2003; Holsomback; 2002; Holsomback, 2004; Kuhlman, 2005; Piro & Ortiz, 2009

Rubinson, 2010). While it is reasonable to believe that a high quality music program

intended to increase musical aptitude may also increase standardized assessments, a gap

in the literature existed concerning the relationship between musical aptitude and the

composite and sub-categorical performance levels of the STAAR among beginning band

students. Additionally, future research should be conducted to determine if the

relationship is causal.

Previous studies suggested that a strong and reliable relationship existed between

musical aptitude and academic achievement (Holsomback; 2002; 2004; Rubinson, 2010;

Tomei, 2010). Additionally, previous studies identified a gap in the literature and allowed

this study to analyze the relationship between musical aptitude and the composite and

57

sub-categorical performance levels of the STAAR among beginning band students

(Holsomback, 2002; 2004). The study evaluated the gaps in the literature and added to

the existing research of musical aptitude and academic achievement. Chapter 3 will

provide a detailed discussion of the methodology that will be employed in the study.

58

Chapter 3: Methodology

Introduction

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the level of musical aptitude and the reading and

mathematics scores of the STAAR among beginning band students. Specifically, this

quantitative correlational study examined the relationship between musical aptitude, as

measured by the Intermediate Measures of Musical Audiation (IMMA), and the reading

and mathematics performance levels of the STAAR. Previous empirical research has

examined the relationship between musical aptitude and various academic assessments

(Cavangh, 2009; Holsomback, 2004; Rubinson, 2010). Yet, a gap existed in the literature

concerning the relationship between musical aptitude, as measured by the IMMA, and the

current standardized assessment, as measured by the reading and mathematics sections of

the STAAR (Holsomback, 2002).

Previous research has defined a gap in the literature and justified the need for this

study. The findings of this study advanced the understanding of the relationship between

musical aptitude and academic achievement among sixth grade beginning band students.

In addition, the findings may assist teachers, administrators, and educational policy

makers in determining the relationship of music instruction to reading and mathematics

achievement of middle school students.

This chapter focused on the quantitative research method to examine the

relationship between the level of musical aptitude and the level of academic achievement

among middle school students. This chapter included a brief description of the problem,

research questions and hypotheses, population and sampling procedures, instrumentation,

59

and data collection and analysis procedures, validity, reliability, ethical considerations,

and potential limitations.

Statement of the Problem

It was not known if, and to what degree, a correlation existed between the level of

musical aptitude assessed on the IMMA and the level of reading and mathematics scores

on the STAAR among beginning band students. Empirical research has examined the

relationship between musical aptitude and various academic assessments (Cavangh,

2009; Holsomback, 2004). E. A. Geist, K. Geist, and Kuznik (2012) suggested that

musical elements such as tempo, rhythm, and steady beat enhance mathematical concepts

such as spatial properties, counting, and sequencing. Supportively, Oare and Bernstorf

(2010) suggested that music instruction enhances phonological processes that assist in

developing good readers and writers.

Although empirical evidence correlates musical elements to academic

achievement, minimum evidence has been presented on the relationship between musical

aptitude and the reading and mathematics sections of the STAAR. Holsomback (2002)

indicated that a strong statistical relationship existed between musical aptitude and

academic achievement. Yet, he suggested that further research should examine the

relationship between musical aptitude and other standardized assessments. For this

reason, it was necessary to determine whether a relationship existed between musical

aptitude and the current academic standardized assessment in Texas.

Understanding the relationship between musical aptitude and academic

achievement was essential in finding effective ways to utilize music instruction to

enhance reading and mathematics achievement of middle school students. In addition,

60

this study contributed to solving the problem by providing a quantitative analysis on the

relationship between the level of musical aptitude and the level of reading and

mathematics scores on the STAAR among beginning band students in Texas. Therefore,

this study investigated the composite and sub-categorical performance levels of the

IMMA and the STAAR. The study consisted of a middle school band program from a

rural school district in northeast Texas.

Research Questions and Hypotheses

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the level of musical aptitude and the reading and

mathematics scores on the STAAR among beginning band students in Texas. This

research was framed in the theoretical context that learning in general, and more

specifically musical learning, is a person's ability to understand and process sound,

rhythm, patterns in sound, relationships between sounds, and process rhymes and other

auditory information. In order to understand various relationships between musical

aptitude and academic achievement among beginning band students, appropriate research

questions were essential. In addition, the research questions and hypotheses related to the

problem statement by examining the relationship between the level of musical aptitude

and the reading and mathematics scores on the STAAR. The following research questions

and hypotheses guided this study:

R1: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the mathematics scores of the STAAR

among beginning band students?

61

H1: A correlation exists between the level of musical aptitude and the composite and

sub-categorical performance levels of the mathematics section of the STAAR

among beginning band students.

H01: A correlation does not exist between the level of musical aptitude and the

composite and sub-categorical performance levels of the mathematics section

STAAR among beginning band students.

R2: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the reading scores of the STAAR among

beginning band students?

H2: A correlation does exist between the level of musical aptitude and the composite

and sub-categorical performance levels of the reading section of the STAAR

among beginning band students.

H02: A correlation does not exist between the level of musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

In a correlational study, there are no true independent and dependent variables as

the design is non-experimental. In this study, variable 1 was measured by the IMMA

assessment. The IMMA is a 20-minute musical aptitude assessment that measures the

potential musical ability in an individual (Gordon, 2003). Variable 2 was measured by the

STAAR. The STAAR is a sequence of state mandated standardized assessments currently

used in Texas public schools to evaluate student achievement and knowledge in each

grade level. The STAAR includes annual assessments for grades three to eight in reading

and mathematics; assessments in writing at grades four and seven; in science at grades

62

five and eight and in social studies at grade eight. In addition, the STAAR includes end-

of-course assessments for English I, English II, Algebra I, Biology, and U.S. History

(Texas Education Agency, 2011). Both the IMMA and the STAAR are secondary data.

Research Methodology

A quantitative methodology was employed to determine if, and to what degree, a

correlation existed between musical aptitude and the STAAR among beginning band

students. Quantitative studies examine empirical theories utilizing numerical variables

designed to represent the theoretical concepts so that mathematical relationships can be

revealed (Yin, 2009). Fraenkel, Wallen, and Hyun (2012) suggested that quantitative

methodology comprises of explicit hypotheses. Additionally, the quantitative approach

utilizes objective instruments such as multiple choice standardized assessments,

questionnaires, personality scales, and aptitude assessments. The rationale of selecting a

quantitative methodology was to develop an objective way of testing theories by

examining relationships between variables that can be measured and analyzed using

statistical procedures, resulting in numerical results (Fraenkel, Wallen, & Hyun, 2012).

Prior empirical research has utilized a quantitative methodology to determine the

relationship between musical aptitude and standardized assessments (Holsomback, 2001;

Kuhlman, 2005; Rubinson, 2010). Quantitative research involves testing objective

theories by examining the relationships among measurable variables so the researcher can

utilize statistical procedures to analyze the numerical data. In addition, quantitative

research involves an empirical analysis of data collected from a sample of individuals

from specific populations to make a generalizable observation for the whole based on the

measure of relationships (Fraenkel, Wallen, & Hyun, 2012). Since this study sought to

63

investigate the relationship between musical aptitude and academic achievement, a

quantitative methodology was most appropriate.

Qualitative research expands the range of knowledge and understanding of the

world beyond the researchers themselves. It often helps one see why a situation is the

way it is, rather than just presenting a phenomenon (Yin, 2009). Since qualitative

research attempts to investigate naturally occurring phenomena in all their complexity

(Yin, 2009), a qualitative methodology was inappropriate for the research questions and

hypotheses. In a mixed methodology, qualitative and quantitative methods are used in

tandem to strengthen the study. Since this study involved the analysis of exclusively

secondary interval and ratio data and there was no interaction between the student and the

researcher, a mixed methodology was inappropriate for the current study. Proper

selection of methodology is imperative in comprehending and interpreting the results

based on the research questions and hypotheses (Fraenkel, Wallen, & Hyun, 2012).

The study utilized the results from the 2013-2014 IMMA assessment and the

results from the 2013-2014 reading and mathematics sections of the STAAR. The study

evaluated the results to determine if a statistically significant relationship existed between

musical aptitude and the reading and mathematics sections of the STAAR among

beginning band students. The study consisted of a middle school band program from a

rural school district in northeast Texas. The independent variable, musical aptitude, was

measured by IMMA assessment. The dependent variable, academic achievement, was

measured by the results from the reading and mathematics sections of the STAAR.

64

Research Design

A correlational research design was utilized for this study. Previous empirical

research has utilized a correlational research design to determine if a correlation exists

between musical aptitude and academic achievement (Holsomback, 2001; 2002;

Rubinson, 2010). Holsomback (2001; 2002) employed a quantitative correlational design

that examined the relationship between the independent variable, musical aptitude, and

the dependent variable, academic achievement, of sixth grade band students. Both studies

found strong correlations between musical aptitude and academic achievement. However,

Rubinson (2010) employed a correlational research design that examined the musical

aptitude and phonemic awareness of kindergarten students. The results revealed a strong

relationship existed between musical aptitude and phonemic awareness. As this

dissertation study did not “seek to determine reasons or causes for preexisting differences

in groups of individuals” (Fraenkel, Wallen, & Hyun, 2012, p. 365), a causal-comparative

research design was inappropriate for this study.

In experimental research, variables are manipulated, and the effects of this

manipulation are measured upon the dependent variable. Additionally, in experimental

research a treatment is deliberately imposed on a group of objects or participants

(Fraenkel, Wallen, Hyun, 2012). As this study did not attempt to manipulate the

variables, an experimental research design was inappropriate for the study.

According to Fraenkel, Wallen, and Hyun (2012), a correlational research design

“seeks to investigate the extent to which one or more relationships of some type exist”

(p.11). Furthermore, a correlational research design is useful to researchers who are

interested in determining to what degree two or more variables are related; however, a

65

correlational design does not determine causation. Therefore, a correlational study was

the most appropriate design to identify a strong and reliable relationship between musical

aptitude and the STAAR among beginning band students.

Examining the correlation between one independent variable and one dependent

variable will simplify the data analysis process (Yin, 2009). Additionally, the researcher

correlated the sub dimensions of each variable in order to obtain a more robust analysis.

The data from the study was analyzed with the use of a Pearson r correlation coefficient,

which provided a numerical summary of the data. According to Bluman (2012), the

correlation coefficient usually determines the strength of the relationships between

variables and the direction of that relationship. The range of the coefficient lies between -

1 and +1; the closer the correlation coefficient is to +1, the stronger the positive

relationship. Meanwhile, the closer the correlation coefficient is to -1, the stronger the

negative relationship. A positive relationship means that as one value increases, so does

the other; a negative relationship means that as one value increases the other decreases.

Furthermore, the purpose of a quantitative study usually predicts, explains, or controls

phenomena through a precise process of collecting numeric data (Fraenkel, Wallen, &

Hyun, 2012). As this study investigated the linear relationship between variables, the use

of a Pearson r correlation coefficient was most appropriate.

The researcher utilized archival data, results from the 2013-2014 IMMA

assessment, and the results from the 2013-2014 reading and mathematics sections of the

STAAR. The researcher evaluated the results to determine if a statistically significant

relationship existed between musical aptitude and the reading and mathematics sections

of the STAAR among beginning band students. The study consisted of 65 sixth grade

66

students from a rural school district in northeast Texas. The independent variable,

musical aptitude, was measured by IMMA assessment. The dependent variable, academic

achievement, was measured by the results from the reading and mathematics sections of

the STAAR.

Population and Sample Selection

The setting for this study was in a small rural east Texas public school district.

According to the 2013-2014 district’s annual report, the enrollment consisted of

approximately 1,100 students in grades kindergarten through 12, where 10% were

African American students, 81% were Caucasian American students, 9% were Hispanic

American students, and 71% of the total students were categorized as economically

disadvantaged. The district comprises three schools, one elementary school, one middle

school, and one high school. The 2013-2014 sixth grade class consisted of 65 students,

who were all required to enroll in a beginning band course and take a musical aptitude

assessment. Therefore, the target population (N = 65) consisted of sixth grade band

students enrolled in the school district. An a priori power statistical analysis was

conducted using G* Power 3.0.10. The analysis suggested that a minimum of 85 students

are required for this study. The researcher set statistical significance as α = .05 and power

at .8 (β = .8) for this study. Since this study did not meet minimum sample size, the study

was identified as significant only if a large correlation existed. After the data analysis

process, the researcher conducted a post hoc analysis to identify patterns not specified in

an a priori analysis. Post hoc power for all tests were calculated at greater than 0.8 (See

Appendix E). As all students were required to take both IMMA and STAAR assessments,

a randomization sample selection was not applicable to this study. Total population

67

sampling was appropriate for this study in order to identify a sample with certain

characteristics relevant to the population of interest (Fraenkel, Wallen, & Hyun, 2012).

Consent to conduct this study was attained from an east Texas school district and

Grand Canyon University Institutional Review Board. Student identity and

confidentiality was preserved through an alpha numeric coding system so that the

identification of participants was confidential. The data from the IMMA and STAAR

assessments was obtained from archival school records located in the counselor’s office

for the school year. The data from both the IMMA and STAAR assessments was

provided within the data spreadsheet. Identifiers such as student names, identification

numbers, dates of birth, and addresses was removed by the counselor and assigned a

numeric code to ensure anonymity. Then, the raw data was imported into SPSS for

analysis.

Instrumentation

For the purpose of this research study, two sources of data were used. Both

sources of data were successfully used in other empirical studies to measure musical

aptitude and academic achievement. IMMA (Gordon, 1986) was utilized to determine

overall musical aptitude. The STAAR (Texas Education Agency, 2014) was utilized to

evaluate the reading and mathematics achievements of study participants. The IMMA

was chosen because it is a well-established tool for measuring musical aptitude. The

STAAR was chosen because it is the most current assessment for evaluating academic

achievement in Texas. Both the STAAR (shown in Table 1) and the IMMA (shown in

Table 2) assessments have adequate reliability and validity, the composite alpha

coefficient ranging from .889 to .92 and yielding continuous data.

68

IMMA. The IMMA, according to Gordon (2003), is the only brief longitudinally

valid musical aptitude assessment for kindergarten through sixth grade students. The

IMMA was designed to measure developmental musical aptitude. The IMMA produces

three scores: Tonal, Rhythmic, and Composite. The IMMA assessment is divided into

two sub sections, Tonal and Rhythm, and each section takes 20 to 25 minutes to

administer. As specified in the test manual procedures, each subtest is administered on

different days with the Tonal subtest given first. Each subtest consists of approximately

40 pairs of short tape-recorded tonal or rhythmic sequential aural excerpts. The tonal

excerpts are played without rhythmic patterns. The duration of pitches are equal lengths.

The rhythmic excerpts are played on only one pitch. Each excerpt both Tonal and

Rhythmic are played twice. Participants are asked to distinguish by indicating if the first

and second excerpts are the same or different. A composite score is developed once the

results from both the Tonal and Rhythmic subtests are combined and yield continuous

data.

STAAR. The STAAR is a sequence of state mandated standardized assessments

currently used in Texas public schools to evaluate student achievement in each grade

level (TEA, 2009). Specifically, the STAAR includes annual assessments for grades three

through eight in reading and mathematics. The students will have four hours to complete

each assessment. Students will begin the assessment after instructions from their test

administrator. The four hour time limit stops only for a 30 minute lunch break. Both the

reading and mathematics sections of the STAAR for grade six consist of approximately

45 multiple choice questions and are identified as continuous data.

69

Validity

According to Freankel, Wallen, Hyun (2012), validity is the process of evaluating

whether an instrument is designed and successful at measuring what it is designed to

measure. Validity coefficients are considered statistically significant at .05 or .01 levels.

IMMA validity. The IMMA instrument has initial construct validity, longitudinal

validity, congruent validity, have at least some validity and some are more valid for

certain purposes than others. Both subjective considerations and objective evidence are

presented in support of validity for the IMMA.

Many research studies have been conducted to ensure the validity of the IMMA.

Gordon (1986) conducted several validation studies and included different samples.

Content validity was established through taxonomic research. Longitudinal predictive

validity was undertaken at a private academy for boys in Chestnut Hill, Pennsylvania.

The longitudinal predictive validity coefficients ranged from .64 to .90. This information

provided objective evidence for establishing concurrent validity. Gordon (1986)

established congruent validity by correlating two tests that were designed to measure the

same factor. The author examined the correlations between the Primary Measures of

Musical Audiation (PMMA) and the Musical Aptitude Profile (MAP). The correlation

coefficients ranged from .47 to .71. In addition, the IMMA was found to have congruent

validity with the PMMA. The correlation between the composite score of IMMA and the

composite score of PMMA is .61.

STAAR validity. The results of the STAAR assessment are used to guide

educational planning related to the knowledge and skills that students are acquiring in

each academic content area. Validity evidence from the STAAR originated from a variety

70

of sources. External and internal validity was collected to inform standard settings and

support interpretations of performance standards. Results on each assessment are

interconnected across grades to performance on similar assessments. One method by

which Texas provides validity evidence for the STAAR assessment is by examining the

relationship between the performance on the STAAR and the performance on other

standardized assessments. The process is commonly referred to as criterion-related

validity. Several validation studies were conducted and demonstrated acceptable internal

consistency across the studies and yielded an acceptable split-half reliability estimate and

acceptable test-retest reliability coefficient alpha of .76 (Texas Education Agency, 2014).

Utilizing the data from Gordon (1986) and Texas Education Agency (2014) validity

testing, both instruments were valid and acceptable for use in the study.

Reliability

The use of a reliable instrument is essential to a strong research study (Fraenkel,

Wallen, Hyun, 2012). In addition, the stability to test results is best interpreted through

reliability information. Subtle and extraneous factors that contribute to bias and

unreliability in human judgment have no effect on objective test scores. The less test

results are affected by individual physical and psychological deviations and by

distractions within and outside of the classroom, the more reliable the test (Gordon,

2003). Test reliability varies with type, content, and length. However, reliability

coefficients generally range between .00 and .95 (Frankel, Wallen, & Hyun, 2012).

IMMA reliability. Each reliability coefficient for the IMMA (summarized in

Table 1) is an index of the stability of the test scores. However, the split-halves

coefficient derived from only one administration of each test is more influenced by the

71

homogeneity of test content, and the test-retest coefficient is more influenced by physical

and psychological changes in the individual and by different environmental conditions.

Thus, test- retest coefficients are typically lower than split-halves coefficients for a given

test. Test-retest reliabilities for all grade levels for Tonal, Rhythm, and Composite scores

ranged from .76 to .80, .80 to .88, and .80 to .91 (Gordon, 1986), which exceeds the

conventional standard of .70 recommended in the literature (Fraenkel, Wallen, & Hyun,

2012).

Table 1

Intermediate Measures of Music Audiation Reliabilities, Standard Errors of

Measurement, and Standard Errors of a Difference

Tonal Rhythm Composite Standard error

of a difference Reliability Reliability Reliability

Grade 1-4

2.5

Split-Halves .76 .70 .80

Test-Retest with

Raw Scores .88 .84 .91

Test-retest with

Criterion Scores .78 .76 .81

Standard Error of

Measurement 1.8 1.7 2.4

Note: Adopted from “The primary measures of music audiation and the intermediate

measures of music audiation” by E. E. Gordon, 2010, p.12. GIA Publications Manual.

STAAR reliability. TEA utilized two types of internal consistency measures:

Kuder-Richardson 20 (KR20) 33 was used for tests with only multiple-choice items and

the stratified coefficient alpha was employed for tests containing a mixture of multiple-

choice and constructed-response items. According to TEA, the STAAR reading and

mathematics assessments (summarized in Table 2) have a stratified alpha reliability

ranging from 0.87 to 0.903, showing a high level of internal consistency among the

survey items. Both of the scales’ reliabilities were generally high, exceeding the

72

conventional standard of .70 for internal consistency recommended in the literature

(Fraenkel, Wallen, & Hyun, 2012).

Table 2

STAAR Grade 6 Total Group Descriptive Data

Subject Score

Points N Mean SD Alpha* SEM

Mean P-

Value**

Reading 40 327918 25.9 8.02 0.897 2.647 64.967

Mathematics 46 337288 30.05 9.39 0.903 2.83 65.34

SD = standard deviation

N represents population

SEM = Structural Equation Modeling

* Stratified Alpha Reliability computed for tests/reporting categories involving short-answer and/or

essay questions, KR-20 reliability computed for all others

** Multiple-choice and gridded items (if applicable) only

Data Collection Procedures

According to Frankel and Wallen (2012), quantitative research is prevalent in

developing procedures relating to the comparison of variables, groups, or relating factors

about individuals or groups in experiments through correlational studies and surveys.

Collecting data and analyzing numbers that measure distinct attributes of individuals and

groups is a trend that is prevalent in today’s studies (Frankel, Wallen, & Hyun, 2012).

The data collection procedures consisted of multiple steps to ensure the ethical validity of

data collection. First, permission was obtained from the superintendent at a northeast

Texas school district. This step involved obtaining a letter of authorization that validated

the ability to conduct research within the confines of the school district (see Appendix C).

After obtaining the letter of authorization from the superintendent, an Institutional

Review Board (IRB) application was submitted to Grand Canyon University (see

Appendix G). Once approval was granted, the researcher retrieved archival performance

73

data of the 2013-2014 STAAR and IMMA from the counselor’s office. Identifiers such as

student names, identification numbers, dates of birth, and addresses were removed by the

school district and assigned an alphabet to minimize a breach of confidentiality.

The researcher assumed that proper data collection methods were taken during

both the IMMA and STAAR assessments. The IMMA assessment packet included a

paper that allowed the test administrator to run the students’ answer documents through a

photocopier, which would reveal the correct answers marked with an elongated black

oval. The test administrator then counted the number of correct responses by indicating

which student-circled answers corresponded with the black ovals. The copy-machine

method was used in grading the answer sheets in this study because of its greater

efficiency and accuracy.

Unlike the IMMA data collection procedures, the STAAR data collection

procedures required state-mandated protocols to minimize a breach of security and to

ensure the validity of the data. The test administrator began by issuing each student a

sealed test booklet and a corresponding answer document. Next, the test administrator

instructed the students to break the seal of their test booklet using their pencils. Then, the

test administrator read the instructions and directed students to begin the test. After the

four-hour time limit expired, the test administrator collected all the test booklets and

answer documents. Once collected, the test administrator reported all testing materials to

the district testing coordinator to be mailed to the state for scoring. The state assessment

division loaded each answer document into the Texas Student Data System (TSDS) to

generate the test results. Once the test results were generated, the state assessment

division notified the school district and mailed each student’s performance data.

74

Once the archived data of the IMMA and STAAR was retrieved from the

counselor, it remained in the possession of the researcher and kept in a lock box located

in the counselor’s office for three years. After three years, the researcher will shred and

dispose all documents (including electronic data) in accordance to current research

standards. Data collected was exported into the Statistical Package for Social Science

(SPSS) 23.0 software.

Data Analysis Procedures

The raw data was provided by the junior high counselor and was organized and

prepared for descriptive analysis in several ways. First, the data collected from the

IMMA and the STAAR was coded utilizing an alpha numeric coding system to minimize

a breach of confidentiality. The data from both the IMMA and STAAR assessments was

provided within the data spreadsheet. Both descriptive and inferential statistical analyses

were performed. Data screening was performed to ensure data accuracy and to confirm

the adequacy of the planned statistical test. Descriptive statistics included first testing for

normal distribution by examining the modes, means, and median for the musical aptitude

variables (from the IMMA) and academic achievement variables (from the STAAR).

Then, scatterplots was employed to identify possible outliers. According to Fraenkel,

Wallen, and Hyun (2012) scatterplots are “pictorial representations of the relationship

between two quantitative variables” (p.201). The assumption of multicollinearity was

tested by calculating correlations between predictor variables and collinearity statistics

(tolerance and variance inflation factor). In addition, a homoscedasticity analysis was

conducted to determine whether a regression model's ability to predict a variable is

consistent across all values of that variable (Fraenkel, Wallen, and Hyun, 2012).

75

Two-tailed tests and an alpha level of .05 were used for all inferential statistical

tests. This means the probability of obtaining such an outcome is only five times (or less)

in 100 (Frankel, Wallen, & Hyun, 2012). The first research question of this study was: Is

there a correlation between the level of musical aptitude and the composite and sub-

categorical performance levels of the mathematics scores on the STAAR among

beginning band students? The null hypothesis to be tested is:

H01: A correlation does not exist between musical aptitude and the composite and

sub-categorical performance levels of the mathematics section of the STAAR

among beginning band students.

To test this null hypothesis, Pearson correlations were computed. Pearson

correlations are used to measure the degree of linear relationship between two variables.

The Pearson product-moment correlation coefficient analysis is parametric in nature and

requires a population with a normal distribution. When the data for both variables is

expressed in a quantifiable method, the Pearson r is the appropriate correlation

coefficient to utilize (Fraenkel, Wallen, & Hyun, 2012). Additionally, the Pearson

product-moment coefficient establishes the possibility of relationship without

determining the relationship is causation. This statistical technique measures the strength

of a relationship between two variables within a sample.

For the first null hypothesis, one Pearson correlation was computed between the

Mathematics score from the STAAR and the composite musical aptitude scale from the

IMMA. If any of the correlations were statistically significant, the null hypothesis was

rejected.

76

The second research question was: Is there a correlation between the level of

musical aptitude and the composite and sub-categorical performance levels of the reading

scores on the STAAR among beginning band students? The corresponding null

hypothesis was:

H02: A correlation does not exist between musical aptitude and the composite and

sub-categorical performance levels of the reading section of the STAAR among

beginning band students.

This null hypothesis was tested using one additional Pearson correlation. For this

null hypothesis, the correlation between STAAR Reading test scores and scores on the

composite musical aptitude scale were computed. If any of the correlations were

statistically significant, the null hypothesis was rejected.

Ethical Considerations

Confidentiality and privacy are essential in any quality research study (Yin,

2009). This study was conducted within all the ethical considerations of the Belmont

Principles (Grand Canyon University, 2013). The participants’ standardized test scores

are protected under the Family Educational Rights and Privacy Act (FERPA), which is

intended to protect the privacy and confidentiality of student’s educational records. Grand

Canyon University Institutional Review Board (IRB) approval was required prior to the

data collection process. Once IRB approval was granted, the researcher retrieved the

participants’ standardized assessment data from the school district. In addition, site

authorization was obtained in writing from the school district.

The researcher obtained each student’s performance data. The counselor, who

managed the data and replaced with numeric IDs to ensure anonymity, removed

77

identifiers such as student names, identification numbers, dates of birth, and addresses.

Since the sample consisted of students from a vulnerable population, all information

regarding the participants remained in the possession of the researcher and kept in a lock

box located in the researcher’s office for three years to protect the confidentiality of the

students. After three years, the researcher will shred and dispose all documents, including

electronic data, in accordance with current research standards. Data collected was

imported into the Statistical Package for Social Science (SPSS) 23.0 software. The results

of this study will be published in ProQuest and presented without the identification of the

school district, selected campus, or participants. Moreover, this study will be available to

the school district and any others who wish to receive the results. Copyrights for this

study will belong to the researcher.

Limitations

According to Fraenkel, Wallen, and Hyun (2012), knowledge concerning the

limitations of a study may assist other researchers in evaluating the degree to which the

findings can be generalized. Although none of the limitations are expected to pejoratively

affect the study, the researcher identified several limitations. First, the researcher’s access

to secondary data was limited to one school district in Texas with one middle school. As

a result, the researcher identified the following consequences: (a) population size for the

grade of interests (6th grade) is N = 65, which does not meet the minimal sample size

needed to capture medium-size correlations. The study can be identified as significant

only if a large correlation exists, (b) the population may not be typical for the entire state

of Texas (low external validity), and (c) since there are significant educational

differences among states, the results of this study cannot be generalized to other states.

78

Researchers could minimize negative consequences by increasing the sample size.

Participants could be selected from several school districts in Texas. Larger sample sizes

allow researchers to better determine the average values of their data (Fraenkel, Wallen,

& Hyun, 2012). Second, the scope of this study was limited by the confines of the

instrument used to measure musical aptitude and the scope of the mathematics and

reading tests. While the study presented several unavoidable limitations, it will not

negatively affect the results of the study.

Delimitations are simply defined as boundaries set by the researcher(s) to control

the study (Fraenkel, Wallen, & Hyun, 2012). The researcher made the following

delimitations: (a) a longitudinal study may reveal different or similar relationship

patterns, however, this study was limited to the 2013-2014 school year, and (b)

correlational studies do not investigate cause and effect (Yin, 2009), therefore, causal

conclusions cannot be drawn from the study.

Summary

An overview of the selected methodology for this study was provided in this

chapter. The study employed a quantitative, correlational research methodology to

address the research questions and hypotheses. The primary focus of this research study

was to examine if, and to what degree, a correlation existed between the level of musical

aptitude and the reading and mathematics sections of the STAAR among beginning band

students. This quantitative correlational study examined the relationship between musical

aptitude, as measured by the results from the 2013-2014 IMMA, and the results from the

2013-2014 reading and mathematics performance levels of the STAAR. The sample

79

population for this study consisted of approximately 65 beginning band students from an

east Texas middle school.

A correlational research design was employed for this study. According to

Fraenkel, Wallen, and Hyun (2012), correlational research design “seeks to investigate

the extent to which one or more relationships of some type exist” (p.11). Furthermore, a

correlational research design is useful to researchers who are interested in determining to

what degree two or more variables are related. Correlation coefficients were calculated

using the Pearson r in order to analyze the strength of the linear relationships between

both variables (Fraenkel, Wallen, Hyun, 2012). A bivariate analysis was applied to

describe the statistical relationship between the independent variable, musical aptitude,

and the dependent variable, academic achievement. The findings of this study may help

school administrators, music specialists, reading and mathematics instructors find

effective ways to utilize music instruction to enhance reading and mathematics

achievement of middle school students. In addition, this study contributed to the field by

providing new information and resources relevant to musical aptitude and academic

achievement among middle school students.

Chapter 3 included all of the necessary methods in developing this research study,

including the problem statement, research questions and hypotheses. Additionally, the

chapter discussed the organization that was targeted for this study, the sample population,

and sample participants. The chapter also identified the rationale for the quantitative

methodology, instrumentation, validity, and reliability. Finally, the chapter discussed the

data collection procedures, data analysis, ethical considerations, and limitations that were

identified within the study. Chapter 4 presents the results obtained from the data

80

collection procedures in this methodology chapter. In addition, Chapter 4 describes in

detail the statistical and non-statistical information obtained from the administration of

the researcher’s instruments to the selected sample and all raw data collected from this

study.

81

Chapter 4: Data Analysis and Results

Introduction

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the musical aptitude, reading and mathematics

scores of the STAAR among beginning band students. Specifically, this quantitative

correlational study examined the relationship between musical aptitude, as measured by

the Intermediate Measures of Musical Audiation (IMMA), and the reading and

mathematics performance levels of the STAAR. Previous empirical research has

examined the relationship between musical aptitude and various academic assessments

(Cavangh, 2009; Holsomback, 2004; Rubinson, 2010). Yet, a gap existed in the literature

concerning the relationship between musical aptitude, as measured by the IMMA, and the

current standardized assessment, as measured by the reading and mathematics sections of

the STAAR (Holsomback, 2002). Based on this gap, this study examined the relationship

between musical aptitude and academic achievement of 65 sixth grade beginning band

students in northeast Texas. In addition, the following research questions and hypotheses

guided this study:

R1: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the mathematics scores on the STAAR

among beginning band students?

H1a: A statistically significant correlation exists between musical aptitude and the

composite and sub-categorical performance levels of the mathematics section of

the STAAR among beginning band students.

82

H10: There is no statistically significant correlation between musical aptitude and the

composite and sub-categorical performance levels of the mathematics section of

the STAAR among beginning band students.

R2: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the reading scores on the STAAR among

beginning band students?

H2a: A statistically significant correlation exists between musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

H20: There is no statistically significant correlation between musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

The remainder of this chapter is divided into three sections that provide a

summary of the results gleaned from the data analysis. First, a summary of the descriptive

statistics was tabulated regarding demographic characteristics of the study population.

Second, an explanation and description of the procedures is used to analyze data collected

from participants. Lastly, a presentation of the results is provided, summarizing the

results that were revealed from the statistical analysis procedures for each of the research

questions.

Descriptive Data

This study was conducted within a single school district in northeast Texas.

Permission to conduct the study was obtained from the district superintendent. According

to the 2013-2014 district’s annual report, the enrollment consisted of approximately 1,100

83

students in grades kindergarten through 12, among whom 10% were African American,

81% were Caucasian, and 9% were Hispanic. In addition, approximately 71% of the

student population was categorized as economically disadvantaged. The district comprises

three schools, one elementary school, one middle school, and one high school. The 2013-

2014 sixth grade class consisted of 65 students, who were all required to enroll in a

beginning band course and take a musical aptitude assessment. Therefore, the target

population (N = 65) consisted of sixth grade band students enrolled in the school district.

Figure 1 displays the ethnic diversity of the population from which the sample

was extracted. Three pie slices in the figure depict ethnic background and respective

percentages (%). As displayed in the figure, Caucasians represented a majority in the

population (81.2%).

Figure 1. District enrollment percentages by ethnicity for 2013-2014 school year.

From the sample of 6th grade students, 65 students participated. Based on the

ethnic profile of the sample, 8 were African American, 50 were Caucasian, and 7 were

9.8%

81.2%

9.0%

African American Caucasian Hispanic

84

Hispanic. In addition, there were 30 boys and 35 girls. Moreover, 50 students were

economically disadvantaged. Table 3 depicts the number of students by ethnicity and

gender along with respective totals and percentages (%).

Three columns were specified in Table 3; these columns headings were ethnicity,

gender, and total. For ethnicity, African American, Caucasian, and Hispanics were

represented. For gender, both boys and girls were represented. As shown in the table,

Caucasians represented the majority of students with 50 of 65 or 76.9%. Boys and girls

were mostly equally distributed with girls holding a slight majority of 53.8%.

Table 3

The Study Population: Gender and Ethnicity

Ethnicity Gender

Total/%

Boys/% Girls/%

African American 3 (37.5) 5 (62.5) 8 (12.3)

Caucasian American 23 (46.0) 27 (54.0) 50 (76.9)

Hispanic American 4 (57.1) 3 (42.9) 7 (10.8)

Total 30 (46.2) 35 (53.8) 65

For the purpose of this research study, two sources of data were used. Both

sources of data have been successfully used in other empirical studies to measure musical

aptitude and academic achievement. The study utilized archival data deriving from the

results of the 2013-2014 IMMA assessment and the 2013-2014 composite and sub-

categorical performance levels of the reading and mathematics sections of the STAAR.

Consent to conduct this study was attained from an east Texas school district and Grand

Canyon University Institutional Review Board. Student identity and confidentiality was

preserved through an alpha numeric coding system so that the identification of

85

participants cannot be breached. The data from the IMMA and STAAR assessments was

from archival school records located in the counselor’s office for the school year. The

data from both the IMMA and STAAR assessments was provided within the data

spreadsheet. Identifiers such as student names, identification numbers, dates of birth, and

addresses were removed by the counselor and assigned a numeric code to ensure

anonymity. Then, the raw data was imported into SPSS for analysis.

Data Analysis Procedure

Inferential statistics were used to draw conclusions from the sample tested. The

Statistical Package for the Social Sciences (SPSS) was used to code and tabulate scores

collected from the survey and provide summarized values where applicable including the

mean, variance, and standard deviation. A power statistical analysis was conducted using

G* Power 3.0.10. The analysis suggested that a minimum of 85 students are required for

this study. The researcher set statistical significance as α = .05 and power at .8 (β = .8) for

this study. Since this study did not meet minimum sample size, the study was identified

as significant only if a large correlation existed. After the data analysis process, the

researcher conducted a post hoc analysis to identify patterns not specified in an a priori

analysis. Post hoc power for all tests were recalculated at greater than 0.80 (See

Appendix E). Regression and multiple regression analyses were used to evaluate the two

research questions. The research questions and hypotheses were:

RQ1: Is there a correlation between the level of musical aptitude and the composite

and sub-categorical performance levels of the mathematics scores of the STAAR

among beginning band students?

86

Ho1: A correlation does not exist between the level of musical aptitude and the

composite and sub-categorical performance levels of the mathematics section

STAAR among beginning band students.

HA1: A correlation exists between the level of musical aptitude and the composite and

sub-categorical performance levels of the mathematics section of the STAAR

among beginning band students.

RQ2: Is there a correlation between the level of musical aptitude and the composite

and sub-categorical performance levels of the reading scores of the STAAR

among beginning band students?

Ho2: A correlation does not exist between the level of musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

HA2: A correlation does exist between the level of musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

Prior to analyzing the research question, data cleaning and data screening were

undertaken to ensure the variables of interest met appropriate statistical assumptions.

Thus, the following analyses were assessed using an analytic strategy in that the variables

were first evaluated for missing data, univariate and multivariate outliers, normality,

linearity, homoscedasticity, and multicollinearity. Finally, regression and multiple

regression analyses were run to determine if any relationships existed between the

variables of interest. The variables and statistical tests used to evaluate research questions

1 and 2 were displayed in Table 4.

87

Table 4

Summary of Variables and Statistical Tests used to Evaluate Research Questions 1 and 2

Hypothesis Criterion

Variable Predictor Variable Analysis

RQ1a

Musical

Aptitude

Composite

Mathematical Achievement Composite Regression

RQ1b

Musical

Aptitude

Composite

Quantitative Reasoning, Algebraic

Reasoning, Spatial Reasoning, Measurement

Statistics

Multiple

Regression

RQ2a

Musical

Aptitude

Composite

Reading Achievement Composite Regression

RQ2b

Musical

Aptitude

Composite

Understanding across Genres, Understanding

of Literary Text, Understanding of

Informational Text

Multiple

Regression

Research Question 1: data evaluation. Research question 1 was evaluated using

regression and multiple regression analyses to determine if any significant correlations

existed between students’ level of musical aptitude and the composite and sub-categorical

performance levels of the mathematics section of the STAAR. The criterion variable was

students’ musical aptitude composite scores as measured by the Intermediate Measures of

Music Addiction (IMMA). The predictor variable used in the regression analysis was

students’ mathematics achievement composite scores as measured by the State of Texas

Assessments of Academic Readiness (STAAR). For the multiple regression analysis, the

predictor variables were students’ scores on the five mathematics achievement subscales

including quantitative reasoning, algebraic reasoning, spatial reasoning, measurement,

and statistics scores.

Data cleaning. Data were collected from a sample of 65 sixth grade band students

enrolled in the school district. However, before the assumptions were assessed, the data

88

were screened for missing data, univariate outliers, and multivariate outliers. Missing

data were investigated using frequency counts and no cases were found to have missing

data. The data were screened for univariate outliers by transforming raw scores to z-

scores and comparing z-scores to a critical value of +/- 3.29, p < .001 (Tabachnick &

Fidell, 2007). Z-scores that exceed this critical value are more than three standard

deviations away from the mean and thus represent outliers. The distributions were

evaluated and no cases with univariate outliers were found.

For the multiple regression analysis, multivariate outliers were evaluated using

Mahalanobis distance. Mahalanobis distances were computed for each variable and these

scores were compared to a critical value from the chi-square distribution table.

Mahalanobis distance for five predictor variables indicates a critical value of 20.52 and

no cases were found to exceed this value. Thus, 65 responses from participants were

received and 65 were evaluated by the regression and multiple regression models (n =

65). Descriptive statistics of the criterion and predictor variables are displayed in Table 5.

Table 5

Descriptive Statistics of the Criterion and Predictor Variables

Variable N Min Max Mean Std.

Deviation Skewness Kurtosis

Musical Aptitude Composite 65 39 80 65.169 8.774 -0.668 0.203

Mathematics Achievement

Composite 65 1399 1874 1599.631 115.088 0.297 -0.510

Quantitative Reasoning 65 3 16 9.800 2.954 -0.001 -0.426

Algebraic Reasoning 65 0 11 6.185 2.984 -0.232 -0.879

Spatial Reasoning 65 1 8 5.292 2.075 -0.318 -0.847

Measurement 65 1 8 4.539 1.888 -0.220 -0.918

Statistics 65 1 8 4.923 2.041 -0.291 -1.122

Note. Total N = 65

Normality. Before the research question was analyzed, basic parametric

assumptions were assessed. That is, for the criterion (musical aptitude composite) and

89

predictor variables (mathematics achievement composite, quantitative reasoning,

algebraic reasoning, spatial reasoning, measurement, and statistics) assumptions of

normality, linearity, homoscedasticity, and multicollinearity were tested. The variables

were analyzed for linearity and homoscedasticity using scatterplots and the distributions

met the assumptions. To test if the distributions were normally distributed, the skew and

kurtosis coefficients were divided by the skew/kurtosis standard errors, resulting in z-

skew/z-kurtosis coefficients. This technique was recommended by Tabachnick and Fidell

(2007). Specifically, z-skew/z-kurtosis coefficients exceeding the critical range between -

3.29 and +3.29 (p < .001) may indicate non-normality. Thus, based on the evaluation of

the z-skew/z-kurtosis coefficients, no variables exceeded the critical range. Therefore,

normality was assumed for all distributions. Displayed in Table 6 are skewness and

kurtosis statistics of the criterion and predictor variables used to evaluate research

question 1.

Table 6

Skewness and Kurtosis Statistics of the Criterion and Predictor Variables Used to

Evaluate Research Question 1

Variable Skewness Skewness

Std. Error

z-

skew Kurtosis

Kurtosis

Std. Error

z-

kurtosis

Musical Aptitude Composite -0.668 0.297 -2.249 0.203 0.586 0.346

Mathematics Achievement

Composite 0.297 0.297 1.000 -0.510 0.586 -0.870

Quantitative Reasoning -0.001 0.297 -0.003 -0.426 0.586 -0.727

Algebraic Reasoning -0.232 0.297 -0.781 -0.879 0.586 -1.500

Spatial Reasoning -0.318 0.297 -1.071 -0.847 0.586 -1.445

Measurement -0.220 0.297 -0.741 -0.918 0.586 -1.567

Statistics -0.291 0.297 -0.980 -1.122 0.586 -1.915

Note. Total N = 65

Multicollinearity. The assumption of multicollinearity was tested by calculating

correlations between predictor variables (quantitative reasoning, algebraic reasoning,

90

spatial reasoning, measurement, and statistics) and collinearity statistics (tolerance and

variance inflation factor). Results indicated that correlations between predictor variables

did not exceed the critical value of .80. Tolerance was calculated using the formula T = 1

– R2 and variance inflation factor (VIF) is the inverse of Tolerance (1 divided by T).

Commonly used cut-off points for determining the presence of multicollinearity are T <

.10 and VIF > 10. Results indicated that the predictor variables did not exceed the critical

values. Thus, since the correlation, tolerance, and VIF coefficients did not exceed their

critical values, the presence of multicollinearity was not assumed. Displayed in Table 7

are summary details of the correlations between predictor variables used to evaluate

research question 1.

Table 7

Summary of Correlations between Predictor Variables used in Research Question 1

Predictor Variable Quantitative

Reasoning

Algebraic

Reasoning

Spatial

Reasoning Measurement Statistics

Quantitative Reasoning 1.000 .712 .634 .594 .697

Algebraic Reasoning 1.000 .713 .631 .726

Spatial Reasoning 1.000 .641 .670

Measurement 1.000 .737

Statistics 1.000

Note. Total N = 65

Research Question 2: data evaluation. Research question 2 was evaluated using

regression and multiple regression analyses to determine if any significant correlations

existed between students’ level of musical aptitude and the composite and sub-categorical

performance levels of the reading section of the STAAR. The criterion variable was

students’ musical aptitude scores as measured by the Intermediate Measures of Music

Addiction (IMMA). The predictor variable used in the regression analysis was students’

reading achievement composite scores as measured by the State of Texas Assessments of

91

Academic Readiness (STAAR). For the multiple regression analysis, the predictor

variables were students’ scores on the three reading achievement subscales including

understanding/analysis across genres, understanding of literary text, and

understanding/analysis of informational text.

Data cleaning. Before the assumptions were assessed, the data were screened for

missing data, univariate outliers and multivariate outliers. Results indicated there were no

cases with missing data and no univariate outliers were found. For the multiple regression

analysis, multivariate outliers were evaluated using Mahalanobis distance. Mahalanobis

distance for three predictor variables indicates a critical value of 16.27 and no cases were

found to exceed this value. Therefore, responses from 65 participants received and 65

were evaluated by the regression and multiple regression models (N = 65). Descriptive

statistics of the criterion and predictor variables are displayed in Table 8.

Table 8

Descriptive Statistics of the Criterion and Predictor Variables used to Evaluate Research

Question 2

Predictor Variable Min Max Mean

Std.

Deviatio

n

Skewnes

s

Kurtosi

s

Musical Aptitude Composite 39 80 65.169 8.774 -0.668 0.203

Reading Achievement Composite 1327 1864 1591.01

5 109.845 0.112 -0.070

Understanding Analysis across Genres 1 10 6.692 2.106 -0.490 -0.240

Understanding of Literary Text 2 20 13.385 3.896 -0.403 -0.214

Understanding Analysis of Informational

Text 5 17 12.015 3.319 -0.263 -0.862

Note. Total N = 65

Normality. Before the research question was analyzed, basic parametric

assumptions were assessed. That is, for the criterion (musical aptitude composite) and

predictor variables (understanding across genres, understanding of literary text, and

92

understanding of informational text) assumptions of normality, linearity,

homoscedasticity, and multicollinearity were tested. The variables were analyzed for

linearity and homoscedasticity using scatterplots and the distributions met the

assumptions. To test if the distributions were normally distributed, the skew and kurtosis

coefficients were divided by the skew/kurtosis standard errors, resulting in z-skew/z-

kurtosis coefficients. Results indicated that no variables exceeded the critical range (< -

3.29 and > 3.29). Therefore, normality was assumed for all distributions. Displayed in

Table 9 are skewness and kurtosis statistics of the criterion and predictor variables used

to evaluate research question 2.

Table 9

Skewness and Kurtosis Statistics of the Criterion and Predictor Variables Used to

Evaluate Research Question 2

Variable Skewness Skewness

Std. Error z-skew Kurtosis

Kurtosis

Std.

Error

z-kurtosis

Musical Aptitude Composite -0.668 0.297 -2.249 0.203 0.586 0.346

Reading Achievement

Composite 0.112 0.297 0.377 -0.070 0.586 -0.119

Understanding Analysis across

Genres -0.490 0.297 -1.650 -0.240 0.586 -0.410

Understanding of Literary Text -0.403 0.297 -1.357 -0.214 0.586 -0.365

Understanding Analysis of

Informational Text -0.263 0.297 -0.886 -0.862 0.586 -1.471

Note. Total N = 65

Multicollinearity. The assumption of multicollinearity was tested by calculating

correlations between predictor variables (understanding across genres, understanding of

literary text, and understanding of informational text) and collinearity statistics (tolerance

and variance inflation factor). Results indicated that correlations between predictor

variables did not exceed the critical value of .80. For tolerance and variance inflation

factor (VIF), results indicated that the predictor variables did not exceed the critical

93

values (T < .10 and VIF > 10). Thus, since the correlation, tolerance, and VIF

coefficients did not exceed their critical values, the presence of multicollinearity was not

assumed. Displayed in Table 10 are summary details of the correlations between

predictor variables used to evaluate research question 2.

Table 10

Summary of Correlations between Predictor Variables used in Research Question 2

Pearson Correlation

Predictor Variable 1 2 3

Understanding Analysis across Genres (1) 1.000 0.632 0.765

Understanding of Literary Text (2) 1.000 0.683

Understanding Analysis of Informational Text (3) 1.000

Note. Total N = 65

Results

This study consisted of two overarching research questions, each associated with

a pair of null and alternative hypothesis. In this section, the researcher utilizes the results

of the statistical analyses to answer each research question.

Results of Research Question 1. Regression and multiple regression analyses

using the general linear model were used to determine if any significant correlations

existed between students’ level of musical aptitude and the composite and sub-categorical

performance levels of the mathematics section of the STAAR. The criterion variable was

specified as students’ musical aptitude composite scores as measured by the Intermediate

Measures of Music Addiction (IMMA). The predictor variable specified in the regression

analysis was students’ mathematics achievement composite scores as measured by the

State of Texas Assessments of Academic Readiness (STAAR). The predictor variables

for the multiple regression analysis were students’ scores on the five mathematics

achievement subscales including quantitative reasoning, algebraic reasoning, spatial

94

reasoning, measurement, and statistics scores while the criterion variable was musical

aptitude composite scores as measured by the Intermediate Measures of Music Addiction

(IMMA). The null and alternative hypotheses were:

Ho1: A correlation does not exist between the level of musical aptitude and the

composite and sub-categorical performance levels of the mathematics section

STAAR among beginning band students.

HA1: A correlation exists between the level of musical aptitude and the composite and

sub-categorical performance levels of the mathematics section of the STAAR

among beginning band students.

Using SPSS 23, regression and multiple regression analyses was used to

determine if students’ musical aptitude scores were significantly correlated to the

composite and sub-categorical performance levels of the mathematics section of the

STAAR. Results from the regression analysis indicated there was a significant

relationship between students’ musical aptitude and mathematics achievement composite

scores, R = .673, R2 = .453, F (1, 63) = 52.132, p < .001. That is, 45.3% (R2 = .453) of the

variance observed in the criterion variable (musical aptitude) was due to the predictor

variable (mathematic achievement composite). Based on findings, the null hypothesis

was rejected in favor of the alternative. A model summary of the regression analysis was

displayed in Table 11.

95

Table 11

Model Summary of Regression for Research Question 1

Source R R2 Standard Error F Sig.

(p)

Omnibus .673 .453 6.542 52.132 < .001

Unstandardized

Coefficients

Standardized

Coefficients

Source B Std.

Error Beta t

Sig.

(p)

Part

Correlation

(Constant) -16.895 11.395 -1.483 .143

Mathematic Achievement

Composite 0.051 0.007 0.673 7.220 < .001 .673

Note. Criterion variable = musical aptitude; N = 65

Figure 2 displays the regression plot depicting the relationship between

mathematics and musical aptitude. The scatterplot displays a strong and significant

relationship between the two variables. The regression equation is Ý = -16.895 + .051*x.

This means that for every one unit increase in mathematics scores, musical aptitude

increases by .051.

96

Figure 2. Scatterplot depicting the relationship between mathematics achievement and

musical aptitude.

Results from the multiple regression analysis indicated there was a significant

relationship between students’ musical aptitude scores and five mathematics achievement

subscales (quantitative reasoning, algebraic reasoning, spatial reasoning, measurement,

and statistics), R = .709, R2 = .502, F (5, 59) = 11.915, p < .001. That is, 50.2% (R2 =

.502) of the variance observed in the criterion variable (musical aptitude) was due to a

model containing five predictor variables. Thus, the null hypothesis for research question

1 was rejected in favor of the alternative hypothesis. A model summary of the multiple

regression analysis is displayed in Table 12.

97

Table 12

Model Summary of Multiple Regression for Research Question 1

Source R R2 Standard Error F Sig. (p)

Omnibus .709 .502 6.446 11.915 < .001

Unstandardized

Coefficients

Standardized

Coefficients

Source B Std. Error Beta t Sig. (p) Part

Correlation

(Constant) 47.191 2.927 16.124 < .001

Quantitative Reasoning 0.477 0.425 0.161 1.124 .266 .103

Algebraic Reasoning 0.341 0.465 0.116 0.734 .466 .067

Spatial Reasoning 1.807 0.606 0.427 2.981 .004 .274

Measurement -0.450 0.664 -0.097 -0.678 .500 -.062

Statistics 0.746 0.707 0.174 1.055 .296 .097

Note. Criterion variable = musical aptitude; N = 65

The contribution of each predictor variable when the others were controlled for

was evaluated using the standardized Beta coefficient. That is, results indicated that one

predictor variable (spatial reasoning) made a significantly unique contribution in

explaining the criterion variable (standardized B = 0.427, p = .004). Specifically, there

was a significant and positive relationship between students’ musical aptitude and spatial

reasoning scores. The part correlation coefficient (part correlation = .274) indicated that

7.5% (part correlation2 = .075) of the variance observed in the criterion variable (musical

aptitude) was due to students’ spatial reasoning scores. The remaining predictor variables

did not make a significantly unique contribution in explaining the criterion variable

(quantitative reasoning p = .266, algebraic reasoning p = .466, measurement p = .500, and

statistics p = .296).

Results of Research Question 2. Research question 2 was evaluated using

regression and multiple regression analyses to assess if any significant correlations

existed between students’ level of musical aptitude and the composite and sub-categorical

98

performance levels of the reading section of the STAAR. The null and alternative

hypotheses were:

Ho2: A correlation does not exist between the level of musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

HA2: A correlation does exist between the level of musical aptitude and the composite

and sub-categorical performance levels of the reading section of the STAAR

among beginning band students.

Using SPSS 23, regression and multiple regression analyses was used to

determine if students’ musical aptitude scores were significantly correlated to the

composite and sub-categorical performance levels of the reading section of the STAAR.

Results from the regression analysis indicated there was a significant relationship

between students’ musical aptitude and reading achievement composite scores, R = .848,

R2 = .718, F(1, 63) = 160.722, p < .001. That is, 71.8% (R2 = .718) of the variance

observed in the criterion variable (musical aptitude) was due to the predictor variable

(reading achievement composite). A model summary of the regression analysis is

displayed in Table 13.

99

Table 13

Model Summary of Regression for Research Question 2

Source R R2 Standard Error F Sig.

(p)

Omnibus .848 .718 4.693 160.722 <

.001

Unstandardized

Coefficients

Standardized

Coefficients

Source B Std.

Error Beta t

Sig.

(p)

Part

Correlation

(Constant) -42.548 8.517 -4.996 <

.001

Reading Achievement

Composite 0.068 0.005 0.848 12.678

<

.001 .848

Note. Criterion variable = musical aptitude; N = 65

Figure 3 displays the regression plot depicting the relationship between reading

achievement and musical aptitude. The scatterplot displays a strong and significant

relationship between the two variables. The regression equation is Ý = -42.548 + .068*x.

This means that for every one unit increase in mathematics scores, musical aptitude

increases by .068.

100

Figure 3. Scatterplot depicting the relationship between reading achievement and musical

aptitude.

Results from the multiple regression analysis indicated there was a significant

relationship between students’ musical aptitude scores and three reading achievement

subscales (understanding across genres, understanding of literary text, and understanding

of informational text), R = .861, R2 = .740, F(3, 61) = 58.022, p < .001. That is, 74.0%

(R2 = .740) of the variance observed in the criterion variable (musical aptitude) was due

to a model containing three predictor variables. Thus, the null hypothesis for research

question 2 was rejected in favor of the alternative hypothesis. A model summary of the

multiple regression analysis is displayed in Table 14.

101

Table 14

Model Summary of Multiple Regression for Research Question 2

Source R R2 Standard

Error F

Sig.

(p)

Omnibus .861 .740 4.578 58.022 < .001

Unstandardized

Coefficients

Standardized

Coefficients

Source B Std.

Error Beta t

Sig.

(p)

Part

Correlation

(Constant) 36.166 2.285 15.829 < .001

Understanding across Genres 1.129 0.434 0.271 2.602 .012 .170

Understanding of Literary Text 0.687 0.207 0.305 3.324 .002 .217

Understanding of

Informational Text 1.019 0.292 0.386 3.486 .001 .227

Note. Criterion variable = musical aptitude; N = 65

The contribution of each predictor variable when the others were controlled for

was evaluated using the standardized Beta coefficient. Results indicated that all three

predictor variables (understanding across genres, understanding of literary text, and

understanding of informational text) made significantly unique contributions in

explaining the criterion variable (musical aptitude). Specifically, students’ understanding

of informational text scores made the strongest unique contribution (standardized B =

0.386, p = .001) followed by understanding of literary text scores (standardized B =

0.305, p = .002), and then understanding across genres scores (standardized B = 0.271, p

= .012). That is, there were significant and positive relationships between students’

musical aptitude and all three reading achievement subscales. The part correlation

coefficients indicated that 2.9% (part correlation = .170, part correlation2 =.029) of the

variance observed in the criterion variable (musical aptitude) was due to students’

understanding across genre scores and 4.7% was due to understanding of literary text

scores (part correlation = .217, part correlation2 =.047). Lastly, 5.2% of the variance

102

observed in the criterion variable was due to students’ understanding of informational

text scores (part correlation = .227, part correlation2 = .052).

Summary

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the level of musical aptitude, reading, and

mathematics scores of the STAAR among beginning band students. The study consisted

of 65 sixth grade beginning band students whom are all required to enroll in a beginning

band course and take a musical aptitude assessment. Secondary data (IMMA and

STAAR) were obtained from a northeast Texas school district.

This chapter presented the descriptive statistics and statistical analysis such as

regression, and multiple regression analyses to measure the relationship between

variables. The chapter began by describing the sample population using descriptive

statistics followed by data analysis procedures. In addition, there were no data limitations

that emerged based on data analysis. Findings from analyzing the data were significant:

For H1a, results from the regression analysis indicated there was a significant

relationship between students’ musical aptitude and mathematics achievement composite

scores. For H1b, results from the multiple regression analysis indicated there was a

significant relationship between students’ musical aptitude scores and five mathematics

achievement subscales (quantitative reasoning, algebraic reasoning, spatial reasoning,

measurement, and statistics). For H2a, results from the regression analysis indicated there

was a significant relationship between students’ musical aptitude and reading

achievement composite scores. For H2b, results from the multiple regression analysis

indicated there was a significant relationship between students’ musical aptitude scores

103

and three reading achievement subscales (understanding across genres, understanding of

literary text, and understanding of informational text). Table 15 displays summary

findings for Hypotheses 1 and 2. Post hoc power for all tests were calculated at greater

than 0.8.

Table 15

Summary of Results for Hypotheses 1 and 2

Hypothesis Criterion

Variable Predictor Variable Analysis Sig. (p)

Post Hoc

Power

H1a

Musical

Aptitude

Composite

Math Achievement

Composite Regression < .001 0.801

H1b

Musical

Aptitude

Composite

Quantitative Reasoning,

Algebraic Reasoning,

Spatial Reasoning,

Measurement, Statistics

Multiple

Regression < .001 0.855

H2a

Musical

Aptitude

Composite

Reading Achievement

Composite Regression < .001 0.801

H2b

Musical

Aptitude

Composite

Understanding across

Genres, Understanding of

Literary Text,

Understanding of

Informational Text

Multiple

Regression < .001 0.855

Note. Total N = 65

In the final chapter of this study, Chapter 5 will provide an overview of the

importance of this study and its contribution to the understanding of the topic. Chapter 5

will also reiterate the two research questions, and provide conclusions and

recommendations based on the description of the data findings related to the research

questions and hypotheses. Chapter 5 will also discuss the specific findings of this study,

the theoretical and future implications, suggestions on musical training, and

recommendations for future research.

104

Chapter 5: Summary, Conclusions, and Recommendations

Introduction

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the musical aptitude, reading and mathematics

scores of the STAAR among sixth grade beginning band students from a middle school in

northeast Texas. Similar empirical studies have examined the relationship between

musical aptitude and previous standardized assessments in Texas (Holsomback, 2001;

Holsomback, 2002). Yet, a gap in the literature existed concerning the relationship

between the levels of musical aptitude and the current standardized assessment in Texas.

Based on this gap in the literature, the following research questions and hypotheses

guided this study:

R1: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the mathematics scores on the STAAR

among beginning band students?

H1a: A statistically significant correlation exists between musical aptitude and the

composite and sub-categorical performance levels of the mathematics section of

the STAAR among beginning band students.

H10: There is no statistically significant correlation between musical aptitude and the

composite and sub-categorical performance levels of the mathematics section of

the STAAR among beginning band students.

R2: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the reading scores on the STAAR among

beginning band students?

105

H2a: A statistically significant correlation exists between musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

H20: There is no statistically significant correlation between musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

By examining such relationships, school administrators, teachers, and policy

holders are capable of identifying tactics that may improve test results. Understanding the

relationship between musical aptitude and academic achievement was essential in finding

effective ways to utilize music instruction to enhance reading and mathematics

achievement of middle school students. The findings from this study have opened

opportunities for further research in this area.

This chapter provides a summary of the study, conclusions, limitations, and future

theoretical and practical implications. The conclusions and findings associated with each

research question and hypothesis were examined. The researcher’s observations and

summary of the study were noted. In addition, this chapter concludes with a discussion of

recommendations for future practice and research.

Summary of the Study

The researcher embarked on this study in an attempt to understand the

relationship between the levels of musical aptitude and the current standardized

assessment in Texas. This study focused exclusively on 65 sixth grade beginning band

students in one northeast Texas school as the district requires all sixth grade students to

enroll in a beginning band course and take the State of Texas Assessment of Academic

106

Readiness (STAAR). Although compelling empirical evidence examined the relationship

between the levels of musical aptitude and previous standardized assessments

(Holsomback, 2001; Rubinson, 2010), as of 2015 there has been no research concerning

the correlation between musical aptitude and the State of Texas Assessments of

Academic Readiness (STAAR) among sixth grade beginning band students. Therefore,

the purpose of this quantitative correlational study was to examine if, and to what degree,

a correlation existed between the musical aptitude and reading and mathematics scores of

the STAAR among beginning band students.

The findings of this study advanced the understanding of the relationship between

musical aptitude and academic achievement among beginning band students. The

collection of data from this study added to the literature in this area by broadening the

knowledge surrounding the problem statement. Moreover, the data extended the literature

relating music to cognitive abilities by examining the correlation of musical aptitude to

specific areas of academic performance.

Chapter 1 provided an introduction and rationale for the study and presented the

research questions that were used to justify the purpose and address the current lack of

research on musical aptitude and academic achievement. A discussion of how it will

advance scientific knowledge and the significance of the study was also presented. The

researcher provided the rationale for the selected methodology and research design. The

chapter concluded with definitions of research terms, assumptions, limitations, and the

study’s delimitations.

Chapter 2 synthesized the foundational and current literature related to musical

aptitude and academic achievement. In addition, chapter 2 presented an organized

107

literature review covering the background to the problem, the theoretical framework

providing the foundation to the study, and various topics and themes within those topics

related to the study. Specifically, chapter 2 included a discussion of musical aptitude in

relation to reading and mathematics achievement. Chapter 2 provided a synopsis of the

gaps in prior research and methodological strengths and weaknesses found in earlier

studies.

Chapter 3 presented an overview of the selected methodology and research

design. A quantitative methodology and a correlational research design were employed as

they are useful to researchers who are interested in determining the extent to which two

or more variables are related. The researcher utilized secondary data from one school

district in northeast Texas to determine the extent of the relationship between the levels

of musical aptitude and the composite and sub-categorical performance levels of the

STAAR. Chapter 3 outlined the data analysis procedures needed to process the data and

answer each research question and hypothesis.

Chapter 4 presented the descriptive statistics and statistical analysis such as

regression and multiple regression analyses to measure the relationship between

variables. The chapter began by describing the sample population using descriptive

statistics followed by data analysis procedures. Results from the data analyses were then

used to answer the research questions and hypotheses.

Chapter 5 provides an overview of the importance of this study and its

contribution to the understanding of the topic. Chapter 5 reiterates the two research

questions, and provided conclusions and recommendations based on the description of

the data findings related to the research questions and hypotheses. Chapter 5 discusses the

108

specific findings of this study, the theoretical and future implications, suggestions on

musical training, and recommendations for future research.

Summary of Findings and Conclusion

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the musical aptitude, reading and mathematics

scores of the STAAR among beginning band students. Specifically, this quantitative

correlational study examined the relationship between musical aptitude, as measured by

the Intermediate Measures of Music Audiation (IMMA Gordon, 2002), and the reading

and mathematics performance levels of the STAAR. In order to attain this objective, this

study presented two research questions, each supported by an alternative hypothesis and a

null hypothesis.

The first research question was:

R1: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the mathematics scores of the STAAR

among beginning band students?

The following hypotheses were associated with this research question:

H1: A statistically significant correlation exists between the level of musical aptitude

and the composite and sub-categorical performance levels of the mathematics

section STAAR among beginning band students.

H10: There is no statistically significant correlation between the level of musical

aptitude and the composite and sub-categorical performance levels of the

mathematics section of the STAAR among beginning band students.

Research question 1 was evaluated using regression and multiple regression

109

analyses to determine if any significant correlations existed between students’ level of

musical aptitude and the composite and sub-categorical performance levels of the

mathematics section of the STAAR. The criterion variable was students’ musical aptitude

composite scores as measured by the Intermediate Measures of Music Addiction

(IMMA). The predictor variable used in the regression analysis was students’ mathematic

achievement composite scores as measured by the State of Texas Assessments of

Academic Readiness (STAAR). For the multiple regression analysis, the predictor

variables were students’ scores on the five mathematic achievement subscales including

quantitative reasoning, algebraic reasoning, spatial reasoning, measurement, and statistics

scores. Results from the regression analysis indicated there was a significant relationship

between students’ musical aptitude and mathematics achievement composite scores, R =

.673, R2 = .453, F (1, 63) = 52.132, p < .001. That is, 45.3% (R2 = .453) of the variance

observed in the criterion variable (musical aptitude) was due to the predictor variable

(mathematic achievement composite). In addition, results from the multiple regression

analysis indicated there was a significant relationship between students’ musical aptitude

scores and five mathematics achievement subscales (quantitative reasoning, algebraic

reasoning, spatial reasoning, measurement, and statistics), R = .709, R2 = .502, F (5, 59) =

11.915, p < .001. That is, 50.2% (R2 = .502) of the variance observed in the criterion

variable (musical aptitude) was due to a model containing five predictor variables. Based

on findings, the null hypothesis was rejected in favor of the alternative.

The second research question was:

110

R2: Is there a correlation between the level of musical aptitude and the composite and

sub-categorical performance levels of the reading scores of the STAAR among

beginning band students?

H2: A statistically significant correlation exists between musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

H20: There is no statistically significant correlation between musical aptitude and the

composite and sub-categorical performance levels of the reading section of the

STAAR among beginning band students.

Research question 2 was evaluated using regression and multiple regression

analyses to determine if any significant correlations existed between students’ level of

musical aptitude and the composite and sub-categorical performance levels of the reading

section of the STAAR. The criterion variable was students’ musical aptitude scores as

measured by the Intermediate Measures of Music Addiction (IMMA). The predictor

variable used in the regression analysis was students’ reading achievement composite

scores as measured by the State of Texas Assessments of Academic Readiness (STAAR).

For the multiple regression analysis, the predictor variables were students’ scores on the

three reading achievement subscales including understanding/analysis across genres,

understanding of literary text, and understanding/analysis of informational text. Results

from the regression analysis indicated there was a significant relationship between

students’ musical aptitude and reading achievement composite scores, R = .848, R2 =

.718, F(1, 63) = 160.722, p < .001. That is, 71.8% (R2 = .718) of the variance observed in

the criterion variable (musical aptitude) was due to the predictor variable (reading

111

achievement composite). In addition, results from the multiple regression analysis

indicated there was a significant relationship between students’ musical aptitude scores

and three reading achievement subscales (understanding across genres, understanding of

literary text, and understanding of informational text), R = .861, R2 = .740, F(3, 61) =

58.022, p < .001. That is, 74.0% (R2 = .740) of the variance observed in the criterion

variable (musical aptitude) was due to a model containing three predictor variables. Thus,

the null hypothesis for research question 2 was rejected in favor of the alternative

hypothesis.

In conclusion, the findings of this study suggested a strong correlation existed

between the level of musical aptitude and the composite and sub-categorical performance

levels of the reading and mathematics section of the STAAR among sixth grade

beginning band students. The findings of this study may help school administrators,

music specialists, and reading and mathematics instructors find effective ways to utilize

music instruction to enhance reading and mathematics achievement of middle school

students. In addition, this study contributed to the field by providing new information and

resources relevant to musical aptitude and academic achievement among middle school

students.

The study provided empirical evidence that correlations do exist between musical

aptitude and academic achievement, which was predictable under Gardner’s theory of

multiple intelligence (2006) and Gordon’s music learning theory (1971) as they both are

ultimately designed to enhance musical intelligence. Although Gordon’s (2003) and

Kuhlman’s (2005) premise were not supported in this research, Holsomback’s, (2001);

Kuhlman’s, (2005); and Rubinson’s (2010) inferences were partially supported.

112

Specifically, although the omnibus test revealed a significant relationship between

mathematics achievement and musical aptitude, not all of the five mathematics

achievement subscales (quantitative reasoning, algebraic reasoning, spatial reasoning,

measurement, and statistics) yielded significant predicative findings. That is, only spatial

reasoning was found to significantly predict musical aptitude. Quantitative reasoning,

algebraic reasoning, measurement, and statistics were not found to predict musical

aptitude.

Interestingly, and consistent with other studies, Dastjerdi, Ozker, Foster, ,

Rangarajan, , and Parvizi (2013) found that electrical activity in a particular group of

nerve cells in the intraparietal sulcus spiked when, and only when, volunteers were

performing calculations. The area of the brain responsible for this activity is different

from the area of the brain that is responsible for spatial reasoning and reading

(Anthamatten, 2010). This evidence suggests that there is, perhaps, a biological reason

why math ability may not be as related to musical aptitude as reading or spatial

reasoning. Findings from this study, in conjunction with Dastjerdi et al. (2013) findings,

may in fact provide the impetus to change researcher’s perception about the relationship

between mathematics, reading, and music aptitude.

Comparatively, findings from testing hypothesis 2 supported the premise that

musical aptitude is related to knowledge and skills, as posited by Holsomback (2001).

Findings from hypothesis 2 also provided some credibility to Gordon’s (2003) and

Kuhlman’s (2005) assertion that music programs positively affect musical aptitude.

Specifically, this study confirmed a relationship between reading (and associated reading

113

sub-constructs) and musical aptitude. This study does not confirm music program

efficacy, which may affect musical aptitude and promote reading achievement.

Implications

The purpose of this quantitative correlational study was to examine if, and to what

degree, a correlation existed between the musical aptitude, reading, and mathematics

scores of the STAAR among beginning band students. There are a number of

implications based on the results of this study. Some of the implications are associated

with the theoretical framework upon which the research was built. Moreover, practical

and future implications should be considered as they may be meaningful to researchers

and educators.

Theoretical implications. The theoretical implications encompass the

interpretation of data in terms of the research questions. In addition, the theoretical

implications encompass the interpretation of findings in the existing literature framework.

The research study was guided by two overarching research questions. Both questions

examined the relationship between the level of musical aptitude and the composite and

sub-categorical performance levels of the reading and mathematics scores, respectively,

on the STAAR among beginning band students. The results provided an answer to both

research questions, revealing some theoretical implications. The findings indicated that a

positive correlation existed between level of musical aptitude and the composite and sub-

categorical performance levels of the STAAR among beginning band students. The

study’s findings did support scholarly research concerning the relationship between

musical aptitude and academic achievement (Holsomback 2001; Holsomback, 2002;

Rubinson, 2010). In addition, Gardner’s theory of multiple intelligence (2006) identified

114

musical intelligence as one of eight different types of intelligence. Musical learning and

participation, which incorporates instruction geared toward the plurality of various

intelligences and aptitudes, may ultimately have a positive effect on reading and

mathematical achievement. Supportively, Gordon’s (1971) musical learning theory

analyzes musical learning processes; it was ultimately designed to enhance musical

intelligence. Both theoretical implications supported the research findings that

correlations existed between musical aptitude and reading and mathematical performance

levels of the STAAR among beginning band students. The results of this study are

consistent with Holsomback’s (2001) and Rubinson’s (2010) findings that positive

correlations existed between musical aptitude and various standardized academic

assessments. The study’s theoretical implications and findings may assist teachers,

administrators, and educational policy makers in developing effective methods for

increasing reading and mathematics achievement of middle school students.

Practical implications. As previously stated in the literature review, more than

two-thirds of students in the United States are not proficient in reading and mathematics

(Campbell, Malkus, 2011; Hemmings, Grootenboer, Kay, 2011; National Assessment of

Educational Progress, 2009). More staggering, the statistical evidence of academic

proficiency for minority and lower socioeconomic students is even lower (Arbona &

Jimenez, 2014; Morales, 2010; Olszewski-Kubilius & Thomson, 2010). This evidence

prompted the researcher to examine the relationship between the level of musical aptitude

and academic achievement among beginning band students.

This study indicated a strong and statistical relationship existed between the level

of musical aptitude and both reading and mathematics achievement. The findings of this

115

study have several practical implications regarding musical aptitude testing and its

relationship to reading and mathematical achievement. First, Gordon (2001) argued that a

person’s musical aptitude would greatly influence how he or she learns to develop

audiation skills. The primary purpose of musical aptitude testing, then, is “to enable

teachers and parents to adapt music guidance and instruction to each child’s individual

needs” (Gordon, 2003, p. 13). Assessing a student’s musicality based upon his or her

academic achievement is usually misleading, and musical aptitude tests can prove

especially useful when scores do not match an educator’s expectations. The awareness of

a student’s musical aptitude and academic achievement, in comparison to his or her peers

as well as identifying a student’s strengths and weaknesses (such as differences in tonal

and rhythmic aptitude) allows teachers and administrators to provide aligned learning

objectives and activities.

Future implications. There are several future implications from the findings of

this research. Based on the finding of this study and similar studies previously mentioned

in the Literature Review, researchers should examine school districts with different

demographics and ethnic backgrounds. Analyzing the same variables with different

populations would provide expanded insight about the merit of these findings.

Furthermore, analyzing other grade levels would provide insight on the significance of

the variables. Another implication is the need for school districts to support music

education programs. Recent empirical evidence indicated that arts programs, particularly

music, are being eradicated at an exponential rate (Major, 2013; Slaton, 2012; Spohn,

2008). If early skills in reading are strongly related to the auditory processing of speech

components, then strengthening auditory analysis skills necessary for increasing musical

116

aptitude may be a valuable tool in reinforcing reading and mathematics comprehension of

middle school students (Rubinson, 2010). This notion ultimately may have a positive

effect on standardized assessments.

Strengths and weaknesses. This study displayed strengths and weaknesses based

on the methodology, research design, and data. This study provided a robust analysis by

examining the composite and sub-dimensions of the reading and mathematics sections of

the STAAR to the level of musical aptitude. However, the researcher’s accessibility to

secondary data is limited to one school district in Texas that has only one middle school

which affected the sample size. It was determined that a larger sample size was required.

Population size for the grade of interest (6th grade) was N = 65, which did not meet the

minimal sample size needed to capture medium-size correlations. The findings were

identified as significant only as a large correlation existed between the level musical

aptitude and the composite and sub-categorical performance levels of the reading and

mathematics sections of the STAAR. In addition, as there are significant educational

differences among states, the results of this study cannot be generalized to other states.

Recommendations

Recommendations for future research. Although there are many research

studies that examine musical aptitude and academic achievement respectively, there are

limited studies correlating the level of musical aptitude to academic achievement. As a

result of limited research regarding the findings of this study, the recommendations for

further research are as follows:

1. Perform a similarly designed study with a significantly larger sample. Students could be selected from several school districts in Texas. Larger sample sizes

allow researchers to better determine the average values of their data (Fraenkel,

Wallen, Hyun, 2012).

117

2. Replicate this study with a different academic assessment. As states adopt new academic standards, different states are creating new standardized assessments for

reading and mathematics for all grade levels. As a result, this study needs to be

replicated using a new academic assessment from respective states.

3. Perform a similarly designed study with a different grade level to check for similarities in findings. Broadening the scope could show more dynamics of

information regarding the relationship between musical aptitude and academic

achievement.

4. Conduct a longitudinal study to examine if the level of musical aptitude correlates to reading and mathematics achievement over a period of years.

5. Perform a similarly designed study utilizing a point biserial correlation coefficient to determine the relationship between musical aptitude and academic achievement

among students of different genders, ethnic, and socioeconomic backgrounds. By

investigating populations form different social and ethnic backgrounds, the

findings could be generalized to school with similar demographics in Texas.

6. Replicate this study using the more advanced version of the IMMA (Advanced Measures of Musical Audiation, AMMA) to measure the level of musical

aptitude. It would be interesting to see if similar correlations exist between the

reading and mathematics sections of the STAAR and a more advanced version of

the musical aptitude assessment.

Recommendations for future practice. There are three key recommendations

for practice based on the results of this study.

1. Raise school district awareness. The researcher will share his findings with the administration in the school district that authorized this study. This study will be

summarized in a scholarly article and submitted to the Texas Music Educator

Research in order to make the information more accessible for all stakeholders.

The findings of this study showed a strong statistical correlation between the level

of musical aptitude and the reading and mathematics sections of the STAAR. By

investigating such relationships, school administrators, teachers, and policy

holders are capable of identifying tactics that may improve test results.

2. Administrators and policy makers need to support music education programs. The results of this study indicated that a strong correlation existed between the level of

musical aptitude and reading and mathematics section of the STAAR. With high

quality music instruction, musical aptitude can be raised to its highest possible

level (Gordon, 2003). Therefore, it is reasonable to believe that high quality music

programs designed to increase musical aptitude may also have a positive effect on

reading and mathematics achievement.

118

3. Incorporate musical aptitude testing periodically in every music classroom. Musical aptitude testing is designed to act as objective aids to teachers and

parents by providing students with appropriate opportunities and instruction

(Gordon, 2003). Specifically, musical aptitude testing can be beneficial to music

educators by adapting formal and informal instructions to the individual musical

needs of students with high, average, and low music aptitudes.

These recommendations are provided as practical suggestions to improve

academic achievement among middle school students. As noted earlier, with high quality

music instruction, musical aptitude can be raised to its highest possible level (Gordon,

2003). Hence, it is reasonable to believe that high quality music programs designed to

increase musical aptitude may also have a positive effect on standardized assessments.

119

References

Abril, C. R., & Gault, B. M. (2006). The state of music in the elementary school: The

principals’ perspective. Journal of Research in Music Education, 54(1): 6-20.

Retrieved December 9, 2012 from EBSCOhost database.

Allen, K. (2011). Mathematics as thinking. Democracy & Education, 19(2): 1-7.

Anthamatten, P. (2010). Spatial thinking concepts in early grade-level geography

standards. Journal of Geography, 109(5), 169-180.

doi:10.1080/00221341.2010.498898

Arbona, C., & Jimenez, C. (2014). Minority stress, ethnic identity, and depression among

Latino/a college students. Journal of Counseling Psychology, 61(1): 162-168. doi:

10.1037/a0034914

Argabright, R. (2005). Connecting with music. General Music Today, 18(2): 5-6.

Ashford, E. (2004). NCLB’s unfunded art programs seek refuge. Education Digest,

70(2): 22–26.

Barrett, D.L. (1993). Relationship of musical audiation to musical training, musical

proficiency and scholastic achievement among advantaged and disadvantaged 6-

to-8 year olds. (Doctoral dissertation, Columbia University Teachers College).

Dissertation Abstracts International, 54 (12), 4383A.

Beveridge, T. (2010). No child left behind and fine arts classes. Arts Education Policy

Review, 111(1): 4-7. doi:10.1080/10632910903228090

Bluman, A. G. (2012). Elementary statistics: A step by step approach. New York:

McGraw-Hill.

120

Brophy, T. S. (2005). A longitudinal study of selected characteristics of children's

melodic improvisations. Journal of Research in Music Education, 53(2), 120-133.

Retrieved from http://search.proquest.com/docview/214476818?accountid=737

Burdick, J. (2012). The legacy of "A Nation at Risk". Online Submission.

Burns, T. G. (2010). Wechsler Individual Achievement Test-III: What is the 'gold

standard' for measuring academic achievement? Applied Neuropsychology, 17(3):

234-236.

Butler, B. E., & Trainor, L. J. (2012). Sequencing the cortical processing of pitch-

evoking stimuli using EEG analysis and source estimation. Frontiers in

Psychology, 3, 1-13.

Butzlaff, R. (2000). Can music be used to teach reading? Journal of Aesthetic Education,

34(3-4): 167-178. doi:10.1080/09084282.2010.499803

Campbell, P. F., & Malkus, N. N. (2011). The impact of elementary mathematics coaches

on student achievement. Elementary School Journal, 111(3): 430-454.

Cavanagh, S. (2009). Music and academic achievement. Education Week, 28(23): 4.

Circle, D. (2005). To test or not to test? Music Educators Journal, 92(1): 4.

Chapman, L. (2004). No child left behind in art? Arts Education Policy Review, 106 (2):

3–17.

Cheema, J. R., & Galluzzo, G. (2013). Analyzing the gender gap in math achievement:

Evidence from a large-scale US sample. Research in Education, (90), 98-112.

doi:10.7227/RIE.90.1.7

121

Cheema, J., Kitsantas, A., & Ware, H. W. (2011). Mathematics achievement: The role of

homework and self-efficacy beliefs. Journal of Advanced Academics, 22(2): 310-

339.

Cole, K. (2011). Professional notes: Brain-based-research music advocacy. Music

Educators Journal, 98(1): 26-30. doi: 10.1177/0027432111416574

Colwell, R. (2005). Whither programs and arts policy? Arts Education Policy Review 106

(6): 19–29.

Cox, H. A., & Stephens, L. J. (2006). The effect of music participation on mathematical

achievement and overall academic achievement of high school students.

International Journal of Mathematical Education in Science & Technology,

37(7): 757-763. doi: 10.1080/00207739050013781

Dastjerdi, M., Ozker, M., Foster, B. L., Rangarajan, V., & Parvizi, J. (2013). Numerical

processing in the human parietal cortex during experimental and natural

conditions. Nature Communications, 4.

Davies, G. (2007). See government grow: Education politics from Johnson to Reagan.

Lawrence: University Press of Kansas.

Deere, K. B. (2010). The impact of music education on academic achievement in reading

and math. (Order No. 3425720, Union University). ProQuest Dissertations and

Theses,135. Retrieved from

http://search.proquest.com/docview/759833158?accountid=7374. (759833158).

Deutsch, G. K., Dougherty, R. F., Scachar, M. B., Tsang, J. M., & Wandell, B. (2009).

Training in the arts, reading, and brain imaging. Washington, DC: Dana Press.

122

Duncan, C. (2011). Re/Viewing student success in an era of accountability. Journal of

Unschooling & Alternative Learning, 5(10): 58-99.

Eisele-Dyrli, K. (2010). Study finds impact of NCLB on art ed generally negative.

District Administration, 46(3): 18.

Elmore, R. F., & Fuhrman, S. H. (2001). Research finds the false assumption of

accountability. Education Digest, 67(4): 9.

Elpus, K. (2012). Is it the music or is it selection bias? A nationwide analysis of music

and non-music students’ SAT scores. Journal of Research in Music Education.

Fieldman, R. S. (2010). Child development. Upper Saddle River: Prentice Hall.

Fraenkel, J. R., Wallen, N. E., & Hyun, H. H. (2012). How to design and evaluate

research in education. New York: McGraw-Hill Higher Education.

Francois, C., Chobert, J., Besson, M., & Schon, D. (2013). Music training for the

development of speech segmentation. Cerebral Cortex, 23(9): 2038-2043. doi:

10.1093/cercor/bhs180

Fujioka, T. (2006).One year of musical training affects development of auditory cortical

evoked fields in young children. Brain, 129, 2593–2608.

Gadberry, D. (2010). Music participation and academic success. Kodaly Envoy, 36(4):

13-14.

Gardner, H. (2006). Multiple intelligences: New horizons. New York: BasicBooks.

Gardner, H., & Moran, S. (2006). The Science of multiple intelligences theory: A

response to Lynn Waterhouse. Educational Psychologist, 41(4): 227-232. doi:

10.1207/s15326985ep4104_2

123

Geist, K., Geist, E. A., & Kuznik, K. (2012). The Patterns of music: Young children

learning mathematics through beat, rhythm, and melody. Young Children, 67(1):

74-79.

Good, C. J. (2010). A nation at risk. American Educational History Journal, 37(1/2):

367-386.

Gordon, E. (1971). The psychology of music teaching. Englewood Cliffs: Prentice-Hall.

Gordon, E. (1986). Learning sequences in music. Chicago: GIA

Gordon, E. E. (1999). All about audiation and music aptitudes. Music Educators Journal,

86(2): 41.

Gordon, E. E. (2002). Developmental and stabilized music aptitudes: Further evidence of

the duality. Chicago: GIA Publications.

Gordon, E. (2003). Learning sequences in music: Skill, content, and patterns: A music

learning theory. Chicago: GIA Publications.

Gordon, E. (2007). Learning sequences in music; Skill, content and patterns. Chicago;

GIA Publications.

Gordon, E.E. (2011). Roots of music learning theory and audiation. Chicago: GIA

Publications.

Grand Canyon University. (2013). Institutional review board handbook.

Grey, A. C. (2010). No child left behind in art education policy: A review of key

recommendations for arts language revisions. Arts Education Policy Review,

111(1): 8-15. doi:10.1080/10632910903228132

Hall, K. (2013). The Carnegie Hall Royal Conservatory Achievement Program. Journal

of Singing, 69(4): 467-470.

124

Hemmings, B., Grootenboer, P., & Kay, R. (2011). Predicting mathematics achievement:

The influence of prior achievement and attitudes. International Journal of Science

& Mathematics Education, 9(3): 691-705. doi:10.1007/s10763-010-9224-5

Ho, Y., Cheung, M., & Chan, A. (2003). Music training improves verbal but not visual

memory: Cross-Sectional and Longitudinal Explorations in Children.

Neuropsychology, 17(3), 439–450. doi:10.1037/0894-4105.17.3.439.

Hobbs, C. (1985). A Comparison of the music aptitude, scholastic aptitude, and academic

achievement of young children. Psychology of Music, 13(2): 93-98. doi:

10.1177/0305735685132003

Hodges, D. A. & O’Connell, D. S. (2005). The impact of music education on academic

achievement. Chapter 2 in Sounds of learning status report. International

Foundation for Music Research,

Holsomback, J. R. (2001). Testing the relationship between musical aptitude and

standardized test scores of beginner band students, Reports of Research in Music

Education, Duke, Robert A. (Ed.), Texas Music Educators Association, Austin.

Retrieved from www.tmea.org/080_College/Research/Hol2001.pdf

Holsomback, J. R. (2002). Testing the relationship between musical aptitude and

standardized test scores of beginner band students: Results of the second-year

study, Reports of Research in Music Education, Duke, Robert A. (ed.) Texas

Music Educators Association, Austin. Retrieved from

www.tmea.org/080_College/Research/Hol2002.pdf

Holsomback, J. R. (2004). Testing the relationship between musical aptitude and

standardized test scores of beginner band students: Results of the second-year

125

study, Reports of Research in Music Education, Duke, Robert A. (ed.) Texas

Music Educators Association. Austin. Retrieved from

www.tmea.org/080_College/Research/Hol2002.pdf

Hornbach, C. & Taggart, C. (2005). The relationship between developmental tonal

aptitude and singing achievement among kindergarten, first-, second-, and third-

grade students. Journal of Research in Music Education, 53(4), 322-331.

Hyde, K. L., Lerch, J., Norton, A., Forgeard, M., Winner, E., Evans, A. C., & Schlaug, G.

(2009). The effects of musical training on structural brain development. Annals of

the New York Academy of Sciences, 1169: 182-186. doi:10.1111/j.1749-

6632.2009.04852.x

Johanningmeier, E. V. (2010). "A nation at risk" and "Sputnik": Compared and

reconsidered. American Educational History Journal, 37(2): 347-365.

Johnson, D. L. (2010). Teaching band in a "program improvement" (NCLB) district.

School Band & Orchestra, 13(10): 44-46.

Johnson, K.A., Wilson, C.M. Wilson, & Williams-Rossi, D. (2014). All reading tests are

not created equal: A comparison of the state of Texas assessment of academic

readiness (STAAR) and the Gray Oral Reading Test-4 (GORT-4). Texas Journal

of Literacy Education, 1(1), 13-22.

Johnson, W. L., Johnson, A. M., & Johnson, J. (2012). Predicting student success on the

Texas chemistry STAAR Test: A logistic regression analysis. Online Submission.

Jorgenson, O. (2012). What we lose in winning the test score race. Principal, 91(5), 12-

15.

126

Karma, K. (2007). Musical aptitude definition and measure validation: Ecological

validity can endanger the construct validity of musical aptitude tests.

Psychomusicology: A Journal of Research in Music Cognition, 19(2): 79-90.

doi:10.1037/h0094033

Kelly, S. N. (2012). Fine arts-related instruction’s influence on academic success.

Journal of Research in Music Education, 54(4): 293-307.

Kitsantas, A., Cheema, J., & Ware, H. W. (2011). Mathematics achievement: The role of

homework and self-efficacy beliefs. Journal of Advanced Academics, 22(2): 310-

339.

Knoeppel, R. C., & Brewer, C. A. (2011). Education reform, equal opportunity and

educational achievement: Do trend data adequately report progress? Education

Policy Analysis Archives, 19(10): 1-27.

Krumenaker, L. (2009). The modern U.S. high school astronomy course, its status and

makeup, and the effects of No Child Left Behind. Astronomy Education Review,

8(1): 010107-1-010107-8. doi: 10.3847/AER2009042

Kuhlman, K. (2005). Musical aptitude versus academic ability as a predictor of beginning

instrumental music achievement and retention: Research and implications.

UPDATE: Applications of Research in Music Education, 24(1): 34-43.

Linder, S. M., Powers-Costello, B., & Stegelin, D. A. (2011). Mathematics in early

childhood: research-based rationale and practical strategies. Early Childhood

Education Journal, 39(1): 29-37. doi: 10.1007/s10643-010-0437

127

Major, M. L. (2013). How they decide: A case study examining the decision-making

process for keeping or cutting music in a K–12 public school district. Journal of

Research in Music Education, 61(1): 5-25. doi:10.1177/0022429412474313

Maltese, A. V., Tai, R. H., & Xitao, F. (2012). When is homework worth the time?

Evaluating the association between homework and achievement in high school

science and math. High School Journal, 96(1): 52-72.

Marshall, D. (1998). "snowballing technique." A Dictionary of Sociology. Retrieved from

http://www.encyclopedia.com/doc/1O88-snowballingtechnique.html

Martin, D. (2012). Rich assessment in a first-year, teacher education (primary)

mathematics education subject. International Journal of Pedagogies & Learning,

7(1): 62-72.

Mathis, W. J. (2004). NCLB and high-stakes accountability: A cure? Or a symptom of

the Disease? Educational Horizons, 82(2): 143-152.

McGuinn, P. J. (2006). No child left behind and the transformation of federal education

policy, 1956-2005. Lawrence: University Press of Kansas

McLelland, M. (2005). The impact of instrumental music on student achievement in

reading and mathematics. Doctoral dissertation, Wilmington College, Delaware.

Meyer, L. (2005). The complete curriculum: Ensuring a place for the arts in America’s

schools. Arts Education Policy Review, 106 (3): 35–392005. UMI No. 3156552.

Michener, D., & Fishoff, J. (2012). Surprising insights: Literacy and music. California

Reader, 45(3): 25-32.

Mishook, J., and M. Kornhaber. (2006). Arts integration in an era of accountability. Arts

Education Policy Review, 107(4): 3–11.

128

Morales, E. E. (2010). Linking strengths: Identifying and exploring protective factor

clusters in academically resilient low-socioeconomic urban students of color.

Roeper Review, 32(3): 164-175. doi:10.1080/02783193.2010.485302

Morse, L. L. (2010). Involvement in sixth-grade beginner band: A relational study of the

impact on mathematics achievement. The University of Mississippi. ProQuest

Dissertations and Theses, 86. Retrieved from

http://search.proquest.com/docview/858094089?accountid=7374 (858094089).

Mursell, J. L. (1937). What about music tests? Music Educators Journal, 24(2): 16. doi:

10.2307/3385164

Musa, A. J. (2013). Gender, geographic locations, achievement goals and academic

performance of secondary school students from Borno State, Nigeria. Research in

Education, (90), 15-31. doi:10.7227/RIE.90.1.2

Musacchia, G., Sams, M., Skoe, E., & Kraus, N. (2007). Musicians have enhanced

subcortical auditory and audiovisual processing of speech and music. Proceedings

of the National Academy of Sciences, 104(40), 15894-15898. doi:

10.1073/pnas.0701498104

National Assessment of Educational Progress. (2009): Math assessment. ED Review, 5.

Oare, S., & Bernstorf, E. (2010). Literacy in the instrumental classroom. Kodaly Envoy,

37(1): 17-20.

Olszewski-Kubilius, P., & Thomson, D. L. (2010). Gifted programming for poor or

minority urban students: Issues and lessons learned. Gifted Child Today, 33(4):

58-64.

129

Palmer, R. T., Davis, R. J., Moore III, J. L., & Hilton, A. A. (2010). A nation at risk:

Increasing college participation and persistence among African American males

to stimulate U.S. global competitiveness. Journal of African American Males in

Education, 1(2): 105-124.

Pane, D., & Salmon, A. (2011). Author's camp: Facilitating literacy learning through

music. Journal of Reading Education, 36(2): 36-42.

Peynircioglu, Z. F., Durgunoglu, A. Y., & Uney-Kusefoglu, B. (2002). Phonological

awareness and musical aptitude. Journal of Research in Reading, 25(1): 68-80.

doi: 10.1111/1467-9817.00159

Phillips-Silver, J. (2009). On the meaning of movement in music, development and the

brain. Contemporary Music Review, 28(3): 293-314.

doi:10.1080/07494460903404394

Piro, J. (2009). Music training and literacy development. Literacy Today, 59, 32-34.

Piro, J., & Ortiz, C. (2009). Music education can help children improve reading skills.

Journal of Psychology of Music, 65 (1): 32-37.

Posner, M., Rothbart, M., Sheese, B., & Kieras, J., (2008) How arts training influences

cognition. Learning, Arts and the Brain (pp.1-11). Dana Consortium Report on

Arts and Cognition.

Powell, D., Higgins, H. J., Aram, R., & Freed, A. (2009). Impact of no child left behind

on curriculum and instruction in rural schools. Rural Educator, 31(1): 19-28.

Proctor Duax, K. (2013). How important is music education? Journal of Singing, 69(3):

257-259.

130

Radocy, R. E., & Boyle, J. D. (2003). Psychological foundations of musical behavior.

Springfield: Thomas.

Rainbow, E. L. (1965). A pilot study to investigate constructs of musical aptitude.

Journal of Research in Music Education, 13, 3-14.

Ravitch, D., & Cortese, A. (2009). Why we’re behind: What top nations teach their

students but we don't. Education Digest, 75(1): 35-38.

Rowe, E. W., Miller, C., Ebenstein, L. A., & Thompson, D. F. (2012). Cognitive

predictors of reading and math achievement among gifted referrals. School

Psychology Quarterly, 27(3): 144-153. doi:10.1037/a0029941

Rubinson, L. E. (2010). A correlational study of the relationships between music aptitude

and phonemic awareness of kindergarten children. Capella University). ProQuest

Dissertations and Theses, 137. Retrieved from

http://search.proquest.com/docview/305243668?accountid=7374 (305243668).

Rutkowski, J. (1996). The effectiveness of individual/small group singing activities on

kindergartners' use of singing voice and developmental music aptitude. Journal of

Research in Music Education, 44(4): 353-68.

Schutz, D., Simon, D. M., & Musgrave, R. (2013). Toward educational testing reform:

Inside reading achievement tests. Education Policy Analysis Archives, 21(90): 1-

28.

Seashore, C. E. (1919). The psychology of musical talent. Boston: Silver, Burdett and

Company.

Seashore, C. E. (1938). The Psychology of music. XIII. Musical intelligence. Music

Educators Journal, 24(5): 32. doi: 10.2307/3385427

131

Simmons, J. C. H. (1981). An investigation of relationships among primary-level student

performance on selected measures of music aptitude, scholastic aptitude, and

academic achievement. (Doctoral dissertation, Peabody College for Teachers of

Vanderbilt University). Abstract retrieved from Dissertations & Theses: A&I

database.

Slaton, E. (2012). Collegiate connections: Music education budget crisis. Music

Educators Journal, 99(1): 33-35. doi:10.1177/0027432112454837

Spelke, E. S. (2009). Effects of music instruction on developing cognitive systems at the

foundations of mathematics and science. Learning, Arts and the Brain: The Dana

Consortium Report on Arts and Cognition. NY/Washington D.C.: Dana Press.

Spohn, C. (2008). Teacher perspectives on no child left behind and arts education: A case

study. Arts Education Policy Review, 109(4): 3-12.

Standley, J. M. (2008). Does music instruction help children learn to read? Evidence of a

meta-analysis. Update: Applications of Research in Music Education, 27(1): 17-

32. doi: 10.1177/8755123308322270

Stanley, N., & Stanley, L. (2011). Predicting FCAT reading scores using the reading-

level indicator. Reading Psychology, 32(2): 99-112.

doi:10.1080/02702711.2010.507529

Stanton, H. M. (1922). The inheritance of specific musical capacities. Cold Spring

Harbor.

Stubnisky, R. H., Perry, R. P., Hall, N. C., & Guay, F. (2012). Examining perceived

control level and instability as predictors of first-year college students' academic

achievement. Contemporary Educational Psychology, 37(2): 81-90.

132

Talley, C. P., & Scherer, S. (2013). The enhanced flipped classroom: Increasing

academic performance with student-recorded lectures and practice testing in a

"flipped" STEM course. Journal of Negro Education, 82(3): 339-347.

Texas Education Agency. (2003). Enrollment in Texas public schools, 2002–03

(Document No. GE09 601 06). Austin: Author.

Texas Education Agency. (2006). 2006 accountability system state summary. Retrieved

March 2, 2014 from http://www.tea.state.tx.us/accountability/

Texas Education Agency. (2007). 2007 accountability system state summary. Retrieved

March 2, 2014 from http://www.tea.state.tx.us/accountability/

Texas Education Agency. (2008). 2008 accountability system state summary. Retrieved

March 2, 2014 from http://www.tea.state.tx.us/accountability/

Texas Education Agency. (2009). Snapshot 2009. Austin: Author. Retrieved from

http://ritter.tea.state.tx.us/perfreport/snapshot/2008/state.html

Texas Education Agency. (2012). Snapshot 2012. Austin, TX: Author. Retrieved from

http://www.tea.state.tx.us/index4.aspx?id=2147505027

Texas Education Agency. (2014, January 9). STAAR® resources. Texas Education

Agency. Retrieved from http://www.tea.state.tx.us/student.assessment/staar/

Toldson, I. A. (2012). Editor's comment: When standardized tests miss the mark. Journal

of Negro Education, 81(3): 181-185.

Tomei, M. C. (2010). The relationship between musical aptitude and academic

achievement of third-grade students (Order No. 1480763). Available from

ProQuest Dissertations & Theses Full Text: The Humanities and Social Sciences

133

Collection. (755055207). Retrieved from

http://search.proquest.com/docview/755055207?accountid=7374

Ukkola-Vuoti, L., Kanduri, C., Oikkonen, J., Buck, G., Blancher, C., Raijas, P., Karma,

K., Lahdesmaki, H., & Järvelä, I. (2013). Genome-wide copy number variation

analysis in extended families and unrelated individuals characterized for musical

aptitude and creativity in music. Plos ONE, 8(2): 1-9.

doi:10.1371/journal.pone.0056356

United States Department of Education. (1983). A nation at risk: The imperative for

educational reform (National Commission on Excellence in Education).

Washington, DC: Author.

United States Department of Education. (2009). United States Department of Education

FY 2009 Annual Performance Report. United States Department of Education.

U.S. Department of Education. (1983). A nation at risk: The imperative for educational

reform: A report to the Nation and the Secretary of Education, United States

Department of Education. Washington, D.C.: National Commission on

Excellence in Education.

U.S. Department of Education. (2010). United States Department of Education FY 2009

annual performance report. U.S. Department of Education.

Wadsworth, B. J. (1996). Piaget's theory of cognitive and affective development:

Foundations of constructivism. White Plains: Longman USA.

Waller, D. (2010). Language literacy and music literacy: A pedagogical asymmetry.

Philosophy of Music Education Review, 18(1): 26-44.

134

Weaver, K. (2011). Standardized testing: Measurement of academic achievement. Online

Submission.

Webb, M. N. A. (1984). An investigation of the relationship of musical aptitude and

intelligence of students at the third grade level. (Doctoral dissertation, The

University of North Carolina at Greensboro). Abstract retrieved from

Dissertations & Theses: A&I database.

West, C. (2012). Teaching music in an era of high-stakes testing and budget reductions.

Arts Education Policy Review, 113(2): 75-79.

doi:10.1080/10632913.2012.656503

William, D. (2010). Standardized testing and school accountability. Educational

Psychologist, 45(2): 107-122. doi:10.1080/00461521003703060

Williams, A. (2011). A Call for change: Narrowing the achievement gap between white

and minority students. Clearing House, 84(2): 65-71.

doi:10.1080/00098655.2010.511308

Yin, R. K. (2009). Case study research: Design and methods, 4th edition. London: SAGE

Publishing.

Young, W. T. (1971). The Role of Musical Aptitude, Intelligence, and Academic

Achievement in Predicting the Musical Attainment of Elementary Instrumental

Music Students. Journal of Research in Music Education, 19(4): 385.

Young Ju, J., Hyuck, S., Sunyoung, J., & You Kyung, L. (2012). The effects of academic

self-efficacy, learning strategies, and perceived instructional strategies on high

and low achievers' in the middle school Korean language. KEDI Journal of

Educational Policy, 9(2): 239-257.

135

Zentner, M., & Eerola, T. (2010). Rhythmic engagement with music in infancy.

/Proceedings of the National Academy of Sciences, 107(13): 5768-5773. doi:

10.1073/pnas.1000121107

136

Appendix A

Site Authorization Letter

137

Appendix B

Permission to Use Instrument

138

Appendix C

IMMA Instrument

139

140

141

142

143

144

145

146

147

Appendix D

Informed Consent

148

149

Appendix E

Computation of Minimum Sample Size

Exact - Correlation: Bivariate normal model

Options: exact distribution

Analysis: A priori: Compute required sample size

Input: Tail(s) = Two

Correlation ρ H1 = 0.475

α err prob = 0.05

Power (1-β err prob) = 0.8

Correlation ρ H0 = 0

Output: Lower critical r = -0.2759365

Upper critical r = 0.2759365

Total sample size = 85

Actual power = 0.9507848

t-tests - Linear bivariate regression: One group, size of slope

Analysis: Post hoc: Compute achieved power

Input: Tail(s) = One

Slope H1 = 0.15

α err prob = 0.05

Total sample size = 65

Slope H0 = 0

Std dev σ_x = 1

Std dev σ_y = 1

Output: Noncentrality parameter δ = 1.2231777

Critical t = 1.6694022

Df = 63

Power (1-β err prob) = 0.8018498

150

Appendix F

IRB Approval Letter