The evolution of the gender test score gap through seventh grade: new insights from Australia using unconditional quantile regression and decomposition

Le, Huong Thu; Nguyen, Ha Trong

doi:10.1186/s40172-018-0062-y

Original article
Open access
Published: 21 February 2018

The evolution of the gender test score gap through seventh grade: new insights from Australia using unconditional quantile regression and decomposition

Huong Thu Le^1,2 &
Ha Trong Nguyen³

IZA Journal of Labor Economics volume 7, Article number: 2 (2018) Cite this article

4894 Accesses
9 Citations
1 Altmetric
Metrics details

Abstract

This paper documents the patterns and examines the factors contributing to a gender gap in educational achievements in early seventh grade of schooling using a recent and nationally representative panel of Australian children. Regression results indicate that females excel at non-numeracy subjects at later grades whereas males outperform females in numeracy in all grades, whether at the mean or along the distribution of the test score. Our results also reveal a widening gender test score gap in numeracy as students advance their schooling. Regression and decomposition results also highlight the importance of controlling for pre-school cognitive skills in examining the gender test score gap.

1 Introduction

Gender differentials in educational achievements have long been the focus of research. This is not surprising given that education has been shown to improve many life outcomes such as health and labour market outcomes (Card 1999; Schoeni et al. 2008). The underrepresentation of women in science, technology, engineering and mathematics (STEM) careers has resulted in research and policies focusing on gender gaps in test scores, particularly in maths-related subjects in the early years of schooling (Fryer and Levitt 2010; Justman and Méndez 2016). While there has been a rich literature on gender gaps in educational achievements, little consensus exists about the evolution as well as the factors contributing to the gaps in early childhood. One major issue plaguing researchers in documenting the evolution of the gaps is the lack of rich panel data. This study sets out to contribute to the literature by using a recent and nationally representative Longitudinal Study of Australian Children (LSAC) survey to document the evolution and examine factors contributing to gender gaps in academic achievements in early seventh grade of schooling.

This paper contributes to the international literature on the gender test score gap by not only introducing the Australian case study but also bringing three other additions to the current literature. The first addition is that with the remarkably rich panel data relative to previous international literature—containing five assessments over the first 7 years of schooling of the same children, and an exhaustive list of home and school environments—enables the testing of several socialisation theories. For example, one of the particular advantages of the data is that pre-school cognitive skills^{Footnote 1} of students are observed, allowing investigation of the way that initial academic endowments contribute to the gender test score over their first 7 years of schooling. As another example, the data contain test scores of students up to the seventh grade while current US studies, which use a comparable US data set from the Early Childhood Longitudinal Study Kindergarten cohort, only examine the gender test score gap up to the fifth grade (Fryer Jr and Levitt 2004; Fryer and Levitt 2010; Sohn 2012; Bertrand and Pan 2013). These Australian data thus allow examination of the evolution of the gender test score gap through higher grades than that of the US studies.

The second addition is that this paper is one of a few papers in the literature applying a quantile regression to investigate the relative performance of male and female students along the whole distribution of test scores rather than at means (Husain and Millimet 2009; Sohn 2012; Gevrek and Seiberlich 2014). Analysis based solely on means may miss important information in other parts of the distribution (Firpo et al. 2009). This is especially relevant when policy concern is focused on the tail of the test score distribution, and when evaluating and decomposing the gender test score gap at different points of the test score distribution is of interest (Husain and Millimet 2009; Sohn 2012; Gevrek and Seiberlich 2014). To do so, this paper applies an unconditional quantile regression developed by Firpo et al. (2009). The advantage of the unconditional quantile regression over the traditional conditional quantile regression of Koenker and Bassett (1978) is that its estimates can be interpreted as the impact of changes in explanatory variables on the dependent variable for those at a specific point in the distribution.^{Footnote 2} The estimates from the unconditional quantile regression can then be directly applied to an Oaxaca-Blinder (OB) decomposition method to examine factors contributing to the gender test score gap across the entire distribution. Therefore, this study makes its third addition to the literature as one of a few papers (Sohn 2012; Gevrek and Seiberlich 2014) applying a quantile decomposition method to study the gender test score gap.

By using the first five waves of the LSAC survey, we find that males excel at numeracy at all grades, whether at means or along the distribution. Also, we uncover heterogeneous patterns in the gender test score gap across the test score distribution, by test subjects and test grades. The regression results also reveal a widening gender test score gap in numeracy as students advance their schooling. The decomposition results indicate that gender disparities in pre-school cognitive skills can explain a large part of the differences in academic performance.

The remainder of the paper is structured as follows. Section 2 summarises the most relevant literature while Section 3 describes the data. Section 4 presents this study’s empirical regression and decomposition models and Section 5 discusses the regression results. Section 6 reports decomposition results of factors contributing to the gender test score gap, and, finally, Section 7 concludes.

2 Literature review

International literature has consistently shown significant gender test score gaps, with male students generally outperforming female students in maths and science while female students excel at literacy subjects (Wilder and Powell 1989; Marks 2008; Bedard and Cho 2010; Fryer and Levitt 2010; Christopher et al. 2013; Falch and Naper 2013; Stoet and Geary 2013; Dickerson et al. 2015). In addition, studies have often documented that the gender gap in a particular subject only appears at certain educational levels and tends to increase as students advance their schooling (Coleman et al. 1966; Husain and Millimet 2009; Fryer and Levitt 2010).

Research that has been devoted to attempting to explain the recognised patterns in the gender educational gap has proposed a wide range of different contributing factors. For example, some studies have demonstrated that differences in the brain between genders may explain these patterns as males tend to be better at analysing systems, while females tend to be better at reading the emotions of other people (Kimura 2000; Baron-Cohen 2007). Furthermore, gender differences in competition (Gneezy et al. 2003; Niederle and Vesterlund 2010), parental time investment in children (Baker and Milligan 2016), or social and cultural conditioning and gender-biased environments (Guiso et al. 2008; Bedard and Cho 2010; Dickerson et al. 2015) are possible explanations for the observed gender gaps in academic achievements. An emerging number of studies also highlight the roles of non-cognitive skills (Jacob 2002; Duckworth and Seligman 2006; Christopher et al. 2013; Golsteyn and Schils 2014) in contributing to the gender test score gap.^{Footnote 3} This present paper contributes to the literature by assessing the role of pre-school cognitive skills in contributing to the gender academic achievement gap and how that role evolves as students advance in their schooling.

Australian studies have documented gender differences in academic outcomes at all educational levels. For example, Nghiem et al. (2015) used the first four waves of the LSAC data to report that male students outperform their female counterparts in grade 3 and 5 numeracy. In contrast, female students outperform in grade 3 writing and grade 5 reading and grammar. More recently, Justman and Méndez (2016) used administrative data from Victoria to show that male students score higher than female students in mathematics and lower in reading in grades 7 and 9. As another example, Marks (2008) used the OECD’s 2000 Programme for International Student Assessment (PISA) project to document that 15-year-old Australian females perform better than males in reading but worse in mathematics. Using various datasets, Homel et al. (2012) reported that 18-year-old Australian females are more likely to complete Year 12 than males. At the tertiary educational level, Booth and Kee (2011) used aggregate data to report that since 1987 Australian females were more likely than males to be enrolled at university. These studies often attempt to capture the gender educational achievement gap by including a gender dummy variable in a multivariate regression framework and only examine the mean gap.

3 Data and descriptive statistics

3.1 Data and sample

We use data from the first five waves of the biannual national representative LSAC survey. The LSAC, initiated in 2004, contains comprehensive information about children’s test scores and other socio-economic and demographic background of the children and their parents. The LSAC sampling frame consists of all children born between March 2003 and February 2004 (the birth or “B cohort”, infants aged 0–1 year in 2004), and between March 1999 and February 2000 (the kindergarten or “K cohort”, children aged 4–5 years in 2004). In this study, children of K cohort are used because measures on student test scores are more widely available for this cohort in the first five waves of the survey.

To indicate the academic achievements of students, we employ results from the National Assessment Program – Literacy and Numeracy (NAPLAN) tests.^{Footnote 4} The NAPLAN test is required of all Australian students in grades 3, 5, 7 and 9 in the five domains of reading, writing, spelling, grammar and numeracy. The test scores range from 0 to 1000 and are comparable across students and over time (ACARA 2014). The NAPLAN test results of the children were collected via data linkage with the LSAC data (Daraganova et al. 2013). At the time of this study, the linkage data for LSAC were mainly available for students in grades 3, 5 and 7. Thus, we employ these test results at these grades to measure the academic achievements of students. Following the previous Australian literature (Justman and Méndez 2016; Cobb-Clark and Moschion 2017) and for brevity purposes, we focus on two main test subjects: reading and numeracy.^{Footnote 5} Since the NAPLAN test dates and LSAC survey dates are not the same, test results and survey data are merged in the way that test results are not pre-dated by survey data.^{Footnote 6} This matching exercise shows that NAPLAN test scores in grades 3, 5 and 7 are merged with survey data in waves 2, 3 and 4, respectively. As is generally done in the literature (Husain and Millimet 2009; Fryer and Levitt 2010; Sohn 2012; Golsteyn and Schils 2014), NAPLAN test scores are standardised (with mean 0 and standard deviation 1) by grade and domain in this paper.

To measure the initial stocks of students’ cognitive skills, we use the Peabody Picture Vocabulary Test (PPVT) and Who Am I (WAI). The PPVT is an interviewer-administered test to assess a child’s knowledge of the meaning of spoken words and his or her receptive vocabulary for standard English (Dunn and Dunn 1997). The PPVT test requires a child to show the picture that best represents the meaning of a stimuli word spoken by the examiner. The WAI test is also administered by an interviewer to measure the general cognitive ability of pre-school age children to perform literacy and numeracy tasks, such as reading, copying and writing letters, words, shapes and numbers (Lemos and Doig 1999). PPVT and WAI scores are used in wave 1 when the student is 4 or 5 years old (i.e., before enrolling in primary school). Similar to NAPLAN test scores, PPVT and WAI test scores are standardised for ease of interpretation.

3.2 Sample

As discussed in Section 3.1, this study focuses on K cohort children because test scores are more widely available for them. Furthermore, among students who took any test in any test grade, the focus is on about 96% of those who completed all five test subjects. Moreover, the sample is restricted to students without missing information on a list of important explanatory variables. To keep the results comparable over time, specifications that use variables which are available in all waves of the LSAC and contain the least missing information (see Table 1 and Section 4 for a list of variables included in our baseline models) are used. These variables are commonly used in studies which employ a popular and comparable US data set from the Early Childhood Longitudinal Study Kindergarten cohort (Fryer Jr and Levitt 2004; Fryer and Levitt 2010; Sohn 2012; Bertrand and Pan 2013) to study a gender test score gap of school students.^{Footnote 7}

Table 1 Summary statistics by gender

The evolution of the gender test score gap through seventh grade: new insights from Australia using unconditional quantile regression and decomposition

Abstract

1 Introduction

2 Literature review

3 Data and descriptive statistics

3.1 Data and sample

3.2 Sample

3.3 Summary statistics by gender

4 Empirical models

4.1 Regression models

4.2 Decomposition models

5 Empirical regression results

5.1 Estimates of gender test score gap at means of test score distribution

5.2 Estimates of gender test score gap along the test score distribution

6 Empirical decomposition results

7 Conclusions

Notes

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Publisher’s Note

Appendices

Appendix 1

Appendix 2

1.1 Supplemental materials for refereeing purposes and on-line publication

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification