An investigation of gender-based differences in assessment instruments: A test of measurement invariance
SA Journal of Industrial Psychology
ISSN: (Online) 2071-0763, (Print) 0258-5200
Page 1 of 12
Original Research
An investigation of gender-based differences
in assessment instruments: A test of
measurement invariance
Authors:
Renier Steyn1
Gideon P. de Bruin1
Affiliations:
1
Department of Industrial
Psychology and People
Management, University of
Johannesburg, Johannesburg,
South Africa
Corresponding author:
Renier Steyn,
Dates:
Received: 01 June 2019
Accepted: 02 Nov. 2019
Published: 18 Mar. 2020
How to cite this article:
Steyn, R., & De Bruin, G.P.
(2020). An investigation of
gender-based differences in
assessment instruments: A
test of measurement
invariance. SA Journal of
Industrial Psychology/SA
Tydskrif vir Bedryfsielkunde,
46(0), a1699. https://doi.
org/10.4102/sajip.v46i0.1699
Copyright:
© 2020. The Authors.
Licensee: AOSIS. This work
is licensed under the
Creative Commons
Attribution License.
Orientation: Practitioners and researchers often assume that the psychometric instruments they
use are invariant and that they therefore measure similar constructs in a comparable manner across
men and women respondents. This assumption is, however, rarely tested, leading to an undetected
bias in research findings or an adverse impact because of the presence of non-invariance.
Research purpose: After presenting essential information about measurement invariance
(MI) and arguing for the testing thereof, this research aims to reveal the prevalence of MI
across several frequently used psychometric instruments credulously used based on the
assumption the revenant constructs are measured equivalently across gender exists.
Motivation for the study: Firstly, this study aims to increase awareness regarding MI, a property
that can be tested statistically. Secondly, the research aims to make practitioners aware of the
presence of bias in psychometric instruments, specifically to identify instruments that could be
included in investigations which attempt to understand gender matters in the workplace.
Research approach/design and method: Cross-sectional survey data, pertaining to seven
standard instruments, related to innovative work behaviour, were analysed. Pairwise,
multigroup confirmatory factor analyses with robust maximum likelihood estimation were
used to examine configural, metric, intercept and strict invariance, as well as the equivalence
of the latent means.
Main findings: The findings were binary, with four of the instruments showing MI at an equal
latent means level, whilst three instruments were non-invariant at the configural level.
Measurement invariance was either accepted completely or rejected completely.
Practical/managerial implications: The serratedness of findings, even when using wellrecognised and frequently used psychometric instruments, exposes the prevalence of noninvariance in some instruments, thereby necessitating the standard testing for MI. These
findings also specify the instruments that are MI (in terms of gender), which allow other
researchers and practitioners to use these instruments with more confidence when measuring
and comparing men and women respondents in their studies.
Contribution/value-add: This research demonstrates the ease with which MI testing can be
performed and alerts researchers to do MI testing when conducting cross-group studies, as the
prevalence of measurement non-invariance is high.
Keywords: gender; measurement invariance; bias; adverse effect; group differences; innovative
work behaviour.
Introduction
Read online:
Scan this QR
code with your
smart phone or
mobile device
to read online.
Gender is a prominent, salient variable within the organisational workplace. In many studies, the
perceptions of men and women are compared, or measures of perceptions are used in models to test
hypotheses related to gender differences (Cropley & Cropley, 2017; Eagly, 1997; Eagly & Wood,
1999; Koch, D’Mello, & Sackett, 2015). Differential outcomes based on gender are often reported
(Eagly & Karau, 2002; Ismail & Nakkache, 2015). Often, these differential outcomes are explained
from a sociological perspective, where differences are attributed to gender-specific roles, attributions,
stereotypical expectations, performance or attitudes (Hatlevik, Scherer, & Christophersen, 2017). In
many of these studies, it is assumed that measures of perceptions are accurate and equally valid for
men and women. Examples of such ‘naïve’ studies are plentiful (Eagly, Johannesen-Schmidt, & Van
Engen, 2003; Selvarajan, Slattery, & Stringer, 2015; Tabvuma, Georgellis, & Lang, 2015; Wang &
Gorenstein, 2015; Yi, Ribbens, Fu, & Cheng, 2015). In all of the aforementioned studies, it is assumed
http://www.sajip.co.za
Open Access
Page 2 of 12
that measures of perceptions are not gender-specific. It was
found that in none of these studies the authors tested for the
possibility that the measurement characteristics might differ
depending on gender.
When considering differences between groups (e.g. men and
women), it could be meaningful to go beyond the sociological
explanations (Hatlevik et al., 2017) and (firstly) question the
assumption that individual test items and/or the entire scale
operate equally across the groups (Millsap, 2011; Tsaousis &
Kazi, 2013; Vandenberg & Lance, 2000). The concern, as raised
above, is that this assumption is hardly ever tested explicitly,
and according to Tsaousis and Kazi (2013), this omission renders
all such comparative studies’ findings highly questionable. Do
different groups of respondents interpret a given measure in a
conceptually similar manner? Stated more operationally, are the
relationships between manifest indicator variables (scale items,
subscales) and the underlying construct the same across groups
(Bialosiewicz, Murphy, & Berry, 2013)? Should the construct not
be measured equivalently, it will cause bias in the inferences
drawn and therefore threaten the validity of the comparisons
made (Hatlevik et al., 2017). These potential deviations from
equivalence are referred to as measurement non-invariance
(Holland & Wainer, 1993). To rule out the possibility that
variations in the functioning of a scale result in biased
interpretations of results, testing for measurement invariance
(MI) can assist with clarity in this regard (Hatlevik et al., 2017).
The academic community is certainly not naïve with regard to
the possibility of gender-based MI, and some studies do
include tests of MI (Kuhn & Holling, 2009; Van Zyl, 2016;
Zampetakis, Bakatsaki, Litos, Kafetsios, & Moustakis, 2017).
This is a relatively new trend, however, with most studies
failing to test for MI across groups of interest prior to making
comparisons (Tsaousis & Kazi, 2013). This article aims to
contribute to the literature and the practice of gender-based
research by critically analysing the present-day call for the
testing of (gender-based) measuring invariance in studies
where group (gender) differences are investigated. This will
firstly be done b (...truncated)