Behavior Research Methods

https://link.springer.com/journal/13428

List of Papers (Total 6,006)

Reliability of gaze-contingent perimetry

Standard automated perimetry, a psychophysical task performed routinely in eyecare clinics, requires observers to maintain fixation for several minutes at a time in order to measure visual field sensitivity. Detection of visual field damage is confounded by eye movements, making the technique unreliable in poorly attentive individuals and those with pathologically unstable...

Test–retest reliability of reinforcement learning parameters

It has recently been suggested that parameter estimates of computational models can be used to understand individual differences at the process level. One area of research in which this approach, called computational phenotyping, has taken hold is computational psychiatry. One requirement for successful computational phenotyping is that behavior and parameters are stable over...

Singing Ability Assessment: Development and validation of a singing test based on item response theory and a general open-source software environment for singing data

We describe the development of the Singing Ability Assessment (SAA) open-source test environment. The SAA captures and scores different aspects of human singing ability and melodic memory in the context of item response theory. Taking perspectives from both melodic recall and singing accuracy literature, we present results from two online experiments (N = 247; N = 910). On-the...

Validation of scrambling methods for vocal affect bursts

Studies on perception and cognition require sound methods allowing us to disentangle the basic sensory processing of physical stimulus properties from the cognitive processing of stimulus meaning. Similar to the scrambling of images, the scrambling of auditory signals is aimed at creating stimulus instances that are unrecognizable but have comparable low-level features. In the...

LexMAL: A quick and reliable lexical test for Malay speakers

Objective language proficiency measures have been found to provide better and more consistent estimates of bilinguals’ language processing than self-rated proficiency (e.g., Tomoschuk et al., 2019; Wen & van Heuven, 2017a). However, objectively measuring language proficiency is often not possible because of a lack of quick and freely available language proficiency tests (Park et...

Do mturkers collude in interactive online experiments?

One of the issues that can potentially affect the internal validity of interactive online experiments that recruit participants using crowdsourcing platforms is collusion: participants could act upon information shared through channels that are external to the experimental design. Using two experiments, I measure how prevalent collusion is among MTurk workers and whether...

The validation of online webcam-based eye-tracking: The replication of the cascade effect, the novelty preference, and the visual world paradigm

The many benefits of online research and the recent emergence of open-source eye-tracking libraries have sparked an interest in transferring time-consuming and expensive eye-tracking studies from the lab to the web. In the current study, we validate online webcam-based eye-tracking by conceptually replicating three robust eye-tracking studies (the cascade effect, n = 134, the...

Why we need to abandon fixed cutoffs for goodness-of-fit indices: An extensive simulation and possible solutions

To evaluate model fit in confirmatory factor analysis, researchers compare goodness-of-fit indices (GOFs) against fixed cutoff values (e.g., CFI > .950) derived from simulation studies. Methodologists have cautioned that cutoffs for GOFs are only valid for settings similar to the simulation scenarios from which cutoffs originated. Despite these warnings, fixed cutoffs for popular...

The Seven-parameter Diffusion Model: an Implementation in Stan for Bayesian Analyses

Diffusion models have been widely used to obtain information about cognitive processes from the analysis of responses and response-time data in two-alternative forced-choice tasks. We present an implementation of the seven-parameter diffusion model, incorporating inter-trial variabilities in drift rate, non-decision time, and relative starting point, in the probabilistic...

Geofencing in location-based behavioral research: Methodology, challenges, and implementation

This manuscript presents a novel geofencing method in behavioral research. Geofencing, built upon geolocation technology, constitutes virtual fences around specific locations. Every time a participant crosses the virtual border around the geofenced area, an event can be triggered on a smartphone, e.g., the participant may be asked to complete a survey. The geofencing method can...

The Stroop legacy: A cautionary tale on methodological issues and a proposed spatial solution

The Stroop task is a seminal paradigm in experimental psychology, so much that various variants of the classical color–word version have been proposed. Here we offer a methodological review of them to emphasize the importance of designing methodologically rigorous Stroop tasks. This is not an end by itself, but it is fundamental to achieve adequate measurement validity, which is...

e-Babylab: An open-source browser-based tool for unmoderated online developmental studies

The COVID-19 pandemic massively changed the context and feasibility of developmental research. This new reality, as well as considerations about sample diversity and naturalistic settings for developmental research, highlights the need for solutions for online studies. In this article, we present e-Babylab, an open-source browser-based tool for unmoderated online studies targeted...

Interindividual variations in associative visual learning: Exploration, description, and partition of response characteristics

Relying on existing literature to identify suitable techniques for characterizing individual differences presents practical and methodological challenges. These challenges include the frequent absence of detailed descriptions of raw data, which hinders the assessment of analysis appropriateness, as well as the exclusion of data points deemed outliers, or the reliance on comparing...

Deep learning models for webcam eye tracking in online experiments

Eye tracking is prevalent in scientific and commercial applications. Recent computer vision and deep learning methods enable eye tracking with off-the-shelf webcams and reduce dependence on expensive, restrictive hardware. However, such deep learning methods have not yet been applied and evaluated for remote, online psychological experiments. In this study, we tackle critical...

A tutorial on automatic post-stratification and weighting in conventional and regression-based norming of psychometric tests

Norm scores are an essential source of information in individual diagnostics. Given the scope of the decisions this information may entail, establishing high-quality, representative norms is of tremendous importance in test construction. Representativeness is difficult to establish, though, especially with limited resources and when multiple stratification variables and their...

Mouth and facial informativeness norms for 2276 English words

Mouth and facial movements are part and parcel of face-to-face communication. The primary way of assessing their role in speech perception has been by manipulating their presence (e.g., by blurring the area of a speaker’s lips) or by looking at how informative different mouth patterns are for the corresponding phonemes (or visemes; e.g., /b/ is visually more salient than /g...

ESMira: A decentralized open-source application for collecting experience sampling data

This paper introduces ESMira, a server and mobile app (Android, iOS) developed for research projects using experience sampling method (ESM) designs. ESMira offers a very simple setup process and ease of use, while being free, decentralized, and open-source (source code is available on GitHub). The ongoing development of ESMira started in early 2019, with a focus on scientific...

The Children’s Picture Books Lexicon (CPB-Lex): A large-scale lexical database from children’s picture books

This article presents cpb-lex, a large-scale database of lexical statistics derived from children’s picture books (age range 0–8 years). Such a database is essential for research in psychology, education and computational modelling, where rich details on the vocabulary of early print exposure are required. Cpb-lex was built through an innovative method of computationally...

Filling the gap: Cloze probability and sentence constraint norms for 807 European Portuguese sentences

Sentence processing is affected by the sentence context and word expectancy. To investigate sentence comprehension experimentally, it is useful to have sentence completion norms with both context constraint and word expectancy measures. In this study, two experiments were conducted to collect norms for completion of 807 European Portuguese sentences. Context constraint was...

Orthography-phonology consistency in English: Theory- and data-driven measures and their impact on auditory vs. visual word recognition

Research on orthographic consistency in English words has selectively identified different sub-syllabic units in isolation (grapheme, onset, vowel, coda, rime), yet there is no comprehensive assessment of how these measures affect word identification when taken together. To study which aspects of consistency are more psychologically relevant, we investigated their independent and...

Bayesian adaptive method for estimating speed–accuracy tradeoff functions of multiple task conditions

The speed–accuracy tradeoff (SAT) often makes psychophysical data difficult to interpret. Accordingly, the SAT experimental procedure and model were proposed for an integrated account of the speed and accuracy of responses. However, the extensive data collection for a SAT experiment has blocked its popularity. For a quick estimation of SAT function (SATf), we previously developed...

Evaluating the Tobii Pro Glasses 2 and 3 in static and dynamic conditions

Over the past few decades, there have been significant developments in eye-tracking technology, particularly in the domain of mobile, head-mounted devices. Nevertheless, questions remain regarding the accuracy of these eye-trackers during static and dynamic tasks. In light of this, we evaluated the performance of two widely used devices: Tobii Pro Glasses 2 and Tobii Pro Glasses...

A novel protocol to induce mental fatigue

Mental fatigue is a commonplace human experience which is the focus of a growing body of research. Whilst researchers in numerous disciplines have attempted to uncover the origins, nature, and effects of mental fatigue, the literature is marked by many contradictory findings. We identified two major methodological problems for mental fatigue research. First, researchers rarely...

Probing apathy in children and adolescents with the Apathy Motivation Index–Child version

Apathy is linked to mental health and altered neurocognitive functions such as learning and decision-making in healthy adults. Mental health problems typically begin to emerge during adolescence, yet little is known about how apathy develops due to an absence of quantitative measurements specific to young people. Here, we present and evaluate the Apathy Motivation Index–Child...

How Many Participants? How Many Trials? Maximizing the Power of Reaction Time Studies

Due to limitations in the resources available for carrying out reaction time (RT) experiments, researchers often have to choose between testing relatively few participants with relatively many trials each or testing relatively many participants with relatively few trials each. To compare the experimental power that would be obtained under each of these options, I simulated...