Behavioral analysis of habit formation in modern slot machine gambling

Behavioral analysis of habit formation in modern slot machine gambling
Behavioral analysis of habit formation in modern slot machine gambling

Habit formation is a key process in contemporary models of addictive behaviors but has received limited attention in the context of gambling and problem gambling. Methods for examining habit formation and expression in relation to gambling are also lacking. In this study, 60 participants with no prior slot machine experience attended three sessions spaced 6–8 days apart, during which they played a short 200-spin session on a realistic simulation of a modern multi-line slot machine. Behavioral data were analyzed to characterize habit formation within and between sessions. Fixed-effects regressions, integrating trial- and session-level effects, assessed predictors of gambling speed (spin initiation latencies) and betting rigidity (the likelihood of switching the bet amount), as two putative markers of habit formation. Participants gambled faster and showed less variability in betting strategy as they accumulated experience in the number of trials and sessions gambled. Simultaneously, as the number of sessions gambled increased, participants showed a more pronounced tendency to slow their betting after larger wins (i.e. the post-reinforcement pause increased from session 1 to session 3). Our methods provide a basis for future research to examine habits in the context of slot machine gambling.

Introduction

In the field of gambling studies, the term ‘habit’ is often used colloquially to refer to recurrent or frequent patterns of gambling involvement. Recent studies have begun to characterize habit formation in gambling (Boffo et al., 2018; Dickerson, 1993; Griffiths, 1993; Van Timmeren et al., 2018; Wyckmans et al., 2019). In the Pathways Model of problem gambling (Blaszczynski & Nower, 2002), ‘habituation’ – which could reflect either the development of tolerance to gambling outcomes and/or the formation of habitual behavioral routines – is positioned as a key component between gambling initiation and disordered status. According to the Pathways Model, habitual processes are established gradually as the gambler experiences intermittent operant rewards (i.e. euphoria from winning), with further classical conditioning to gambling cues (e.g. lights and jingles).

In models of problem gambling, habit is presently an underspecified construct, compared to its role in contemporary, neuroscience-oriented models of drug addiction. For example, Everitt and Robbins (2005) hypothesized that habit formation is a key process in the gradual loss of control over drug seeking behaviors, in which initially ‘goal-directed’ drug taking progressively transitions to automatic and stimulus-driven behavior. Used in this way, the term ‘habit’ is operationalized in neurocognitive terms that converge on two defining features (Robbins & Costa, 2017; Wood & Rünger, 2016): 1) a gradual and learned transition to involuntary and cue-driven behavior, and 2) a progressive decoupling of the behavior from the mental representation of its consequences, which allows for persistence of the behavior despite changes in (e.g. removal of) the original reinforcer of the behavior.

As gambling behavior involves both winning and losing outcomes, neurocognitive conceptualizations of habit can be interpreted and applied to the etiology of problem gambling in a number of ways. For example, given that the house edge in gambling produces eventual losses on average, faster or more persistent gambling (e.g. loss chasing) may represent a reduced sensitivity to negative outcomes, and thus habit (Wyckmans et al., 2019). Stated differently, habit formation may be expressed in gambling as a gradual decoupling of betting from the aversiveness of losing. Alternatively, from an appetitive perspective, gambling experience may be accompanied by a devaluation of winning outcomes, such that the behavior is less controlled by the value of wins, and more by gambling-related cues and their learned associations. Cues such as images and sounds of gambling products have been shown to elicit cravings and concomitant neural activation in individuals with problem gambling (see, Brevers et al., 2019). Furthermore, gambling products are host to a wide variety of audiovisual stimuli that may condition habits through repetition during gambling sessions, such as celebratory animations, music, and sound effects generated by electronic gambling machine (EGM) wins. Losing outcomes, by comparison, ostensibly lack these features but can still present clear cues that signal the end of a round (e.g. the sight of the reel’s stopping) or elicit affective cues, such as frustration or boredom, as losses accumulate in succession.

One experimental lens for studying habit is via the effects of repeated practice or familiarity with gambling devices. Gambling practice effects have been observed as within- and between-session increases in risk-taking (e.g. expenditure) during laboratory blackjack and roulette tasks (Bednarz et al., 2013; Blascovich et al., 1973; Ladouceur et al., 1986). EGMs, in contrast, are understudied as a class of gambling product, in relation to practice effects. EGMs are widely recognized to be one of the most harmful types of gambling products (Binde et al., 2017), and these harms are thought to be underpinned by various structural characteristics including a fast speed of gambling and audiovisual feedback that may facilitate habit formation (Griffiths, 1993). Yücel et al. (2018) propose a role of operant conditioning in the establishment of stimulus-response-type gambling habits, which they frame as developing specifically within the context of EGMs. In a brain imaging study in 43 healthy participants, Shao et al. (2013) examined fMRI BOLD responses to the anticipatory (i.e. reel spinning) and outcome phases of a simple three-reel slot machine simulation. Participants were randomly assigned to either practice the task prior to their fMRI scan or complete the task for the first time during their fMRI scan. Compared to the non-practiced participants, those with prior task experience showed altered neural signals in brain reward circuitry, such that anticipatory signaling was enhanced, and the win-related activity was attenuated. Reinforcement learning models are predicated on a similar shift in dopamine cell firing from the unconditioned stimulus (e.g. food) to the conditioned stimulus (e.g. the bell; Schultz et al., 1997).

A formal investigation into habits in slot machine gambling requires further specification of the scope of behaviors to examine. By analogy to habits in substance addictions, slot machine habits might be framed as acquisition-type behaviors (e.g. choosing which slot machine to play on in the casino) or, alternatively, could pertain to behaviors occurring during gambling sessions. Gambling on multi-line slot machines involves responding by using a spin button, with the further option to vary one’s bet via either the number of lines played and/or the ‘bet multiplier’ (i.e. the number of credits per line). These betting decisions affect both the risk variance of bets (e.g. larger bets risk greater losses but are rewarded with larger wins) and level of reinforcement (selecting more lines produces more frequent feedback from ‘hits’; Barr & Durbach, 2008; Harrigan et al., 2015). Therefore, any adjustment of the bet amount becomes a behavioral variable that can be automated through conditioned responses to preceding stimuli (i.e. habit), such as audio-visual feedback from the prior bet outcome. This is consistent with a framework in which habits are expressed as increased behavioral rigidity in contexts with varying outcomes (Smith & Graybiel, 2014). Some indirect evidence suggests that regular gamblers tend to adopt a more-rigid betting strategy. In a telephone survey study, EGM gamblers’ qualitative reports on their betting strategy tended to identify a common, baseline pattern of wagering a low number of credits on the maximum number of lines, which has been termed a ‘mini-max’ style (Livingstone & Woolley, 2008). In a large dataset tracking real-world betting behavior of regular slot machine gamblers, Salaghe et al. (2020) found that gamblers tended to alter their bets only infrequently, and usually in response to successive wins.

A second behavioral marker that is relevant to habit formation is the speed with which a behavioral sequence is initiated and finished (Smith & Graybiel, 2014). In the context of slot machine gambling, individual wagers can be viewed as discrete choices with an observable ‘spin initiation latency.’ These processes control the speed of slot machine gambling and might also be susceptible to influence from the habit system. Individuals with gambling problems tend to prefer faster gambling products, and regular gamblers place bets at a faster pace than non-regular gamblers (for a review, see, Harris & Griffiths, 2018). Importantly, spin initiation latencies during slot machine gambling are further influenced by in-game cues, as evidence for an underlying stimulus-response mechanism. For example, winning outcomes are associated with delayed initiation of the subsequent bet, termed the ‘post-reinforcement pause’ (Delfabbro & Winefield, 1999). These latencies are highly sensitive to valuation processes, to the extent that they scale proportionately with the size of the win (Dixon et al., 201420182013). Conversely, gambling losses, which make up a greater proportion of outcomes than wins, prompt a faster initiation of the next gamble, compared to neutral outcomes (Verbruggen et al., 2017).

This study sought to develop a slot machine gambling procedure that is suitable for studying behavioral markers of habit formation and expression. For this goal, we required a higher degree of game-level control (over profits and losses) and behavioral surveillance than is viable using authentic EGMs in a lab environment (Ladouceur et al., 2003; Stewart et al., 2002). Using a highly realistic slot machine simulation, our aim was to assess how practice effects are expressed within and across three sessions of gambling, in novice participants without prior experience of playing slot machines. Based on the criteria for habits in Smith and Graybiel (2014), we focus on two behavioral markers as candidates for habit formation: 1) faster action initiation and 2) increased rigidity in betting strategy. Viewed from the lens of slot machine gambling, these criteria are analogous to an increasing speed of gambling and fewer bet changes, respectively. As putative markers of habit formation, we first hypothesized that participants would gamble faster and be less likely to change their bet, as they gained experience with slot machine gambling.

We further hypothesized that with increasing experience, any habit expression would be increasingly influenced by game-related events. Stated differently, we anticipated that the predictive effect of game-related events on our behavioral markers would strengthen with increasing practice. Because winning and losing outcomes are inherently different with regard to the presence of certain events, such as monetary reward and celebratory audio-visual feedback, our predictions according to this hypothesis were two-fold. First, we predicted that following win outcomes, the size of the win would become increasingly predictive of spin latencies and bet changes. Thus, for spin latencies, an accumulation of slot machine experience may be expressed as an amplification of the post-reinforcement pause effect (i.e. a slowing in gambling speed following wins), which is distinct from our first hypothesis that the overall speed of gambling will get faster with practice. For losing trials, we predicted that successive losses (i.e. losing ‘streak length’) would become increasingly predictive of both spin latencies and bet changes.

Materials and methods

Participants

Participants were recruited via online campus advertisement and from a pool of undergraduate students seeking to attain course credit through research participation. Advertisements indicated that the study involved slot machine gambling with an opportunity to receive a bonus up to $15, and participants recruited from advertisements were offered $10 remuneration per hour of participation. Participants were screened for eligibility prior to participating. Eligible participants were 19 years of age or older (the legal age of gambling in the jurisdiction), had no prior direct experience with slot machine gambling (land-based or online), had not experienced problems with gambling, had normal or corrected vision, and were not taking medications for mental health problems. In total, 60 individuals participated in this study. Following study completion, four participants were excluded who initially endorsed the inclusion criteria but indicated prior slot machine experience during the first test session. Thus, 56 eligible participants (Mage = 21.89, SDage = 3.61; 37 women) completed the three sessions. We aimed to space the sessions at intervals of 6–8 days. Two participants attended one session outside of this interval but were retained in the dataset. One participant missed their final session and therefore contributed data for only two sessions. All participants provided informed consent before the commencement of the study protocol, which was approved by the University of British Columbia Behavioral Research Ethics Board (H17-03122).

Materials

Gambling experience

Forms of gambling involvement over the past 12 months were assessed using the first section from the Canadian Problem Gambling Index (CPGI; Ferris & Wynne, 2001). We screened participants for exclusion based on any endorsement of slot machine experience, which was re-iterated in an open-ended prompt inquiring about any prior slot machine experience. Participants also completed the 9-item Problem Gambling Severity Index (PGSI) section of the CPGI, which is the gold-standard screening tool for problem gambling (see Dowling et al., 2018). To be eligible, participants were required to score below 8, the cutoff for likely problem gambling. Most participants (n = 39) attained an initial PGSI score of 0, placing them in the non-problem gambler category, while 13 participants scored 1 or 2 (low-risk) and 4 scored 3–7 (moderate risk).

Slot machine simulation

To measure behavioral changes in novice slot machine gamblers, we designed a realistic multi-line slot machine simulation, called ‘Cleo’s Gold,’ using Custom Slots Framework 1.4 with Unity 2017.3.1 (Lafrontier, 2018; Unity Technologies, 2017). This simulation was run within a genuine slot machine frame that included two vertically aligned monitors, a button panel, and an (inactive) cash bill acceptor (see Figure 1). The participant was seated on an authentic casino stool and interacted with the game via the slot machine button panel. The game used an ancient Egyptian theme; for example, icons were ankhs and sphinxes, and auditory feedback for bonus features played Egyptian themed music. This artwork was obtained commercially (Envato Pty Ltd., Melbourne, VIC) and used under license. Structurally, the game emulated multi-line slot machines currently found in North American casinos: the five-reel display allowed up to 40 possible line combinations, meaning that it was possible for the participant to attain symbol matches along a maximum of 40 different paths or ‘lines’ across the reels. The button panel allowed a choice between five line combinations (1, 5, 9, 20, or 40 lines) and five bet increments (1 to 5 credits per line), thus allowing a 200 credit maximum wager (i.e. 5 credits per line on 40 lines). There was a larger ‘repeat bet’ button situated on the right-hand side. The simulator recorded the bet configuration, the timings of the spin initiation and the various game event durations, and the monetary outcome for each trial. To reduce outcome variability across participants and across gambling sessions, we generated three symbol sequences (i.e. arrangements of symbols appearing on the reels) of 200 trials each, which participants received in a counterbalanced order (see Supplementary S1).

Figure 1. The ‘Cleo’s Gold’ slot machine simulation task. Top: a screen capture from the game’s main graphical display that depicts one segment of a multiline win animation involving the ‘wild’ pyramid symbol. The artwork was from a commercial graphics set (Envato Pty Ltd., Melbourne, VIC), used under license. Touch input was disabled on the display. Bottom: an illustration of the button panel configuration with which participants gambled. Participants configured the number of lines (top button row) and/or the bet amount per line (bottom button row), or used the repeat bet button, for each spin.

 

Procedure

Participants attended the laboratory on three occasions spaced 6–8 days apart. Upon arrival at the first session, following the consent procedure, the participant completed the CPGI and further questionnaires assessing affect, trait characteristics, and gambling attitudes. The participant then completed the simulated slot machine either before or after a second learning task (that did not involve gambling stimuli or financial incentives). For the slot machine simulation, the participant was directed to the Cleo’s Gold game that was situated in a bank of four different slot machines (the other three machines were authentic machines). The experimenter provided verbal instructions on how to place wagers using different line and bet increments, and gave the participant a card displaying the game’s payout schedule and possible line combinations. The bonus structure was explained that winnings exceeding the $40 endowment would be honored up to a maximum of $15. The participant was then able to begin their session and gamble at their own speed, while the experimenter sat out of view. After the 200th trial, the slot machine displayed a ‘call attendant’ notification to mask the inauthenticity of the game, as participants were not informed that the game was programmed until the debriefing on the third test day. The participant relocated to an area not in view of the slot machines, to complete measures of immersion and affect, and were then paid any cash bonuses.

Procedures for sessions two and three were similar to session one; the participant repeated the PGSI on each session to ensure eligibility and completed a gambling attitude measure (see Supplementary S2). The participant was then seated at the slot machine simulator. The experimenter provided condensed instructions as a reminder, and the participant then began the session. Upon completion of the third session, the experimenter debriefed the participant on the specific aims and measures in the experiment. The experimenter provided verbal and written explanation of the mathematics of modern slot machines, and the risks of slot machine gambling, such that participants would leave our laboratory better informed and educated about gambling than on their arrival. Debriefing further emphasized differences between the simulated slot machine and authentic gambling products, namely that authentic slot machine gambling would be likely to cause the gambler to incur harms, such as increasing financial losses with continued gambling. The training of research assistants included specific training on the debriefing procedure and was overseen by the first author (MAF), a Clinical Psychology graduate student who was on site or contactable during all testing sessions. All participants were given a written list of mental health and gambling help resources in British Columbia. Debriefing information and materials were provided electronically in cases of discontinuation before the third session.

Data analysis

Our analysis focused on two behavioral parameters: the speed of gambling and bet switching. We analyzed these data at the trial level, with each trial (i.e. spin) as the lowest level of data, nested within gambling session and then within the participant. The speed of gambling was expressed as the participant’s ‘Spin Initiation Latency’ at the trial level. This was recorded for each trial as the interval between the end of a spin (loss outcomes) or the offset of any ensuing feedback (on trial n) and the moment that the next bet (trial n + 1) was placed. For Bet Switching, each trial was binary coded according to whether the participant changed or repeated their previous bet amount on trial n + 1 (coded 1 and 0, respectively). This would include changes to both the ‘line style’ and ‘bet multiplier’, and we acknowledge that these options can exert differential effects on the reinforcement profile (Barr & Durbach, 2008).

Our analytic method applied linear and logistic ‘fixed effects’ regression to the analysis of Spin Initiation Latency and Bet Switching, respectively (Allison, 2005; Chu et al., 2018; Limbrick-Oldfield et al., 2022). Fixed-effects regression substitutes a unitary model intercept with the participant identifier variable entered as a categorical predictor. This method can accommodate unbalanced data and, unlike mixed-effects methods, produces estimates of longitudinal (i.e. ‘within’) effects that are unbiased by unmodelled, stable person-level confounds (Bell et al., 2019; Allison, 2005). These characteristics of the fixed-effects approach are well suited to our data. The win and loss frequencies are influenced by outcome sequence and betting strategy, especially at individual levels of win magnitude and losing streak length, and thus these data are inherently unbalanced and likely to be influenced by unmodelled participant characteristics (Chu et al., 2018; Limbrick-Oldfield et al., 2022; Murch et al., 2020).

To test our first hypothesis, that participants would gamble faster and be less likely to change their bet, we estimated models for each dependent variable (Spin Initiation Latency, Bet Switching) that integrated wins and losses, controlling for the outcome on the previous trial. Trial outcomes were classified according to whether the trial was a Loss, a ‘Loss Disguised as a Win’ (an amount lower than the bet amount is ‘won’, resulting in a net loss; Dixon et al., 2010), or a Win (exceeded the bet amount). Next, to test our hypothesis that the habit markers would be subject to an increasing influence of game-related events, we computed four models, which distinguished two dependent variables and two trial outcomes (i.e. Spin Initiation Latency after a Win, Spin Initiation Latency after a Loss, Bet Switching after a Win, Bet Switching after a Loss). In these models, we binarized ‘winning’ trials (as trials on which any credits were awarded and thus winning audio-visual feedback was delivered) and losing trials, which produced no such audio-visual feedback, in order to model the effects of win magnitude and losing streak length, as effects that were exclusive to these two outcome types, respectively (Limbrick-Oldfield et al., 2022).

For all models, within session effects were modeled by Trial Number (centered at trial 100) and between session effects by Session (centered at session 2), and we further modeled their interaction term. Note that Session is entered as a continuous predictor, but as it has only three levels, it is effectively a two-level factor comparing Session 1 and Session 3. Control variables were the participant’s current Credit Balance (taken at the end of each trial) and the Sequence version (A, B, and C) entered as a categorical predictor with Sequence A set to the reference category. The Spin Initiation Latency models additionally controlled for the effect of Bet Switching (0 = not switched, 1 = switched) because this deliberative action would inherently increase the initiation time. For the two models considering only winning trials, the credit payout (i.e. the ‘Win Magnitude’) preceding the current trial was entered as a predictor, in addition to two- and three-way interactions with Trial Number and Session. Similarly, the two models for losing trials included a predictor variable indicating the number of losses preceding the current trial (i.e. the ‘Loss Streak’), and corresponding interaction terms. Credit Balance, Win Magnitude, and Loss Streak were grand mean centered to improve the interpretability of the model estimates. To test our predictions on the increasing influence of game-related cues on the putative markers of habit, we examined the two-way interactions between the time course (i.e. Trial Number and Session) and the Loss Streak and Win Magnitude variables.

For all participants, we removed the first and last trials in each session, as well as trials occurring before (n – 1), within, and after (n + 1) bonus rounds, which run automatically without the participant’s involvement. Several methods were used to reduce the influence of skew and outliers in the analyses. We removed trials with spin latencies exceeding 10 seconds, which removed 1.17% of spin latencies overall (1.56% of wins; 1.01% of losses; Limbrick-Oldfield et al., 2022). On sessions one to three, 2.45%, 0.84%, and 0.22% of spin latencies were discarded. Data for Spin Initiation Latency, Loss Streak, and Credits Won remained positively skewed and were thus log transformed. All models were estimated and tested using robust, MM-type regression (Maechler et al., 2021). This type of regression estimation algorithmically adjusts the influence (i.e. the ‘weight’) of outlying data points to reduce model bias and performs better than ordinary least-squares estimation when model assumptions are violated (Field & Wilcox, 2017). To ensure that the robust estimates were not biased by systematic de-weighing of the data points, the applied weights were inspected visually. Reasonableness of model assumptions was assessed by visual inspection of the residuals. All analyses were conducted using R 4.0.5 (R Core Team, 2021). Alpha was set to .05. Data and R analysis scripts from this study are publicly archived (https://doi.org/10.5683/SP3/KWJTFU).

Results

Descriptive statistics

Descriptive statistics for each of the gambling sessions are presented in. Four participants interrupted one trial by temporarily ‘cashing out’ the machine (see Figure 1), and the four impacted trials were removed from the dataset. There was incomplete trial data for nine gambling sessions, due to participants exhausting their endowment before completing the 200 trials (average duration: 154.89 trials; 3, 1, and 5 occurrences for sessions 1–3, respectively). Overall, nearly three-quarters of trial outcomes were losses, with the remainder either paying back a proportion (i.e. losses disguised as wins) or more than the bet amount (i.e. wins). The proportions of trial outcomes produced by our slot machine simulation were in keeping with those of commercial slot machines

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Posts