The selective serotonin reuptake inhibitor sertraline alters learning from aversive reinforcements in patients with depression

evidence from a randomized controlled trial

  • Malamud, Jolanda
  • Lewis, Gemma
  • Moutoussis, Michael
  • Duffy, Larisa
  • Bone, Jessica
  • Srinivasan, Ramya
  • Lewis, Glyn
  • Huys, Quentin J. M.
Psychological Medicine 54(10):p 2719-2731, July 2024. | DOI: 10.1017/S0033291724000837

Introduction

Anxiety and depression are the most common mental health problems, often occurring together and constituting a significant fraction of the global disease burden (Bandelow et al., ; Evans-Lacko et al., ; Hirschfeld, ). Selective serotonin reuptake inhibitors (SSRIs) are commonly prescribed as first-line pharmacological treatments for both depression (Bogowicz et al., ; Kendrick, Stuart, Newell, Geraghty, & Moore, ) and anxiety disorders (Garakani et al., ). However, how SSRIs work beyond their initial pharmacological action on the serotonin transporter remains unclear (Harmer, Duman, & Cowen, ).

Neuropsychological models propose that antidepressants, including SSRIs, may alter cognitive processing, leading to improvements in depressive and anxiety symptoms (Harmer, Goodwin, & Cowen, ). Reinforcement learning provides a framework for investigating links between cognitive and biological processes and hence the effect of SSRIs on cognition (Huys, Browning, Paulus, & Frank, ; Lan & Browning, ; Maia & Frank, ). Preclinical and experimental research has established that several cognitive functions relevant to the etiology of anxiety and depression are sensitive to SSRIs (e.g. Geurts, Huys, den Ouden, & Cools, ; Guitart-Masip, Duzel, Dolan, & Dayan, ; Harmer, ; Michely, Eldar, Erdman, Martin, & Dolan, ; Michely, Eldar, Martin, & Dolan, ; Roiser, Elliott, & Sahakian, ; Roiser et al., ). However, there is little evidence tying these experimental effects of SSRIs on cognition to improvement in symptoms in clinical settings as only a few clinical randomized controlled trials (RCTs) have evaluated candidate mechanisms to explain treatment effects (Ahmed et al., ; Cuthbert & Insel, ; Morris et al., ; Pizzagalli et al., ). Evaluations in the context of RCTs comparing SSRIs and placebo provide a strong test of whether specific cognitive or learning processes are the mechanisms through which SSRIs alleviate symptoms of anxiety and depression.

The present study investigates whether SSRIs improve symptoms by modulating reinforcement learning processes, specifically aversive Pavlovian control. Aversive Pavlovian control refers to the automatic, stereotyped inhibition of actions in the face of negative expectations (Bolles, ; Dayan, Niv, Seymour, & Daw, ), an effect that can be robustly observed in humans using neurocognitive probes (Boureau & Dayan, ; Guitart-Masip et al., ; Huys, Moutoussis, & Williams, ). Aversive Pavlovian control is a promising candidate mechanism for the treatment of SSRIs. It is sensitive to serotonergic functioning in animal (Abela et al., ; Amo et al., ; Doya, Miyazaki, & Miyazaki, ; Ohmura, Tanaka, Tsunematsu, Yamanaka, & Yoshioka, ) and preclinical studies (Crockett, Clark, Apergis-Schoute, Morein-Zamir, & Robbins, ; Crockett, Clark & Robbins, ; Geurts, Huys, den Ouden, & Cools, ; Hebart & Gläscher, ). Moreover, aversive Pavlovian control is associated with symptoms of depression and anxiety in clinical samples (Huys et al., ; Nord, Lawson, Huys, Pilling, & Roiser, ) and in general population samples with anxiety traits (Mkrtchian, Aylward, Dayan, Roiser, & Robinson, ). Influential reviews have highlighted the prominence of inhibition in response to negative expectations in depression (Roiser et al., ) and of avoidance driven by negative expectations in anxiety (LeDoux, Moscarello, Sears, & Campese, ). Modifying aversive Pavlovian control is hence clinically promising (Huys, Russek, Abitante, Kahnt, & Gollan, ; Martell, Dimidjian, & Herman-Dunn, ).

In terms of underlying mechanisms, computational models have proposed formal relationships between rumination, acute reductions in central serotonin levels, and the attenuation of aversive Pavlovian control (Dayan & Huys, , ; Huys et al., ; Robinson et al., ). At the neural level, the subgenual anterior cortex has been implicated in aversive Pavlovian control in research involving primates (Amemori & Graybiel, ) and healthy volunteers (Lally et al., ). This brain region is also recognized for its involvement in anxiety, as demonstrated in a study of healthy volunteers with contextual fear (Alvarez, Chen, Bodurka, Kaplan, & Grillon, ; Hasler et al., ) and it has been linked to depression in both preclinical (Drevets, Savitz, & Trimble, ; Ramirez-Mahaluf, Perramon, Otal, Villoslada, & Compte, ) and depression treatment studies (Mayberg et al., ).

Appetitive Pavlovian control may also be affected. In both clinical and subclinical depression samples, there have been reports of blunted reward responses (Bylsma, Morris, & Rottenberg, ; Eshel & Roiser, ; Halahakoon et al., ; Pizzagalli, Jahn, & O'Shea, ; Steele, Kumar, & Ebmeier, ), possibly due to reduced specificity (Huys et al., ; Nord et al., ). Serotonergic manipulations have also shown effects on appetitive Pavlovian processes in animals (Cohen, Amoroso, & Uchida, ) and healthy volunteers (Michely et al., ).

As such, Pavlovian control may be a candidate mediator of the effect of SSRIs on anxiety and depression. Here, we report a test of this hypothesis in the context of the PANDA RCT (Lewis et al., ). This trial compared sertraline to placebo for the treatment of depression in primary care in the UK (Duffy et al., ; Lewis et al., ; Salaminios et al., ). PANDA found no evidence that sertraline reduced depressive symptoms to a clinically meaningful extent at 6 weeks, with only a weak effect at 12 weeks. However, they found evidence that sertraline reduced anxiety at 6 and 12 weeks. We measured Pavlovian inhibition and a number of other reinforcement learning processes during this trial using computational modeling of the affective Go/NoGo task (Guitart-Masip et al., ). This is a well-established learning paradigm in which computational analyses allow appetitive and aversive Pavlovian processes to be measured (Guitart-Masip et al., ).

We pre-registered an analysis plan investigating five main hypotheses (osf.io/7q8v2). The primary analyses aimed to test whether treatment with the SSRI sertraline alters aversive Pavlovian control and whether aversive Pavlovian control is related to anxiety, i.e. whether Pavlovian inhibition might mediate the effect of sertraline on anxiety. We also examined the relationship between appetitive Pavlovian biases and depressive symptomatology. Overall, task compliance was poor, and the primary hypotheses were not supported. However, exploratory analyses did reveal that higher changes in aversive Pavlovian bias early on were linked to more severe depression after 12 weeks. Additionally, there was an effect of SSRI treatment on the aversive learning rate at week two and an association between learning from losses and anxiety.

Methods

Ethics

The National Research Ethics Service Committee, East of England – Cambridge South approved the study (ref: 13/EE/0418). The Medicines and Healthcare Products Regulatory Agency gave clinical trial authorization. Written informed consent was obtained from each participant before the study.

Participants

We present secondary analyses of data acquired in the context of the PANDA trial. PANDA was a randomized, double-blind, placebo-controlled pragmatic study investigating the clinical effectiveness of sertraline on depressive symptoms as the primary outcome.

Patients (aged 18–74 years) were recruited from 179 primary care surgeries in four UK sites (Bristol, Liverpool, London, York). The critical entry criterion was that general practitioners (GPs) and/or patients were uncertain about the potential benefits of an antidepressant. No lower or higher thresholds were set on depression severity or duration. The study aimed for a diverse participant pool by including doctors with varied decision-making approaches, promoting clinical equipoise, and capturing the spectrum of depressive symptom severity. The exclusion criteria were: unable to understand or complete study questionnaires in English; antidepressant treatment in the past eight weeks; comorbid psychosis, schizophrenia, mania, hypomania, bipolar disorder, dementia, eating disorder, or major alcohol or substance abuse; and medical contraindications for sertraline.

Patients were randomized to sertraline or placebo, stratified by severity, duration, and site, and followed up after 2, 6, and 12 weeks. For the first week, patients received one capsule (50 mg sertraline or placebo) a day. From week two onwards, they took two capsules per day, either containing 100 mg of sertraline or placebo, for up to 11 weeks. Medication could be increased to 150 mg in consultation with the local principal investigator in cases of non-response after six weeks. The study was double-blind: study patients, care providers, and all members of the research team were blinded to the study treatment allocation (Salaminios et al., ).

Measurements

The Go/NoGo task (Fig. 1a) was designed to study Pavlovian appetitive and aversive influence on choice by crossing action (go v. nogo) and valence (rewards v. losses; Guitart-Masip et al., ). Participants were verbally instructed that each fractal would lead to a more favorable outcome with either go or nogo, but that outcomes were probabilistic (cf. Fig. 1 for detailed task description). Each task administration employed a different fractal set. Fractal sets were randomized across participants and assessment timepoints. The Go/NoGo task was assessed at baseline, at two weeks (follow-up 1), and at six weeks (follow-up 2), but it was not part of the 12 weeks assessment (follow-up 3). The Generalized Anxiety Disorder Assessment (GAD-7; Spitzer, Kroenke, Williams, & Löwe, ), the Patient Health Questionnaire-9 (PHQ-9; Kroenke, Spitzer, & Williams, ), and the Beck Depression Inventory (BDI; Beck, Steer, & Brown, ) were completed at baseline and every follow-up. Several baseline variables were acquired (cf. Table 1).

Open multimedia modal

Figure 1

Open multimedia modal

Task and performance. (a) The Go/NoGo task consisted of four different conditions. On each trial one of four possible fractal images was shown. Actions were required in response to a circle that followed the fractal image after a variable delay. After a brief delay, the outcome was presented: a green upward arrow for a win, a red downward arrow for a loss, or a horizontal bar for a neutral outcome. In the go-to-win condition, pressing the key (‘go’) led to a reward with 80% and a neutral outcome with 20% probability, vice versa if they did not press the key (‘nogo’). In the go-to-avoid condition, pressing the key (‘go’) led to a neutral outcome with 80% and a loss with 20% probability. In the nogo-to-win, not pressing the key (‘nogo’) led to a reward with 80% and a neutral outcome with 20% probability. In the nogo-to-avoid condition, not pressing the key (‘nogo’ response) led to a neutral outcome with 80% and a loss with 20% probability. Each task administration consisted of 96 trials, with 24 trials per condition. (b) Mean percentage of correct responses in each of the four conditions. Black dots depict participants and black error bars depict standard deviation of the mean (S.D.). Dashed lines depict chance level. Post hoc comparisons were implemented by means of repeated measures t tests showing a significant difference in accuracy between Pavlovian congruent (got to win and nogo to avoid) and incongruent conditions (go to avoid and nogo to win). Significance * ≤ 0.05, ** ≤ 0.01, *** ≤ 0.001, **** ≤ 0.0001.

Computational models

Previously published computational models for this task (Guitart-Masip et al., ; Mkrtchian et al., ; Moutoussis et al., ; Scholz et al., ) provide formal, quantitative descriptions of the evolution of decisions over the course of learning during the task. The core parameters of interest in the models are the Pavlovian parameters. These capture appetitive Pavlovian influences through the extent to which participants automatically emit ‘go’ responses when faced with reward stimuli, and aversive Pavlovian inhibition through the extent to which they automatically emit ‘nogo’ responses when faced with loss stimuli. The Pavlovian processes are separate from instrumental learning processes, which emit ‘go’ and ‘nogo’ according to which of the two actions is more likely to lead to the better outcome. Other parameters include reward and loss sensitivity, learning rates, irreducible noise, and an overall ‘go’ bias.

Data validation

To evaluate whether the existing data was in principle sufficient to assess the key hypotheses, and to provide an informative a-priori estimate of power, two authors (J. M. and Q. J. M. H) were provided with blinded access to the behavioral task data only, but without access to group allocation, demographics, or measures of symptoms. These authors fitted different reinforcement learning (RL) models (for a list of the models, see Supplementary Materials B.1 RL Models) as described previously in the literature (cf. Huys et al. () and Supplementary Materials B.2 Model Fitting Procedure & B.3 Model Comparison). All datasets of the study were combined, disregarding within-subject information (i.e. treating repeated sessions as independent task assessments). In the supplements, we report the recoverability and reliability of the parameters (cf. Supplementary Materials Fig. B.3 and B.4).

Models were fitted separately to the data and compared using the integrated Bayesian Information Criterion (iBIC; Figure 2a) at the group level, where the individual likelihoods were first integrated over the individual parameters using a sampling procedure and then summed over all individuals. The most parsimonious model included learning rates, outcome sensitivities, and Pavlovian biases, all separated into rewarding and punishing contexts. Figure 2c shows that simulated data captured the empirical data qualitatively. Hence, standard models of the task are able to parametrically capture the variability of behavioral performance in the task across individuals and sessions on a trial-by-trial level.

Open multimedia modal

Figure 2

Open multimedia modal

Computational modeling of the Go/NoGo task. (a) shows the differences in integrated Bayesian Information Criterion (iBIC) scores for all models tested compared to the most parsimonious model (red star), where a smaller iBIC score indicates a more parsimonious model. All models are modified Q-learning models (Rescorla Wagner – RW) with two pairs of action-values (‘go’ and ‘nogo’) for each stimulus. The y-axis shows the number of free parameters for each model. The most parsimonious model includes separate learning rates for rewards and punishments, win and loss sensitivities, appetitive and aversive Pavlovian biases, irreducible noise, and a constant bias factor added to the action-value for ‘go’. (b) shows the histogram of the difference between the integrated loglikelihood (iLL) of the most parsimonious model and the iLL of the random baseline model. Datasets were declared as informative if the data was more than three times more likely to have occurred under the most parsimonious model (vertical red dashed line). (c) The four subplots show the average learning curves in blue (averaged over participants; solid line) for each condition separately. Each row of the raster images shows the choices of each participant. ‘Go’ responses are depicted in white, and ‘nogo’ responses are depicted in grey. Additionally, the average ‘go’ probability was separated into included datasets (orange) and excluded datasets (green). The solid line refers to empirical data and the dashed line to simulated data from the most parsimonious model. Informative datasets (orange) show that participants, on average, seem to learn over trials, which can be captured qualitatively well by the most parsimonious model. In contrast, the average ‘go’ probability of non-informative/excluded datasets (green) appears to have no temporal relation, hence showing no learning over trials. Further, it is well captured by the random baseline model.

In the Go/NoGo task, non-informative responses (e.g. always emitting the same response) cannot provide information about Pavlovian or other cognitive processes and therefore do not inform parameter estimates. Whether the data of a particular task run are meaningful can be evaluated formally by examining whether a model encompassing the core processes provides a more parsimonious account of the behavioral data than a random baseline model. In other words, to examine whether the observed behavioral data meaningfully constrained the model parameter estimates, we compared the integrated likelihood of the most parsimonious model to the integrated likelihood of a random baseline model for each dataset from each individual at each session. The integrated likelihood integrated over an individual parameters refers to the likelihood of the data given the group-level hyperparameters. A task run was deemed as missing if the integrated likelihood of the random baseline model was more than three times higher than that of the most parsimonious model at the group level (Fig. 2b). Note, the model selection process conducted only on the informative task runs yielded consistent results with those obtained on the complete dataset (cf. Supplementary Materials Fig. B.2).

The parameters for each informative task run were extracted from the most parsimonious model to test the hypotheses.

Preregistration

The key hypotheses and analyses were pre-registered on OSF (osf.io/7q8v2; cf. Supplementary Materials Table D.4).

Statistical analyses

Predictors of missing and non-informative data at baseline were identified using a univariate logistic regression. Significantly related baseline variables were used as covariates in all further analyses.

To investigate drug effects, we employed a mixed-effects linear regression (1) using group allocation as the independent variable and the parameter estimate (e.g. aversive Pavlovian bias as the dependent variable) controlling for stratification variables (baseline CIS-R total score in three categories, duration of depressive episode in two categories, and site) and including random intercepts. We reported mean differences (MD), 95% confidence intervals (CI), and the corresponding p values (p).

Next, we examined whether parameter estimates relate to depressive or anxiety symptoms using a mixed-effects multiple linear regression (2) with the parameter estimate as independent variable and log-transformed symptom scores (e.g. GAD-7 total score) as dependent variable. Random slopes and intercepts per individual were included. We controlled for group allocation and stratification variables. We reported regression coefficients (β), 95% confidence intervals (CI), and the corresponding p values (p).

For both analyses, we performed separate mixed-effects models for baseline and week two, baseline and week six and over all three time-points. To investigate a potential drug time interaction, we additionally performed a regression including a group-time interaction. The group variable in the mixed-effects models was coded [0,1,1] for a patients allocated to sertraline and [0, 0, 0] for a patients allocated to placebo. Both groups have a 0 at baseline because they were unmedicated at that time.

To investigate whether a baseline parameter estimate predict treatment outcome, we performed a simple linear regression predicting symptoms core at the at week 12 controlling for symptoms at baseline, group allocation and stratification variables.

As an exploratory analysis we examined whether early change in aversive Pavlovian bias (week 2 – baseline) relates to log-transformed BDI total score at week 12 using a simple linear regression including an interaction effect between group-allocation and Pavlovian bias.

Exploratory analyses repeated the analysis type 1 above for each individual parameter and used Bonferroni-correction to correct for testing multiple parameters (p ≤ (0.05/8) ≤ 0.00625).

Additionally, we conducted simple linear regression examining group differences in parameter slopes (early change = week two – baseline; late change = week six – week two). We also repeated analysis type 2 for each of the parameter estimates and the three psychological measures (GAD-7, PHQ-9, BDI) and used Bonferroni-correction to correct for testing multiple parameters (p ≤ (0.05/8 × 3) ≤ 0.002).

Finally, to assess test–retest reliability we calculated Pearson correlation of individuals' parameters between the different time points and employed intra-class correlation coefficients (ICCs; McGraw & Wong, ) using the informative data (cf. Supplementary Materials B.6 Test-Retest Reliability).

Results

A total of 655 patients were recruited and randomly assigned to sertraline (326, 50%) and placebo (329, 50%). Two patients in the sertraline group did not complete a substantial proportion of the baseline assessment and were excluded. Additionally, 25 patients (9 from the sertraline group and 16 from placebo) did not complete the Go/NoGo task at any time-point. This left 628 participants (315 sertraline and 313 placebo) for analyses (cf. Fig. A.1 in Supplementary Materials). Task data for seven patients at baseline, 99 patients at 2 weeks, and 145 patients at 6 weeks were missing. Missing follow-up data were more common in participants who had higher baseline depressive and anxiety symptoms, financial difficulties, were from ethnic minorities and were recruited from London (cf. Supplementary Materials Table E.5). Missing data did not differ statistically by treatment allocation.

Basic task characteristics

Examination of the average percent correct response per condition showed the typical interaction pattern characteristic of Pavlovian inference found in previous studies (Guitart-Masip et al., ; Mkrtchian et al., ; Moutoussis et al., ; Scholz et al., ) at all measurement points (Fig. 1b). Performance was better in Pavlovian congruent (go to win and nogo to avoid) than incongruent (go to avoid and nogo to win) conditions (|t| ∈ [4.65, 16.70], p < 0.001). There were no differences in average performance between patients allocated to sertraline and patients allocated to placebo (|MD| ∈ [0.00, 0.03], p > 0.05).

Computational modeling results

Overall, 747 (46%) task runs did not contain interpretable and informative behavioral data. Variables associated with non-informative behavior were higher age, lower education, and past antidepressant use. At week 2 non-informative task runs (N = 230, 43%) were more likely in patients who were allocated to the sertraline group (57%, X2 = 7.06, p = 0.008). In addition, baseline anxiety score, depression severity, and employment status were predictive of non-informative behavior at week 6 (cf. Supplementary Materials Table E.6). For all further analyses we focused on the 886 informative task runs from 435 patients (66% of those originally randomized) and adjusted for significant predictors of non-informative data as covariates. Characteristics of the remaining sample according to the study arm are shown in Table 1. Baseline characteristics of the sample were not statistically distinguishable between treatment groups.

The effect of sertraline on anxiety remained significant in the smaller included sample (week 6: MD = −0.1, CI[ − 0.17 to − 0.03], p = 0.005; week 12: MD = −0.12, CI[ − 0.17 to − 0.06], p ≤ 0.001; over time: MD = −0.08, CI[ − 0.12 to − 0.04], p ≤ 0.001). This is important since our preregistered hypotheses were developed under consideration of this effect.

Preregistered hypotheses

The preregistered hypotheses were not supported (Table 2): there was no evidence that the aversive Pavlovian inhibition was affected by sertraline (Fig. 3a and b); that aversive Pavlovian inhibition was related to anxiety symptoms; that the baseline aversive Pavlovian bias was predictive of treatment response; that the appetitive Pavlovian bias was associated with depression or that the reward sensitivity was related to anhedonia.

Open multimedia modal

Figure 3

Open multimedia modal

Effects of sertraline on RL parameters. (a) Shows the aversive Pavlovian bias at baseline and at the follow-ups separated into drug groups (blue, left = placebo; red, right = sertraline). (b) Shows the change in aversive Pavlovian bias between sessions separately for the drug groups. (c) Early changes in the aversive Pavlovian bias predict treatment outcome. This figure shows the relation between the change from baseline to week two in the aversive Pavlovian bias and log-transformed BDI total score (only of participants who had an informative task run at baseline and week 2). In blue the placebo group and in red the sertraline group. An interaction effect was observed between group and early change in the aversive Pavlovian predicting depression at 12 weeks driven by a significant association between the early change and log-transformed BDI total score at 12 weeks. (blue, left = placebo; red, right = sertraline). (d) Shows the loss learning rate at baseline and at the follow-ups separated into drug groups (blue, left = placebo; red, right = sertraline). (e) Shows the change in loss learning rate between sessions separately for the drug groups. Significance * ≤ 0.05, ** ≤ 0.01, *** ≤ 0.001, **** ≤ 0.0001.

Exploratory analyses

Exploratory analyses of a subsample with better test–retest correlation, and of a subsample with low symptoms did not support the pre-registered hypotheses (cf. Supplementary Materials F Subsample Analyses).

Two sets of results in the exploratory analyses are noteworthy. The first relates to early change in the aversive Pavlovian bias. The slope of the aversive Pavlovian bias between baseline and week two was positively related to depressive symptoms at week 12 (log-transformed PHQ9 total score: β = 0.06, CI[0.0 − 0.11], p = 0.044; log-transformed BDI total score: β = 0.07, CI[0.01 − 0.13], p = 0.016). A larger increase in aversive Pavlovian bias was associated with more severe subsequent depressive symptoms. Furthermore, the BDI model revealed an interaction between group allocation and early change in the aversive Pavlovian bias (β = 0.14, CI [0.02–0.26], p = 0.024; Figure 3c). That is, early change in aversive Pavlovian bias was more strongly related to BDI scores at week 12 in the sertraline group (β = 0.14, CI[0.05 − 0.23]), than in the placebo group (β = 0.02, CI[ − 0.03 to 0.07]). However, note that sertraline had no effect on the early change in aversive Pavlovian bias (MD = −0.08, CI[ − 0.32 to 0.15], p = 0.49). The second set of findings relates to the speed at which participants adapted behavior following losses (the loss learning rate). There was an effect of sertraline on the loss learning rate at week 2 (MD = 0.6, CI[0.22 − 0.97], p = 0.002; Figure 3d). The sertraline group learned faster from losses at week 2 than the placebo group. Early change in loss learning rate (week two – baseline) was higher in the sertraline group (MD = 0.75, CI[0.18 − 1.3], p = 0.009; Figure 3e), whereas later change (week six minus week two) was lower in the sertraline group (MD = −0.72, CI[ − 1.27 to − 0.17], p = 0.011; Figure 3e). In the sertraline group, the early change was different from zero (t = 2.74, p = 0.007), whereas the later change was not (t = −0.32, p = 0.75). In contrast, in the placebo group, the early change did not differ from zero (t = −0.70, p = 0.483), but the late change did (t = 3.44, p< 0.001). Hence, the group difference in the late change was due to an increase in loss learning rate from baseline to week 6 in the placebo group. The aversive learning rate is strongly driven by switching after losses in the early part of the learning curve. Indeed, there was an elevated switching probability after losses during the first eight trials in the sertraline group (MD = 0.21, CI [0.0–0.41], p = 0.048; averaged across the go-to-avoid and nogo-to-avoid conditions). Finally, the loss learning rate was also positively associated with the anxiety scores (at week 2: β = 0.01, CI[0.0 − 0.02], p = 0.047; at week 6: β = 0.02, CI[0.0 − 0.03], p = 0.016; across all sessions: β = 0.02, CI[0.01 − 0.03], p = 0.001). However, there was no evidence for an association between anxiety symptoms and either the loss learning rate at baseline (β = 0.01, CI [−0.0 to 0.02], p = 0.24) or the early change in loss learning rate (week 2 – baseline; β = −0.02, CI[ − 0.04 to 0.01], p = 0.164). Additionally, this effect could not be shown based on the early switch probability described above (β = 0.0, CI[ − 0.02 to 0.02], p = 0.73).

Repeating these analyses on the complete sample including all task runs resulted in a broadly consistent pattern of effects (c.f. Supplementary Materials C Findings in the Whole Sample).

In post hoc analyses, we adjusted for the use of other antidepressants and/or psychotherapy, adherence score, and the number of tablets, yielding consistent results as detailed in the Supplementary Materials G Post-hoc Analyses.

Task reliability

Parameters showed poor to moderate reliability (ICC(3,1) ranging from 0 to 0.53; cf. Supplementary Materials B.6 Test-Retest Reliability). The aversive Pavlovian bias was the most reliable parameter (ICC(3, 1) = 0.53, CI[0.41 − 0.64], p < 0.001). The Pavlovian parameters and the go bias also significantly changed over time. The Pavlovian biases decreased (aversive: β = −0.1, CI[ − 0.16 to − 0.04], p = 0.001; appetitive: β = −0.08, CI[ − 0.13 to − 0.04], p < 0.001) and the go bias increased (β = 0.13, CI[0.05 − 0.21], p < 0.001) over sessions which likely led to an increase in task accuracy (β = 0.02, CI[0.01 − 0.03], p < 0.001). We note that age reduced accuracy (β = −0.03, CI[ − 0.04 to − 0.02], p < 0.001), most likely due to increasing Pavlovian biases (aversive: β = 0.19, CI[0.11 − 0.26], p < 0.001; appetitive: β = 0.16, CI[0.11 − 0.21], p < 0.001) and reducing go bias with age (β = −0.4, CI [−0.49 to−0.32], p< 0.001).

Discussion

We investigated the effects of the SSRI sertraline on reinforcement learning mechanisms in the PANDA trial, a pragmatic multicenter, double-blind, placebo-controlled, randomized clinical trial. SSRIs are first-line pharmacological treatments for depression and anxiety, but the mechanism of SSRI action is still unknown. A better understanding of how SSRIs work could lead to improved response predictions and new, refined treatments. Our goal was to identify clinically relevant mechanisms to link receptor action to cognition and affective processing. Reinforcement learning enables such links and hence is a promising framework for investigating the mechanisms of SSRI action. The PANDA trial was the largest individual placebo-controlled trial not funded by the pharmaceutical industry. The sample was recruited in primary care based on clinical equipoise, and depressive symptoms ranged from mild to severe. Findings might therefore be of relevance to the broader primary care population. As sertraline acts through similar mechanisms as other SSRIs (Cipriani et al., ), the findings may also be relevant for other SSRIs.

Due to the poor task performance, almost half of the performed task runs were excluded. Early on (at week two) non-informative data was more prevalent in the sertraline group, suggesting that patients in the active group may have responded more randomly. Such randomness can be a signature of low overall motivation to perform the task. One possibility is that such a broad motivational reduction could be a signature of SSRI-induced affective blunting (Barnhart, Makela, & Latocha, ; Marazziti et al., ; McCabe, Cowen, & Harmer, ; Price, Cole, & Goodwin, ). However, there were no discernible differences in symptoms between patients who provided informative and non-informative data at week two, and sertraline had a positive impact on learning at week two in the included sample. These findings speak against a broad blunting effect.

The primary goal of this study was to test whether aversive Pavlovian bias mediates the effect of sertraline on anxiety. We found no evidence supporting an influence of sertraline on aversive Pavlovian bias. This result contrasts with previous research suggesting that Pavlovian inhibition is sensitive to serotonin (Crockett et al., , ; Geurts et al., ; Hebart & Gläscher, ). There are several possible reasons for this discrepancy. First, it may be that serotonin manipulations have different effects on Pavlovian inhibition in samples with and without depression and/or anxiety. While the current study was performed in a clinical population, previous studies primarily examined healthy volunteers. Second, previous research focused on acute changes via tryptophan depletion (Crockett et al., , ; Geurts et al., ; Hebart & Gläscher, ) or a single administration of an SSRI citalopram (Guitart-Masip et al., ) rather than the chronic administration examined here. It has long been posited that acute and chronic SSRI administration have opposite effects (e.g. Harmer, Cowen, & Goodwin, ; Harmer et al., ). Third, we cannot rule out that some of the Pavlovian inhibition signal is conflated with the loss learning signal as there are non-negligible correlations between parameters (cf. Supplementary Materials Fig. B.6). This is likely compounded by broader issues with data quality, which in turn reduce the ability of models to distinguish aversive Pavlovian inhibition and learning from losses. Subsample analyses attempting to identify either test–retest or symptom loads as reasons for the null results did not yield clear results.

Exploratory analyses identified relationships between sertraline, aversive processing, and symptoms. First, sertraline affected learning from losses but not from rewards. This finding is in keeping with well-supported empirical evidence demonstrating that serotonin modulation impacts learning (Bari et al., ; Brigman et al., ; Michely et al., ; Scholl et al., ), and specifically punishment learning (Chamberlain et al., ; Cools, Roberts, & Robbins, ; Tanaka et al., , ). Prolonged serotonin alterations have downstream effects including augmented learning and plasticity (Dayer, ; Kraus, Castrén, Kasper, & Lanzenberger, ). In the current dataset, the learning rate from losses increased over the first two weeks of sertraline treatment relative to placebo. The placebo group then ‘caught up’, removing the group differences in loss learning rate at six weeks. Changes in the performance of learning tasks are frequently observed and thought to represent a type of meta-learning, i.e. learning more broadly about the strategy of performing a task rather than learning within the task itself (Botvinick et al., ; Doya, ; Langdon et al., ; Vanschoren, ). As such, the late change in performance in the placebo group compared to the early change in the sertraline group suggests that sertraline may have increased the speed at which this meta-learning may have occurred and may have done so by specifically altering behavioral adaptation after losses within the task. One complication is that, at two weeks, there was already some evidence for changes in anxiety symptoms, and an inverse causal path (with anxiety mediating the effect of sertraline) cannot be excluded.

The loss learning rate was correlated with anxiety symptoms at both follow-up time points and over all measurement points. This is, in principle, in line with previous research outlined in a recent meta-analysis reporting higher punishment learning rates and slightly lower reward learning rates in patients (Pike & Robinson, ). Yet, this is difficult to reconcile with, first, the SSRI-induced increase in learning from punishment, and second the fact that both anxiety and depression are treated by SSRIs and are linked to heightened punishment learning themselves. Interestingly, a similar conundrum was present in the literature on learned helplessness, which was associated with increased levels of serotonin (Petty, Kramer, & Moeller, ), but could also be reversed as a response to SSRIs (Hajszan et al., ; Kirby, ; Malberg & Duman, ; Maudhuit et al., ). Hence, coupling increases in serotonin levels with a simple account of serotonin levels on behavior is unlikely to be able to explain SSRI effects. Indeed, the serotonin system is known to be exquisitely complex, with many different serotonin receptors distinctively distributed (Hansen et al., ). A possible explanation could be that SSRIs facilitate learning faster in a punishing environment, thus leading to less negative and more positive (or neutral) feedback. It is interesting to consider how this bias towards learning from losses might be linked to mood. Self-reports of happiness are linked to positive prediction errors (Rutledge, Skandali, Dayan, & Dolan, ), suggesting that negative prediction errors might similarly influence negative affective states. In other words, SSRIs might gradually improve mood by enhancing negative expectations through faster loss learning, thereby giving rise to less disappointing and more rewarding experiences.

Finally, improvements in depressive symptoms in the sertraline group were preceded by an early decrease in the aversive Pavlovian bias. In other words, patients on sertraline showed a higher increase in their tendency to withhold an action when facing a loss between baseline and the 2-week follow-up, the higher their depressive symptoms were after 12 weeks.

Overall, the findings draw a complex picture involving aversive processing, sertraline, and symptoms, possibly reflecting the known complexity of the serotonin system. Despite the methodological limitations and the failure to support the preregistered hypotheses, the exploratory data suggest alterations in the processing of losses. A tentative possibility is that SSRIs alter the speed of learning from losses early on, inducing a shift from Pavlovian to instrumental learning when confronted with losses. The alteration in aversive Pavlovian bias was not directly linked to sertraline. However, sertraline appeared to modulate the association between Pavlovian inhibition and future treatment outcomes. Reducing aversive Pavlovian control might hence promote approach responses in a punishing environment, facilitating unexpected rewarding experiences and thus helping to alleviate depressive symptoms.

Limitations

Inclusion in the trial was based on clinical equipoise, i.e. inclusion was based on an uncertainty whether medication could clinically be helpful for a particular person. This may have decreased the power to detect differences from placebo. For mechanistic studies such as the current one, it could be better to study a cohort of typical responders, i.e. patients who are prescribed medication with clinical confidence.

Extensive validation analyses showed that task performance was frequently objectively poor resulting in a large fraction of the task runs being non-informative. Non-informative task runs had to be excluded from analyses because formally they cannot provide information about cognitive mechanisms. We attempted to address this by correcting for baseline variables that were significantly associated with non-informative task runs. The sertraline and the placebo group in the final informative sample continued to be matched on baseline characteristics. Nevertheless, the exclusion of data has severely curtailed the power in the study. Furthermore, because noninformative data was more common in the drug than the non-drug arm, a causal interpretation is no longer warranted.

The poor task performance has important implications for future mechanistic research in this domain. Although the task has been extensively used in laboratory studies (Guitart-Masip et al., ), combined with neuroimaging (Guitart-Masip et al., ), pharmacological (Guitart-Masip et al., ) and other interventions and adapted (Millner, Gershman, Nock, & den Ouden, ; Moutoussis et al., ; Swart et al., ), it did not prove effective in a longitudinal clinical trial. This reinforces the paramount importance of acceptability and effectiveness testing of cognitive measurements for translational research and calls for an involvement of stakeholders in the design of research tasks.

The relationships between cognitive mechanisms and symptoms were weak. This probably reflects more general findings in the field (Eisenberg et al., ), but also the specific limitations around data quality mentioned above which limit the strength of possible associations (Spearman, ). We also note that our computational modeling approach was very conservative in that all parameters were allowed to change freely between participants and sessions, with no constraints for within-participant data.

Exploratory results were presented based on passing a conservative significance threshold and their relevance to the preregistered hypotheses. Nevertheless, they should be treated with caution prior to replication.

Conclusion

This study represents a significant exploration of specific reinforcement learning processes in a pragmatic RCT for depression. Specific reinforcement learning mechanisms did show a relationship to aspects of depression and anxiety and its treatment with SSRIs, but this was weak and not as hypothesized a priori. Sertraline influenced aversive processing in the first two treatment weeks by altering how participants learn to execute a passive or active action to avoid loss. Moreover, symptoms were associated with aversive processing but how this relationship relates to SSRI appears complex. The fact that almost half of the data was non-informative emphasizes the importance of developing patient-acceptable task probes.

Acknowledgements

We would like to thank all the members of the Trial Committee, the Data Management Committee and the Public and Patient Involvement representatives. We would also like to thank colleagues who have contributed to the study through recruitment, administrative help and other advice. Finally, we would like to thank Agnes Norbury, Anahit Mkrtchian, Tore Erdmann, Jiazhou Chen, Jenny Fielder, Anna Hall, Jakub Onysk, Jade Serfaty, and Lana Tymchyk for their feedback on the manuscript.

Supplementary material

The supplementary material for this article can be found at https://doi.org/10.1017/S0033291724000837.

References

  • Abela A. R., Browne C. J., Sargin D., Prevot T. D., Ji X. D., Li Z., … Fletcher P. J. (2020). Median raphe serotonin neurons promote anxiety-like behavior via inputs to the dorsal hippocampus. Neuropharmacology, 168, 107985. doi: 10.1016/j.neuropharm.2020.107985
    Cited Here
  • Ahmed N., Bone J. K., Lewis G., Freemantle N., Harmer C. J., Duffy L., & Lewis G. (2022). The effect of sertraline on emotional processing: Secondary analyses of the PANDA randomised controlled trial. Psychological Medicine, 52(13), 2814–2821. doi: 10.1017/S0033291720004985
    Cited Here
  • Alvarez R. P., Chen G., Bodurka J., Kaplan R., & Grillon C. (2011). Phasic and sustained fear in humans elicits distinct patterns of brain activity. NeuroImage, 55(1), 389–400. doi: 10.1016/j.neuroimage.2010.11.057
    Cited Here
  • Amemori K., & Graybiel A. M. (2012). Localized microstimulation of primate pregenual cingulate cortex induces negative decision-making. Nature Neuroscience, 15(5), 776–785. doi: 10.1038/nn.3088
    Cited Here
  • Amo R., Fredes F., Kinoshita M., Aoki R., Aizawa H., Agetsuma M., … Okamoto H. (2014). The habenulo-raphe serotonergic circuit encodes an aversive expectation value essential for adaptive active avoidance of danger. Neuron, 84(5), 1034–1048. doi: 10.1016/j.neuron.2014.10.035
    Cited Here
  • Bandelow B., Reitt M., Röver C., Michaelis S., Görlich Y., & Wedekind D. (2015). Efficacy of treatments for anxiety disorders: A meta-analysis. International Clinical Psychopharmacology, 30(4), 183–192. doi: 10.1097/YIC.0000000000000078
    Cited Here
  • Bari A., Theobald D. E., Caprioli D., Mar A. C., Aidoo-Micah A., Dalley J. W., & Robbins T. W. (2010). Serotonin modulates sensitivity to reward and negative feedback in a probabilistic reversal learning task in rats. Neuropsychopharmacology: Official Publication of the American College of Neuropsychopharmacology, 35(6), 1290–1301. doi: 10.1038/npp.2009.233
    Cited Here
  • Barnhart W. J., Makela E. H., & Latocha M. J. (2004). SSRI-induced apathy syndrome: A clinical review. Journal of Psychiatric Practice, 10(3), 196–199. doi: 10.1097/00131746-200405000-00010
    Cited Here
  • Beck A. T., Steer R. A., & Brown G. K. (1996). Manual for the Beck depression inventory-II. San Antonio, TX: Psychological Corporation.
    Cited Here
  • Bogowicz P., Curtis H. J., Walker A. J., Cowen P., Geddes J., & Goldacre B. (2021). Trends and variation in antidepressant prescribing in English primary care: A retrospective longitudinal study. BJGP Open, 5(4), BJGPO.2021.0020. doi: 10.3399/BJGPO.2021.0020
    Cited Here
  • Bolles R. C. (1970). Species-specific defense reactions and avoidance learning. Psychological Review, 77(1), 32–48. doi: 10.1037/h0028589
    Cited Here
  • Botvinick M., Ritter S., Wang J. X., Kurth-Nelson Z., Blundell C., & Hassabis D. (2019). Reinforcement learning, fast and slow. Trends in Cognitive Sciences, 23(5), 408–422. doi: 10.1016/j.tics.2019.02.006
    Cited Here
  • Boureau Y. L., & Dayan P. (2011). Opponency revisited: Competition and cooperation between dopamine and serotonin. Neuropsychopharmacology, 36(1), 74–97. doi: 10.1038/npp.2010.151
    Cited Here
  • Brigman J. L., Mathur P., Harvey-White J., Izquierdo A., Saksida L. M., Bussey T. J., … Holmes A. (2010). Pharmacological or genetic inactivation of the serotonin transporter improves reversal learning in mice. Cerebral Cortex, 20(8), 1955–1963. doi: 10.1093/cercor/bhp266
    Cited Here
  • Bylsma L. M., Morris B. H., & Rottenberg J. (2008). A meta-analysis of emotional reactivity in major depressive disorder. Clinical psychology review, 28(4), 676–691. doi: 10.1016/j.cpr.2007.10.001
    Cited Here
  • Chamberlain S. R., Müller U., Blackwell A. D., Clark L., Robbins T. W., & Sahakian B. J. (2006). Neurochemical modulation of response inhibition and probabilistic learning in humans. Science, 311(5762), 861–863. doi: 10.1126/science.1121218
    Cited Here
  • Cipriani A., Furukawa T. A., Salanti G., Chaimani A., Atkinson L. Z., Ogawa Y., … Geddes J. R. (2018). Comparative efficacy and acceptability of 21 antidepressant drugs for the acute treatment of adults with major depressive disorder: A systematic review and network meta-analysis. Lancet, 391(10128), 1357–1366. doi: 10.1016/S0140-6736(17)32802-7
    Cited Here
  • Cohen J. Y., Amoroso M. W., & Uchida N. (2015). Serotonergic neurons signal reward and punishment on multiple timescales. eLife, 4, e06346. doi: 10.7554/eLife.06346
    Cited Here
  • Cools R., Roberts A. C., & Robbins T. W. (2008). Serotoninergic regulation of emotional and behavioural control processes. Trends in Cognitive Sciences, 12(1), 31–40. doi: 10.1016/j.tics.2007.10.011
    Cited Here
  • Crockett M. J., Clark L., Apergis-Schoute A. M., Morein-Zamir S., & Robbins T. W. (2012). Serotonin modulates the effects of Pavlovian aversive predictions on response vigor. Neuropsychopharmacology, 37(10), 2244–2252. doi: 10.1038/npp.2012.75
    Cited Here
  • Crockett M. J., Clark L., & Robbins T. W. (2009). Reconciling the role of serotonin in behavioral inhibition and aversion: Acute tryptophan depletion abolishes punishment-induced inhibition in humans. The Journal of Neuroscience, 29(38), 11993–11999. doi: 10.1523/JNEUROSCI.2513-09.2009
    Cited Here
  • Cuthbert B. N., & Insel T. R. (2013). Toward the future of psychiatric diagnosis: The seven pillars of RDoC. BMC Medicine, 11, 126. doi: 10.1186/1741-7015-11-126
    Cited Here
  • Dayan P., & Huys Q. J. M. (2008). Serotonin, inhibition, and negative mood. PLoS Computational Biology, 4(2), e4. doi: 10.1371/journal.pcbi.0040004
    Cited Here
  • Dayan P., & Huys Q. J. M. (2009). Serotonin in affective control. Annual Review of Neuroscience, 32, 95–126. doi: 10.1146/annurev.neuro.051508.135607
    Cited Here
  • Dayan P., Niv Y., Seymour B., & Daw N. D. (2006). The misbehavior of value and the discipline of the will. Neural Networks, 19(8), 1153–1160. doi: 10.1016/j.neunet.2006.03.002
    Cited Here
  • Dayer A. (2014). Serotonin-related pathways and developmental plasticity: Relevance for psychiatric disorders. Dialogues in Clinical Neuroscience, 16(1), 29–41. doi: 10.31887/DCNS.2014.16.1/adayer
    Cited Here
  • Doya K. (2002). Metalearning and neuromodulation. Neural Networks, 15(4–6), 495–506. doi: 10.1016/s0893-6080(02)00044-8
    Cited Here
  • Doya K., Miyazaki K. W., & Miyazaki K. (2021). Serotonergic modulation of cognitive computations. Current Opinion in Behavioral Sciences, 38, 116–123. doi: 10.1016/j.cobeha.2021.02.003
    Cited Here
  • Drevets W. C., Savitz J., & Trimble M. (2008). The subgenual anterior cingulate cortex in mood disorders. CNS Spectrums, 13(8), 663–681. doi: 10.1017/s1092852900013754
    Cited Here
  • Duffy L., Bacon F., Clarke C. S., Donkor Y., Freemantle N., Gilbody S., … Lewis G. (2019). A randomised controlled trial assessing the use of citalopram, sertraline, fluoxetine and mirtazapine in preventing relapse in primary care patients who are taking long-term maintenance antidepressants (ANTLER: ANTidepressants to prevent reLapse in dEpRession): Study protocol for a randomised controlled trial. Trials, 20(1), 319. doi: 10.1186/s13063-019-3390-8
    Cited Here
  • Eisenberg I. W., Bissett P. G., Zeynep Enkavi A., Li J., MacKinnon D. P., Marsch L. A., & Poldrack R. A. (2019). Uncovering the structure of self-regulation through data-driven ontology discovery. Nature Communications, 10(1), 2319. doi: 10.1038/s41467-019-10301-1
    Cited Here
  • Eshel N., & Roiser J. P. (2010). Reward and punishment processing in depression. Biological Psychiatry, 68(2), 118–124. doi: 10.1016/j.biopsych.2010.01.027
    Cited Here
  • Evans-Lacko S., Aguilar-Gaxiola S., Al-Hamzawi A., Alonso J., Benjet C., Bruffaerts R., … Thornicroft G. (2018). Socio-economic variations in the mental health treatment gap for people with anxiety, mood, and substance use disorders: Results from the WHO World Mental Health (WMH) surveys. Psychological Medicine, 48(9), 1560–1571. doi: 10.1017/S0033291717003336
    Cited Here
  • Garakani A., Murrough J. W., Freire R. C., Thom R. P., Larkin K., Buono F. D., & Iosifescu D. V. (2020). Pharmacotherapy of anxiety disorders: Current and emerging treatment options. Frontiers in Psychiatry, 11, 595584. doi: 10.3389/fpsyt.2020.595584
    Cited Here
  • Geurts D. E., Huys Q. J. M., den Ouden H. E., & Cools R. (2013a). Aversive Pavlovian control of instrumental behavior in humans. Journal of Cognitive Neuroscience, 25(9), 1428–1441. doi: 10.1162/jocn_a_00425
    Cited Here
  • Geurts D. E., Huys Q. J. M., den Ouden H. E., & Cools R. (2013b). Serotonin and aversive Pavlovian control of instrumental behavior in humans. The Journal of Neuroscience, 33(48), 18932–18939. doi: 10.1523/JNEUROSCI.2749-13.2013
    Cited Here
  • Guitart-Masip M., Duzel E., Dolan R., & Dayan P. (2014). Action versus valence in decision making. Trends in Cognitive Sciences, 18(4), 194–202. doi: 10.1016/j.tics.2014.01.003
    Cited Here
  • Guitart-Masip M., Fuentemilla L., Bach D. R., Huys Q. J. M., Dayan P., Dolan R. J., & Duzel E. (2011). Action dominates valence in anticipatory representations in the human striatum and dopaminergic midbrain. The Journal of Neuroscience, 31(21), 7867–7875. doi: 10.1523/JNEUROSCI.6376-10.2011
    Cited Here
  • Guitart-Masip M., Huys Q. J. M., Fuentemilla L., Dayan P., Duzel E., & Dolan R. J. (2012). Go and no-go learning in reward and punishment: Interactions between affect and effect. NeuroImage, 62(1), 154–166. doi: 10.1016/j.neuroimage.2012.04.024
    Cited Here
  • Hajszan T., Szigeti-Buck K., Sallam N. L., Bober J., Parducz A., Maclusky N. J., … Duman R. S. (2010). Effects of estradiol on learned helplessness and associated remodeling of hippocampal spine synapses in female rats. Biological Psychiatry, 67(2), 168–174. doi: 10.1016/j.biopsych.2009.08.017
    Cited Here
  • Halahakoon D. C., Kieslich K., O'Driscoll C., Nair A., Lewis G., & Roiser J. P. (2020). Reward-processing behavior in depressed participants relative to healthy volunteers: A systematic review and meta-analysis. JAMA Psychiatry, 77(12), 1286–1295. doi: 10.1001/jamapsychiatry.2020.2139
    Cited Here
  • Hansen J. Y., Shafiei G., Markello R. D., Smart K., Cox S. M. L., Nørgaard M., … Misic B. (2022). Mapping neurotransmitter systems to the structural and functional organization of the human neocortex. Nature Neuroscience, 25(11), 1569–1581. doi: 10.1038/s41593-022-01186-3
    Cited Here
  • Harmer C. J. (2013). Emotional processing and antidepressant action. Current Topics in Behavioral Neurosciences, 14, 209–222. doi: 10.1007/7854_2012_210
    Cited Here
  • Harmer C. J., Cowen P. J., & Goodwin G. M. (2011). Efficacy markers in depression. Journal of Psychopharmacology, 25(9), 1148–1158. doi: 10.1177/0269881110367722
    Cited Here
  • Harmer C. J., Duman R. S., & Cowen P. J. (2017). How do antidepressants work? New perspectives for refining future treatment approaches. The Lancet Psychiatry, 4(5), 409–418. doi: 10.1016/S2215-0366(17)30015-9
    Cited Here
  • Harmer C. J., Goodwin G. M., & Cowen P. J. (2009a). Why do antidepressants take so long to work? A cognitive neuropsychological model of antidepressant drug action. The British Journal of Psychiatry, 195(2), 102–108. doi: 10.1192/bjp.bp.108.051193
    Cited Here
  • Harmer C. J., O'Sullivan U., Favaron E., Massey-Chase R., Ayres R., Reinecke A., … Cowen P. J. (2009b). Effect of acute antidepressant administration on negative affective bias in depressed patients. The American Journal of Psychiatry, 166(10), 1178–1184. doi: 10.1176/appi.ajp.2009.09020149
    Cited Here
  • Hasler G., Fromm S., Alvarez R. P., Luckenbaugh D. A., Drevets W. C., & Grillon C. (2007). Cerebral blood flow in immediate and sustained anxiety. The Journal of Neuroscience, 27(23), 6313–6319. doi: 10.1523/JNEUROSCI.5369-06.2007
    Cited Here
  • Hebart M. N., & Gläscher J. (2015). Serotonin and dopamine differentially affect appetitive and aversive general Pavlovian-to-instrumental transfer. Psychopharmacology, 232(2), 437–451. doi: 10.1007/s00213-014-3682-3
    Cited Here
  • Hirschfeld R. M. (2001). The comorbidity of major depression and anxiety disorders: Recognition and management in primary care. Primary Care Companion to the Journal of Clinical Psychiatry, 3(6), 244–254. doi: 10.4088/pcc.v03n0609
    Cited Here
  • Huys Q. J. M., Browning M., Paulus M. P., & Frank M. J. (2021). Advances in the computational understanding of mental illness. Neuropsychopharmacology, 46(1), 3–19. doi: 10.1038/s41386-020-0746-4
    Cited Here
  • Huys Q. J. M., Cools R., Gölzer M., Friedel E., Heinz A., Dolan R. J., & Dayan P. (2011a). Disentangling the roles of approach, activation and valence in instrumental and Pavlovian responding. PLoS Computational Biology, 7(4), e1002028. doi: 10.1371/journal.pcbi.1002028
    Cited Here
  • Huys Q. J. M., Eshel N., O'Nions E., Sheridan L., Dayan P., & Roiser J. P. (2012). Bonsai trees in your head: How the Pavlovian system sculpts goal-directed choices by pruning decision trees. PLoS Computational Biology, 8(3), e1002410. doi: 10.1371/journal.pcbi.1002410
    Cited Here
  • Huys Q. J. M., Gölzer M., Friedel E., Heinz A., Cools R., Dayan P., & Dolan R. J. (2016). The specificity of Pavlovian regulation is associated with recovery from depression. Psychological Medicine, 46(5), 1027–1035. doi: 10.1017/S0033291715002597
    Cited Here
  • Huys Q. J. M., Moutoussis M., & Williams J. (2011b). Are computational models of any use to psychiatry? Neural Networks, 24(6), 544–551. doi: 10.1016/j.neunet.2011.03.001
    Cited Here
  • Huys Q. J. M., Russek E. M., Abitante G., Kahnt T., & Gollan J. K. (2022). Components of behavioral activation therapy for depression engage specific reinforcement learning mechanisms in a pilot study. Computational Psychiatry, 6(1), 238–255. doi: 10.5334/cpsy.81
    Cited Here
  • Kendrick T., Stuart B., Newell C., Geraghty A. W., & Moore M. (2015). Did NICE guidelines and the Quality Outcomes Framework change GP antidepressant prescribing in England? Observational study with time trend analyses 2003–2013. Journal of Affective Disorders, 186, 171–177. doi: 10.1016/j.jad.2015.06.052
    Cited Here
  • Kirby K. J. (2006). The effects of subchronic and chronic SSRI treatments on learned helplessness behavior in rats [Master's thesis]. University of Colorado, Boulder, CO, USA. ISBN: 9780542599736.
    Cited Here
  • Kraus C., Castrén E., Kasper S., & Lanzenberger R. (2017). Serotonin and neuroplasticity – links between molecular, functional and structural pathophysiology in depression. Neuroscience and Biobehavioral Reviews, 77, 317–326. doi: 10.1016/j.neubiorev.2017.03.007
    Cited Here
  • Kroenke K., Spitzer R. L., & Williams J. B. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606–613. doi: 10.1046/j.1525-1497.2001.016009606.x
    Cited Here
  • Lally N., Huys Q. J. M., Eshel N., Faulkner P., Dayan P., & Roiser J. P. (2017). The neural basis of aversive Pavlovian guidance during planning. The Journal of Neuroscience, 37(42), 10215–10229. doi: 10.1523/JNEUROSCI.0085-17.2017
    Cited Here
  • Lan D. C. L., & Browning M. (2022). What can reinforcement learning models of dopamine and serotonin tell us about the action of antidepressants? Computational Psychiatry, 6(1), 166–188. doi: 10.5334/cpsy.83
    Cited Here
  • Langdon A., Botvinick M., Nakahara H., Tanaka K., Matsumoto M., & Kanai R. (2022). Meta-learning, social cognition and consciousness in brains and machines. Neural Networks, 145, 80–89. doi: 10.1016/j.neunet.2021.10.004
    Cited Here
  • LeDoux J. E., Moscarello J., Sears R., & Campese V. (2017). The birth, death and resurrection of avoidance: A reconceptualization of a troubled paradigm. Molecular Psychiatry, 22(1), 24–36. doi: 10.1038/mp.2016.166
    Cited Here
  • Lewis G., Duffy L., Ades A., Amos R., Araya R., Brabyn S., … Lewis G. (2019). The clinical effectiveness of sertraline in primary care and the role of depression severity and duration (PANDA): A pragmatic, double-blind, placebo-controlled randomised trial. The Lancet Psychiatry, 6(11), 903–914. doi: 10.1016/S2215-0366(19)30366-9
    Cited Here
  • Maia T. V., & Frank M. J. (2011). From reinforcement learning models to psychiatric and neurological disorders. Nature Neuroscience, 14(2), 154–162. doi: 10.1038/nn.2723
    Cited Here
  • Malberg J. E., & Duman R. S. (2003). Cell proliferation in adult hippocampus is decreased by inescapable stress: Reversal by fluoxetine treatment. Neuropsychopharmacology, 28(9), 1562–1571. doi: 10.1038/sj.npp.1300234
    Cited Here
  • Marazziti D., Mucci F., Tripodi B., Carbone M. G., Muscarella A., Falaschi V., & Baroni S. (2019). Emotional blunting, cognitive impairment, bone fractures, and bleeding as possible side effects of long-term use of SSRIs. Clinical Neuropsychiatry, 16(2), 75–85.
    Cited Here
  • Martell C. R., Dimidjian S., & Herman-Dunn R. (2010). Behavioral activation for depression: A clinician's guide. New York: Guilford Press.
    Cited Here
  • Maudhuit C., Prévot E., Dangoumau L., Martin P., Hamon M., & Adrien J. (1997). Antidepressant treatment in helpless rats: Effect on the electrophysiological activity of raphe dorsalis serotonergic neurons. Psychopharmacology, 130(3), 269–275. doi: 10.1007/s002130050239
    Cited Here
  • Mayberg H. S., Lozano A. M., Voon V., McNeely H. E., Seminowicz D., Hamani C., … Kennedy S. H. (2005). Deep brain stimulation for treatment-resistant depression. Neuron, 45(5), 651–660. doi: 10.1016/j.neuron.2005.02.014
    Cited Here
  • McCabe C., Cowen P. J., & Harmer C. J. (2009). Neural representation of reward in recovered depressed patients. Psychopharmacology, 205(4), 667–677. doi: 10.1007/s00213-009-1573-9
    Cited Here
  • McGraw K. O., & Wong S. P. (1996). Forming inferences about some intraclass correlation coefficients. Psychological Methods, 1, 30–46. doi: 10.1037/1082-989X.1.1.30
    Cited Here
  • Michely J., Eldar E., Erdman A., Martin I. M., & Dolan R. J. (2022). Serotonin modulates asymmetric learning from reward and punishment in healthy human volunteers. Communications Biology, 5(1), 812. doi: 10.1038/s42003-022-03690-5
    Cited Here
  • Michely J., Eldar E., Martin I. M., & Dolan R. J. (2020). A mechanistic account of serotonin's impact on mood. Nature Communications, 11(1), 2335. doi: 10.1038/s41467-020-16090-2
    Cited Here
  • Millner A. J., Gershman S. J., Nock M. K., & den Ouden H. E. M. (2018). Pavlovian control of escape and avoidance. Journal of Cognitive Neuroscience, 30(10), 1379–1390. doi: 10.1162/jocn_a_01224
    Cited Here
  • Mkrtchian A., Aylward J., Dayan P., Roiser J. P., & Robinson O. J. (2017). Modeling avoidance in mood and anxiety disorders using reinforcement learning. Biological Psychiatry, 82(7), 532–539. doi: 10.1016/j.biopsych.2017.01.017
    Cited Here
  • Morris S. E., Sanislow C. A., Pacheco J., Vaidyanathan U., Gordon J. A., & Cuthbert B. N. (2022). Revisiting the seven pillars of RDoC. BMC Medicine, 20(1), 220. doi: 10.1186/s12916-022-02414-0
    Cited Here
  • Moutoussis M., Bullmore E. T., Goodyer I. M., Fonagy P., Jones P. B., Dolan R. J., Dayan P., & Neuroscience in Psychiatry Network Research Consortium (2018). Change, stability, and instability in the Pavlovian guidance of behaviour from adolescence to young adulthood. PLoS Computational Biology, 14(12), e1006679. doi: 10.1371/journal.pcbi.1006679
    Cited Here
  • Nord C. L., Lawson R. P., Huys Q. J. M., Pilling S., & Roiser J. P. (2018). Depression is associated with enhanced aversive Pavlovian control over instrumental behaviour. Scientific Reports, 8(1), 12582. doi: 10.1038/s41598-018-30828-5
    Cited Here
  • Ohmura Y., Tanaka K. F., Tsunematsu T., Yamanaka A., & Yoshioka M. (2014). Optogenetic activation of serotonergic neurons enhances anxiety-like behaviour in mice. The International Journal of Neuropsychopharmacology, 17(11), 1777–1783. doi: 10.1017/S1461145714000637
    Cited Here
  • Petty F., Kramer G., & Moeller M. (1994). Does learned helplessness induction by haloperidol involve serotonin mediation? Pharmacology, Biochemistry, and Behavior, 48(3), 671–676. doi: 10.1016/0091-3057(94)90330-1
    Cited Here
  • Pike A. C., & Robinson O. J. (2022). Reinforcement learning in patients with mood and anxiety disorders vs control individuals: A systematic review and meta-analysis. JAMA Psychiatry, 79(4), 313–322. doi: 10.1001/jamapsychiatry.2022.0051
    Cited Here
  • Pizzagalli D. A., Jahn A. L., & O'Shea J. P. (2005). Toward an objective characterization of an anhedonic phenotype: A signal-detection approach. Biological Psychiatry, 57(4), 319–327. doi: 10.1016/j.biopsych.2004.11.026
    Cited Here
  • Pizzagalli D. A., Smoski M., Ang Y. S., Whitton A. E., Sanacora G., Mathew S. J., … Krystal A. D. (2020). Selective kappa-opioid antagonism ameliorates anhedonic behavior: Evidence from the Fast-fail Trial in Mood and Anxiety Spectrum Disorders (FAST-MAS). Neuropsychopharmacology, 45(10), 1656–1663. doi: 10.1038/s41386-020-0738-4
    Cited Here
  • Price J., Cole V., & Goodwin G. M. (2009). Emotional side-effects of selective serotonin reuptake inhibitors: Qualitative study. The British Journal of Psychiatry, 195(3), 211–217. doi: 10.1192/bjp.bp.108.051110
    Cited Here
  • Ramirez-Mahaluf J. P., Perramon J., Otal B., Villoslada P., & Compte A. (2018). Subgenual anterior cingulate cortex controls sadness-induced modulations of cognitive and emotional network hubs. Scientific Reports, 8(1), 8566. doi: 10.1038/s41598-018-26317-4
    Cited Here
  • Robinson O. J., Overstreet C., Allen P. S., Letkiewicz A., Vytal K., Pine D. S., & Grillon C. (2013). The role of serotonin in the neurocircuitry of negative affective bias: Serotonergic modulation of the dorsal medial prefrontal-amygdala ‘aversive amplification’ circuit. NeuroImage, 78, 217–223. doi: 10.1016/j.neuroimage.2013.03.075
    Cited Here
  • Roiser J. P., Elliott R., & Sahakian B. J. (2012a). Cognitive mechanisms of treatment in depression. Neuropsychopharmacology, 37(1), 117–136. doi: 10.1038/npp.2011.183
    Cited Here
  • Roiser J. P., Levy J., Fromm S. J., Goldman D., Hodgkinson C. A., Hasler G., … Drevets W. C. (2012b). Serotonin transporter genotype differentially modulates neural responses to emotional words following tryptophan depletion in patients recovered from depression and healthy volunteers. Journal of Psychopharmacology, 26(11), 1434–1442. doi: 10.1177/0269881112442789
    Cited Here
  • Rutledge R. B., Skandali N., Dayan P., & Dolan R. J. (2014). A computational and neural model of momentary subjective well-being. Proceedings of the National Academy of Sciences of the United States of America, 111(33), 12252–12257. doi: 10.1073/pnas.1407535111
    Cited Here
  • Salaminios G., Duffy L., Ades A., Araya R., Button K. S., Churchill R., … Lewis G. (2017). A randomised controlled trial assessing the severity and duration of depressive symptoms associated with a clinically significant response to sertraline versus placebo, in people presenting to primary care with depression (PANDA trial): Study protocol for a randomised controlled trial. Trials, 18(1), 496. doi: 10.1186/s13063-017-2253-4
    Cited Here
  • Scholl J., Kolling N., Nelissen N., Browning M., Rushworth M. F., & Harmer C. J. (2017). Beyond negative valence: 2-week administration of a serotonergic antidepressant enhances both reward and effort learning signals. PLoS Biology, 15(2), e2000756. doi: 10.1371/journal.pbio.2000756
    Cited Here
  • Scholz V., Hook R. W., Kandroodi M. R., Algermissen J., Ioannidis K., Christmas D., … den Ouden H. E. M. (2022). Cortical dopamine reduces the impact of motivational biases governing automated behaviour. Neuropsychopharmacology, 47(8), 1503–1512. doi: 10.1038/s41386-022-01291-8
    Cited Here
  • Spearman C. (1904). The proof and measurement of association between two things. The American Journal of Psychology, 15(1), 72–101. doi: 10.2307/1422689
    Cited Here
  • Spitzer R. L., Kroenke K., Williams J. B., & Löwe B. (2006). A brief measure for assessing generalized anxiety disorder: The GAD-7. Archives of Internal Medicine, 166(10), 1092–1097. doi: 10.1001/archinte.166.10.1092
    Cited Here
  • Steele J. D., Kumar P., & Ebmeier K. P. (2007). Blunted response to feedback information in depressive illness. Brain, 130(Pt 9), 2367–2374. doi: 10.1093/brain/awm150
    Cited Here
  • Swart J. C., Froböse M. I., Cook J. L., Geurts D. E., Frank M. J., Cools R., & den Ouden H. E. (2017). Catecholaminergic challenge uncovers distinct Pavlovian and instrumental mechanisms of motivated (in)action. eLife, 6, e22169. doi: 10.7554/eLife.22169
    Cited Here
  • Tanaka S. C., Schweighofer N., Asahi S., Shishida K., Okamoto Y., Yamawaki S., & Doya K. (2007). Serotonin differentially regulates short- and long-term prediction of rewards in the ventral and dorsal striatum. PLoS One, 2(12), e1333. doi: 10.1371/journal.pone.0001333
    Cited Here
  • Tanaka S. C., Shishida K., Schweighofer N., Okamoto Y., Yamawaki S., & Doya K. (2009). Serotonin affects association of aversive outcomes to past actions. The Journal of Neuroscience, 29(50), 15669–15674. doi: 10.1523/JNEUROSCI.2799-09.2009
    Cited Here
  • Vanschoren J. (2019). Meta-learning. In F. Hutter, L. Kotthoff, & J. Vanschoren (Eds.), Automated machine learning: Methods, systems, challenges, the Springer series on challenges in machine learning (pp. 35–61). Cham: Springer International Publishing. doi: 10.1007/978-3-030-05318-5_9
    Cited Here
Copyright © Cambridge University Press 2024
View full text|Download PDF