ArticleOnline nowJanuary 09, 2026Open access

Motivation under aversive conditions is regulated by a striatopallidal pathway in primates

1Institute for the Advanced Study of Human Biology, Kyoto University, Kyoto 606-8501, Japan
2Center for the Evolutionary Origins of Human Behavior, Kyoto University, Inuyama 484-8506, Japan
3Japan Society for the Promotion of Science, Tokyo 102-0083, Japan
4
Lead contact
Publication History:
Received June 10, 2025; Revised November 6, 2025; Accepted December 12, 2025; Published online January 9, 2026
Copyright: © 2025 The Authors. Published by Elsevier Inc.

Highlights

Chemogenetic suppression of the VS-VP pathway restores motivation under aversion
VS-VP pathway detects aversive demand and suppresses initiation of trials
VS-VP pathway regulates trial initiation independently of goal valuation
VS and VP neurons exhibit opposing activity patterns across motivational contexts

Summary

Motivation often diminishes under aversive conditions. Clinically, motivational deficits are linked to psychiatric disorders such as depression and schizophrenia, yet the neural mechanisms by which aversive contexts suppress motivation remain unclear. Although classical theories associate motivation with the expected value of outcomes, less is known about the neural circuits that govern effort-based behavioral initiation. To address this, we dissociated motivational drive from goal valuation using an approach-avoidance (Ap-Av) task, in which macaques evaluated outcomes combining reward and punishment (air puffs to the face). As a control, we employed an approach-approach (Ap-Ap) task based solely on reward. Using chemogenetic manipulation, we found that selective inhibition of the ventral striatum to ventral pallidum (VS-VP) pathway restored the motivation to initiate trials in the Ap-Av task without affecting goal valuation. No effects were observed in the Ap-Ap task. These findings provide causal evidence that the VS-VP pathway mediates motivational suppression in aversive contexts. Electrophysiological recordings revealed rapid VS responses to aversive cues and a gradual decrease in VP activity, suggesting an inhibitory interaction in which elevated VS activity dampens VP output to limit initiation. The slower VP dynamics may reflect a process by which aversive signals are gradually integrated to influence the motivational state. Together, these results identify the VS-VP pathway as a key circuit by which aversive contexts suppress effort-based behavioral initiation, highlighting it as a potential target for treating motivational deficits in depression and schizophrenia.

Graphical abstract

Keywords

  1. motivation
  2. ventral striatum
  3. ventral pallidum
  4. DREADDs
  5. aversion
  6. nonhuman primates
  7. major depressive disorder
  8. approach-avoidance conflict
  9. avolition
  10. neuromodulation

Introduction

Motivation is the driving force that propels organisms to initiate and sustain goal-directed behavior. In daily life, it is not only shaped by rewards but also strongly influenced by aversive factors such as punishment.1 Classical reinforcement learning theories emphasize the role of goal valuation in motivating behavior,2,3 supported by findings that lowering expected value decreases the likelihood of associated actions.4,5 However, recent computational frameworks propose that the initiation of behavior may be regulated by mechanisms distinct from valuation, particularly under aversive contexts that impose additional costs, such as physical, cognitive, or temporal effort,6 on initiating behavior.7,8 This perspective provides a mechanistic account of how punishment or effort demands can suppress goal-directed behavior even when the expected outcome remains intact.
Clinically, such dissociation is evident in major depressive disorder (MDD). Some patients exhibit heightened sensitivity to punishment,9 possibly leading to devaluation of potential rewards and reduced goal-directed behavior,1,2,10 whereas others display avolition, and play a key role despite preserved valuation capacities.10,11,12 Understanding how aversive contexts suppress initiation independently of valuation is therefore critical for elucidating the neural basis of motivational deficits in psychiatric disorders such as depression and schizophrenia.
The ventral striatum (VS) and ventral pallidum (VP) are central components of motivational regulation within the basal ganglia. The VS is well established in reward processing and incentive motivation,13,14,15 and its dysfunction has been implicated in motivational deficits observed in MDD.16,17,18 The VP, in turn, encodes hedonic value and contributes to the generation of goal-directed output.19,20,21,22 Both regions are also involved in aversive motivation23,24,25,26,27 and effort-based decisions,28,29,30,31 indicating roles that extend beyond appetitive processing. Anatomically, the VS sends dense projections to the VP,32 and interfering with this ventral striatopallidal (VS-VP) pathway in primates has been associated with behavioral phenotypes reminiscent of apathy33 and compulsive behavior.34 Together, these findings suggest that the VS-VP pathway plays a critical role in regulating behavioral initiation and emotional control. However, direct causal evidence linking this pathway to the suppression of behavioral initiation under aversive contexts remains lacking.
To address this gap, we tested whether the VS-VP pathway plays a distinct role in motivation under aversive conditions. Using a chemogenetic designer receptors exclusively activated by designer drugs (DREADDs)35 approach in macaques, we selectively inhibited VS projections to the VP, while animals performed an approach-avoidance (Ap-Av) conflict task that evokes motivational conflict, a paradigm previously used to quantify negative bias in decision-making in humans17 and nonhuman primates.36,37 By combining pathway-specific manipulation with precise behavioral measures, we demonstrate that selective suppression of the VS-VP pathway restores initiation of goal-directed behavior in aversive contexts without altering decision-making. This finding provides a causal circuit mechanism for aversion-related motivational regulation.

Results

To investigate the causal role of the VS-VP pathway in goal-directed behavior under aversive conditions in macaque monkeys, an adeno-associated viral vector (AAV2.1-CaMKIIa-hM4Di-IRES-AcGFP)38 was injected into the VS under the CaMKIIα promoter39 to express the inhibitory hM4Di DREADD receptor in medium spiny neurons (MSNs) (Table S1). The viral expression was largely confined to the ventral and rostral putamen, a region classically considered part of the VS40,41,42 (Figures 1A and S4). Axon terminals derived from these neurons were observed to innervate the VP. To selectively suppress these VS inputs and assess the behavioral effects of pathway-selective inhibition, deschloroclozapine (DCZ), a selective DREADD ligand,43 was locally infused into the VP, which predominantly receives dense projections from the VS32,44,45 (Figures 1A and S1A–S1C; Table S2). Importantly, infusion of DCZ in a control animal lacking DREADD expression did not alter behavioral performance, confirming that the observed effects were specific to chemogenetic suppression rather than non-specific pharmacological effects of DCZ (Figures S1D–S1F).
Figure 1 Experimental design to study the role of the VS-VP pathway in aversive situations
(A) Schematic representation of suppressing the VS-VP pathway using an inhibitory DREADD; middle. Monkey MK#1 and MK#2 received bilateral injections of AAV-hM4Di vector into the VS. DCZ was locally infused into the VP to suppress inputs from the VS. Expression of hM4Di tagged with GFP in VS MSNs (left) and their axon fibers in the VP (right) in MK#1. The upper images were created by stitching individual panels. Cd, caudate nucleus; Put, putamen; VS, ventral striatum; IC, internal capsule; AC, anterior commissure; VP, ventral pallidum. See also Figures S1A–S1C.
(B) Schematic flow of the Ap-Av and Ap-Ap tasks. In the Ap-Av task, the monkeys chose whether to accept a reward-punishment combination or reject it for a minimal reward. The offer was presented during the cue period using two visual bars: a red bar, with its length indicating the size of the reward, and a yellow bar, with its length representing the duration of the aversive air puff. In the Ap-Ap task, the monkeys chose between two targets, with the red and yellow bars indicating the rewards for choosing the cross or square target, respectively.
(C) Average baseline rates of precue-FEs during sessions without DCZ infusion in the Ap-Ap (n = 23) and Ap-Av tasks (n = 41). The Ap-Av task produced significantly more errors than the Ap-Ap task (Wilcoxon rank-sum test, p = 0.033), confirming that aversive factors increased disengagement at trial initiation.
See also Tables S1 and S2.
Motivation in this study was defined as the behavioral tendency or capacity to initiate goal-directed actions in response to task demands. To examine how aversive factors modulate the motivational state, we designed two decision-making tasks with matched sensory and motor demands but differing motivational contexts,46,47,48 distinguished by the presence or absence of air-puff punishment (Figures 1B and 1C). Two male Japanese macaques (Macaca fuscata; MK#1 and MK#2) were trained to perform an Ap-Av task,36 which required deciding whether to accept or reject offers comprising both a reward and an aversive air puff, and an approach-approach (Ap-Ap) task,36 which involved only rewarding outcomes (see also STAR Methods). Because offers with varying outcome magnitudes were presented pseudo-randomly within each task, both motivational engagement and outcome valuation could be quantitatively assessed. This task framework thus enabled us to dissociate motivational initiation from outcome valuation and to examine how chemogenetic suppression of the VS-VP pathway affects these processes under aversive versus purely appetitive conditions.

VS-VP pathway-selective suppression prevented the typical reduction in precue-FEs during the Ap-Av task

To assess the effect of VS-VP pathway-selective suppression on the motivational state, we compared monkeys’ behavior during DCZ infusion sessions with that in corresponding control sessions (Figure 2A). Motivation was quantified by the frequency of fixation errors during the precue period (precue-FEs; Figure 1B). Precue-FEs were defined as unsuccessful trials caused by incorrect eye movements (e.g., failing to fixate the precue, omitting it, or breaking fixation before cue onset). A higher frequency of precue-FEs indicated a reduced willingness to engage in the task and initiate goal-directed behavior.
Figure 2 VS-VP pathway-selective suppression prevented the typical reduction in precue-FEs during the Ap-Av task
(A) Experimental protocol for DCZ infusion. DCZ sessions were conducted weekly following a control session, with monkeys performing either the Ap-Av or Ap-Ap task. After completing pre-trials (MK#1, 150 trials; MK#2, 100 trials), DCZ was infused into the VP for 5–10 min (gray ellipse), followed by a 10–15 min waiting period before resuming the task.
(B) Example of the change in precue-FE frequency between control and DCZ sessions in the Ap-Av task. Thin vertical lines (gray) indicate unsuccessful trials with precue-FEs, whereas thick solid lines (black, control; red, DCZ) represent cumulative precue-FEs over trials.
(C) Mean change in precue-FEs in the Ap-Av task. Left: boxplots (top, MK#1; bottom, MK#2; gray, control; red, DCZ) show the distribution of precue-FEs between paired sessions; gray dotted lines connect each pair. Mean precue-FEs were significantly lower in DCZ sessions compared with controls (Wilcoxon signed-rank test, p < 0.05). Right: normalized cumulative precue-FEs of both monkeys (total n = 12; 5 for MK#1, 7 for MK#2) over trial progression (0%–100%) after DCZ infusion. Solid lines represent the mean, shaded areas the SEM. Thicker segments indicate trial bins where differences between control and DCZ sessions were statistically significant (Wilcoxon signed-rank test, p < 0.05). The total cumulative precue-FEs were significantly lower at the end of DCZ sessions (Wilcoxon signed-rank test, p < 0.05).
(D) Example of the change in precue-FE frequency between control and DCZ sessions in the Ap-Ap task. Thin vertical lines (gray) indicate unsuccessful trials with precue-FEs; thick solid lines (black, control; blue, DCZ) represent cumulative precue-FEs over trials.
(E) Mean change in precue-FEs in the Ap-Ap task, as in (C). Left: boxplots showing the distribution of precue-FEs between paired sessions (gray, control; blue, DCZ). No significant difference was observed between control and DCZ sessions (Wilcoxon signed-rank test; n.s., MK#1, p = 0.164; MK#2, p = 0.233). Right: normalized cumulative precue-FEs over trial progression (total n = 7; 3 for MK#1, 4 for MK#2). No statistically significant differences were observed throughout the session (Wilcoxon signed-rank test; n.s., p > 0.05).
See also Figures S1D–S1J and S2A–S2D.
The pathway-selective suppression significantly reduced the occurrence of precue-FEs in the Ap-Av task. As an example, the cumulative number of precue-FEs was prominently decreased following DCZ infusion on day 2, compared with the control session on day 1 with no infusion (Figure 2B). Data from individual monkeys demonstrated a consistent decrease in the total number of precue-FEs during DCZ infusion sessions, unlike their corresponding control sessions (Wilcoxon signed-rank test, MK#1, p = 0.036; MK#2, p = 0.043; Figure 2C, left), with omission errors being particularly affected (Figure S2A). Notably, line plots of the normalized cumulative number of precue-FEs over session progress aligned to the DCZ infusion onset (0%–100%) revealed differences between control and DCZ sessions, which began to emerge around the first quarter (∼25%) of the session in MK#1 and near the final quarter (∼75%) in MK#2 and further diverged thereafter, reaching statistical significance from 82% in MK#1 and 95% in MK#2 (Wilcoxon signed-rank test, p < 0.05; Figure 2C, right; see also Figure S2B, left, for raw data on total trial numbers). To confirm that this effect was not driven by trial-number differences, we further truncated all session pairs to the global minimum trial count (Figure S2B, right). Even under this conservative normalization, DCZ significantly reduced error accumulation in the Ap-Av task (p = 0.004) but showed only a non-significant trend toward an increase in the Ap-Ap task (p = 0.063; Wilcoxon signed-rank test). Given the reduced occurrence of precue-FEs, these results suggest that suppressing the VS-VP pathway restored the monkeys' motivation to initiate goal-directed behavior in the Ap-Av task.
Interestingly, in the Ap-Ap task, VS-VP pathway-selective suppression did not restore the monkeys' motivation for task initiation. As shown in the example session (Figure 2D), the cumulative number of precue-FEs did not differ significantly between DCZ infusion and control sessions. Population data confirmed this pattern: there was no significant difference in the total number of precue-FEs at the session endpoint between DCZ and corresponding control sessions in either monkey (Wilcoxon signed-rank test, not significant [n.s.], MK#1, p = 0.164; MK#2, p = 0.233; Figure 2E, left). Consistently, line plots of the normalized cumulative number of precue-FEs over session progress aligned to the DCZ infusion onset (0%–100%) showed no significant differences between DCZ and control sessions, although modest increasing trends were observed from ∼30% of the session in MK#1 and ∼50% in MK#2 (Wilcoxon signed-rank test, n.s., p > 0.05; Figure 2E, right). Notably, similar tendencies were observed with vehicle infusions during the Ap-Av task (Figures S1G–S1J), suggesting that other factors, such as brief human intrusion during infusion, could contribute to the slight increase in precue-FEs following DCZ infusion in the Ap-Ap task.
To complement these findings, we also analyzed the effects of DCZ infusion on FEs occurring during other task epochs (cue and response periods). Pooled data from both monkeys revealed no significant changes in cue- and response-FEs between paired control and DCZ sessions in either task (Figure S2C), indicating that once the animals initiated a trial, they generally maintained engagement and performed consistently. DCZ infusion significantly decreased the total number of FEs, including precue-, cue-, and response-FEs, in the Ap-Av task, while increasing them in the Ap-Ap task (Wilcoxon signed-rank test, Ap-Av, p = 0.011; Ap-Ap, p = 0.038; Figure S2D), confirming that precue-FEs were strongly affected by the pathway-selective suppression.

Modulation of behavioral initiation by the VS-VP pathway reflects error history rather than value history

Suppression of the VS-VP pathway restored motivational initiation under aversive conditions; however, it remained unclear whether this behavioral modulation was influenced by previous trial history. In particular, initiation behavior might depend on valuation history (the reward and aversive magnitudes offered in the preceding trial) or on success history (whether the previous trial ended in an initiation error, i.e., a precue-FE). To address this question, we examined whether initiation failures on a given trial were influenced by recent trial outcomes and whether such dependencies were altered by chemogenetic suppression of the VS-VP pathway.
To examine these possibilities, we performed a trial-by-trial logistic regression analysis within each session, incorporating three predictors: previous error, previous reward magnitude, and previous aversive magnitude. Because cue combinations were pseudorandomized, the reward and punishment magnitudes on each trial were statistically independent of the previous outcome. Thus, any dependence on these variables reflects trial history rather than the effects of cue scheduling. In both the Ap-Ap and the Ap-Av tasks, previous error robustly predicted initiation failures, whereas neither previous reward nor aversive magnitude made consistent contributions (Figure 3A). These results indicate that the likelihood of failing to initiate a trial was primarily determined by whether the preceding trial ended in an error, rather than by residual valuation carried over from prior cues and outcomes.
Figure 3 Modulation of behavioral initiation by the VS-VP pathway reflects error history rather than value history
(A) Each dot represents the p value from an individual session, obtained from a logistic model of initiation failure with three predictors: previous error, previous reward magnitude, and previous aversive magnitude. Red dots indicate Ap-Av sessions; blue dots indicate Ap-Ap sessions. Short horizontal ticks show group medians. The dotted line marks p = 0.05.
(B) The serial-error effect was calculated for each session as the difference between two conditional probabilities: the probability of current failure given a previous error minus that given a previous correct trial. Gray plus signs indicate individual sessions; the horizontal line shows the median; boxes indicate interquartile ranges; and whiskers extend to 1.5 × IQR. The proportions of sessions with significant effects (Fisher’s exact test, p < 0.05) are shown.
(C) DCZ effects on repetitive errors in a context-dependent manner. Left: Ap-Av task; right: Ap-Ap task. Paired session-wise comparisons of the total number of repetitive errors (consecutive error-to-error trials) between control (gray boxes) and DCZ sessions (red, Ap-Av; blue, Ap-Ap). Thin gray lines connect matched control and DCZ sessions (solid, MK#1; dotted, MK#2). Across panels, lower values indicate fewer consecutive error-to-error trials, reflecting weaker serial propagation of initiation failures. DCZ reduced error repetition in the Ap-Av task (Wilcoxon signed-rank test, p = 0.006) but increased it in the Ap-Ap task (p = 0.028).
We next used a model-free estimation to quantify the effect of serial errors, which was defined as the difference in conditional probabilities (ΔP) between trials following previous errors and those following previous completions. ΔP values were consistently greater than zero across sessions in both the Ap-Ap and the Ap-Av tasks, confirming that recent initiation failures increased the likelihood of subsequent failures (Figure 3B).
Guided by this robust serial-error dependence, we next tested whether suppression of the VS-VP pathway modulated repetitive initiation failures. Repetitive errors, defined as consecutive error-to-error trials, were quantified from pooled data from the two monkeys and compared between control and DCZ sessions. In the Ap-Av task, DCZ significantly reduced repetitive errors relative to control (Wilcoxon signed-rank test, p = 0.006), indicating that pathway suppression dampened the serial propagation of initiation failures in aversive contexts (Figure 3C, left). By contrast, in the Ap-Ap task, DCZ slightly increased repetitive errors (Wilcoxon signed-rank test, p = 0.028; Figure 3C, right), suggesting a mild enhancement of repetition when both options were appetitive. This modest effect may, however, partly reflect non-specific factors such as human intrusion during infusion, as noted in the previous section.
Together, these analyses demonstrate that modulation of behavioral initiation by the VS-VP pathway depends on recent error history rather than on goal valuation. The pathway appears to regulate the serial propagation of initiation failures in a context-dependent manner, selectively attenuating repetition under aversive conditions.

VS-VP pathway-selective suppression shortened reaction times in the Ap-Av task

Motivation was also quantified by response time (RT), which reflects the monkeys' motivation for task execution. RT was defined as the time from target onset until the monkeys selected either a cross (+) or square (□) target in response to a presented cue (Figure 1B). Within each session, we assessed DCZ effects by comparing pre- and post-infusion trials with matched trial counts (Figure 2A; see also Figures S2E and S2F for raw RTs).
To determine how the VS-VP pathway suppression influenced motivational engagement, we analyzed RTs separately for cross and square choices (Figures 4A and 4B). In the Ap-Av task, the cross target indicates acceptance of the combination of offered reward and air puff (Ap), whereas the square target indicates rejection (Av). When data were pooled from both monkeys (n = 12), DCZ infusion significantly shortened RTs for Ap choices (p = 0.0096) but not for Av choices (Wilcoxon signed-rank test; n.s., p > 0.05; Figure 4A), indicating that Ap behavior was selectively facilitated under aversive conditions. By contrast, in the Ap-Ap task, where compound offers consisted of two reward magnitudes depicted by red and yellow-bar lengths corresponding to cross and square choices, respectively, DCZ infusion did not significantly alter RTs for either choice type (Figure 4B). Together, these results indicate that the VS-VP pathway suppression specifically restored motivational vigor associated with Ap behavior in aversive contexts, rather than affecting purely reward-driven motivation.
Figure 4 VS-VP pathway-selective suppression shortened RTs in the Ap-Av task
(A) Choice-dependent changes in RTs between paired control and DCZ sessions in the Ap-Av task. To assess DCZ effects within a session, the difference (ΔRT) was calculated between the mean RTs of pre-trials (before DCZ infusion) and post-trials (after infusion, truncated to match pre-trial counts). Boxplots show the corresponding t values of ΔRT for trials in which monkeys chose the cross (Ap; left) or square (Av; right) targets. In the Ap-Av task, the cross target indicates acceptance of the offered reward-air-puff combination (Ap), whereas the square target indicates rejection (Av). Gray and red boxes indicate control and DCZ sessions, respectively; gray solid and dotted lines connect paired sessions from MK#1 and MK#2, respectively. The horizontal dotted line indicates ΔRT = 0 (mean RTs of pre-trials). Positive t values indicate increased RTs during post-trials, and negative t values indicate decreased RTs. Data pooled from both monkeys (n = 12) showed that DCZ infusion significantly shortened RTs for Ap choices (p = 0.0096) but not for Av choices (n.s., p > 0.05; Wilcoxon signed-rank test), suggesting that VS-VP pathway suppression modulated effort-cost computation in motivation.
(B) Choice-dependent ΔRTs in the Ap-Ap task. Same format as (A). In this task, compound offers consisted of two reward magnitudes indicated by red and yellow-bar lengths, corresponding to cross (left) and square (right) choices, respectively. Data pooled from both monkeys (n = 7) showed that DCZ infusion (blue boxes) did not significantly affect RTs for either choice compared with control sessions (gray boxes; Wilcoxon signed-rank test; n.s., p > 0.05), suggesting a less prominent role of the VS-VP pathway in purely reward-driven motivation.
(C) Smoothed changes in RTs across cue offers in DCZ infusion sessions. The x axis indicates red-bar length (reward amount) and the y axis yellow-bar length (air-puff duration). Top, MK#1; bottom, MK#2. Left: pre-trials (color-coded mean RTs; reddish, longer; bluish, shorter). Middle: post-trials after DCZ infusion. Right: t values of RT differences between pre- and post-trials (reddish, decreased RT; bluish, increased RT). Regions with significant differences (two-sample t test, p < 0.05) are outlined in black in the right panels. The Ap-Av decision boundaries estimated from pre-trials (black dotted lines) and post-trials (black solid lines) are shown for reference. Detailed derivation and analysis are presented in Figures 5C and 5D and STAR Methods. Cue offers to the left of the decision boundary were more likely to elicit Av choices, whereas those to the right were more likely to elicit Ap choices.
(D) Corresponding control sessions showing t values of RT differences between pre- and post-trials.
See also Figures S2E and S2F.
To characterize how the pattern of RT modulation varied across cue offers during the Ap-Av task, we pooled all pre- and post-infusion trials across sessions for each monkey and visualized the data as a two-dimensional offer matrix after smoothing (x axis, offered reward amount; y axis, offered air-puff duration; Figure 4C). Statistical differences between pre- and post-infusion conditions were assessed at each offer combination using two-sample t tests, revealing spatially specific regions of change. The Ap-Av decision boundary was included in the matrix as a reference for the relationship between RT and choice patterns; detailed derivation and analysis are presented in Figures 5C and 5D. Consistent with a previous report,49 RTs during pre-DCZ trials were typically longer under cue conditions that were associated with Av choices (regions to the left of the standard decision boundary) than those that were associated with Ap choices (regions to the right of the decision boundary) in both monkeys. Thus, the difference in RTs between Ap and Av choices likely reflects cue-dependent variation in the motivational state.
Figure 5 VS-VP pathway-selective suppression does not influence the Ap-Av choice patterns
(A) Boxplots showing percent changes in Ap choices (%ΔAp, top) and Av choices (%ΔAv, bottom) during post-trials relative to pre-trials in the Ap-Av task. Results for MK#1 (left) and MK#2 (right). Gray boxplots represent control sessions; cyan and magenta indicate %ΔAp and %ΔAv in DCZ sessions, respectively. Gray circles denote individual sessions; gray lines connect paired control and DCZ sessions. The yellow-shaded area above the 5% threshold indicates significant choice changes.49 For both monkeys, DCZ infusions failed to induce changes exceeding this threshold across sessions (MK#1, n = 5; MK#2, n = 7). No significant differences were found in the means of %ΔAp (n.s., MK#1, p = 0.294; MK#2, p = 0.106) or %ΔAv between DCZ and control sessions (n.s., MK#1, p = 0.185; MK#2, p = 0.592; paired t test).
(B) Boxplot showing percent changes in Ap choices (%ΔAp) for individual Rew+/Air− control sessions (n = 4; light green circles) in MK#2, where the reward magnitude per red-bar length increased, and the air-puff magnitude per yellow-bar length decreased during post-trials. Ap choices significantly increased above the 5% threshold in all sessions, and the mean change in these control sessions (light green line) was significantly higher than in DCZ sessions (two-sample t test; p < 0.001).
(C) Normalization of choice patterns between pre- and post-trials using an econometric model (STAR Methods). The decision function is defined as f(x, y) = ax + by + c, where x and y represent the lengths of the red and yellow bars, respectively. The coefficients a, b, and c were estimated by fitting a generalized linear regression to behavioral choices in the pre-trials. To align decision boundaries across sessions, we applied a standardization procedure.36,50 Specifically, the original decision boundary (ax + by + c = 0, dotted line) was aligned to a standard decision boundary (a0x + b0y + c0 = 0, solid line), defined by the points (20, 0) and (50, 100). Areas to the left of the standard decision boundary represent a higher likelihood of Av choices over Ap choices, whereas areas to the right indicate a higher likelihood of Ap choices over Av choices.
(D) Effects of chemogenetic suppression on choice patterns across reward-punishment combinations by comparing standardized cumulative pre- and post-trial data (top, MK#1, n = 5; bottom, MK#2, n = 7). The far-left and second columns show mean Ap-Av decisions during pre- and post-trials in the DCZ sessions, respectively (reddish, Av choice; bluish, Ap choice). Black dotted and solid lines represent pre- and post-trial decision boundaries. The third column shows statistical maps of pre-post differences after smoothing, with t values color-coded (reddish, increased Av choices; bluish, increased Ap choices). Significant regions (two-sample t test, p < 0.05) are outlined in black. The far-right column shows corresponding control sessions without DCZ infusion. DCZ infusion produced a slight, inconsistent tendency toward more Ap choices under high air-puff conditions near the decision boundary in both monkeys.
After DCZ infusion, RTs were generally shorter across offer combinations. In MK#1, the effect was most pronounced near the decision boundary and under low-reward and low-air-puff (low-low) offers (two-sample t test, p < 0.05), suggesting restored motivation under conflict and low-intensity conditions. In MK#2, RT decreases were more evident for high-reward offers (>50%) and were particularly pronounced under high-intensity conditions involving large rewards and longer air puffs, suggesting restored motivation. By contrast, the corresponding control sessions did not show significant differences between pre- and post-trials (Figure 4D). Although these analyses do not establish a direct relationship between RTs and choice patterns, the findings raise the possibility that the improvement in behavioral initiation after DCZ infusion could partly reflect influences on valuation or conflict processing.
To complement the behavioral indices of motivation, we analyzed changes in pupil diameter during the precue period, a physiological index that reflects the arousal-related motivational state.46,47 Across sessions, DCZ infusion produced a slight, non-significant increase in pupil size, most notably in MK#1 during the Ap-Av task (Wilcoxon signed-rank test; n.s., p = 0.080; Figures S3A–S3E). When data were pooled from two monkeys, this effect reached significance for the Ap-Av task but not for the Ap-Ap task (Wilcoxon rank-sum test; Ap-Av, p = 0.002; Ap-Ap, n.s., p = 0.600). These supplementary results provide limited but suggestive evidence that the VS-VP pathway may influence arousal-related components of motivation,46 particularly under aversive conditions. In addition, water consumption per trial remained unchanged between control and DCZ sessions in either monkey (Figure S3F), indicating that DCZ infusion did not affect baseline satiety.
In sum, the selective suppression of the VS-VP pathway prevented reduction in motivation, specifically under aversive conditions, as demonstrated by reduced precue-FEs and shortened RTs in the Ap-Av task. As such effects were not observed in the Ap-Ap task, these findings highlight that the VS-VP pathway plays a causal role in regulating motivation, specifically under adversity.

VS-VP pathway-selective suppression does not influence Ap-Av choice patterns

Because the previous analyses raised the possibility that changes in behavioral initiation after DCZ infusion might also involve alterations in decision-making, we next examined whether suppression of the VS-VP pathway affected Ap-Av choice patterns. We first performed an analysis of choice changes at the single-session level (Figure 5A). We compared pre- and post-infusion trials within each session by analyzing choice distributions (Ap vs. Av) in a two-dimensional decision matrix (see STAR Methods for details). The proportion of Ap and Av choices before and after infusion did not exceed the 5% threshold49 in any DCZ or control session. Thus, DCZ-induced suppression of the VS-VP pathway did not alter Ap-Av choice behavior.
By contrast, in the “Rew+/Air−” control sessions (MK#2, n = 4; Figure 5B), the magnitude of outcomes was manipulated in post-trials to alter perceived goal value. Under these conditions, the same cue choices yielded greater rewards and reduced air puffs compared with the standard Ap-Av task, which was expected to increase Ap choices. As predicted, the proportion of Ap choices exceeded the 5% threshold and showed significant differences relative to the DCZ sessions (two-sample t test; p < 0.001). This clear behavioral shift under value manipulation, contrasted with the absence of change under DCZ infusion, demonstrates that suppression of the VS-VP pathway does not affect valuation or choice patterns in the Ap-Av task.
To further characterize how DCZ infusion influences valuation tendencies, we next examined cue-dependent variations in decision-making patterns by analyzing pooled data across sessions. Choice distributions (Ap vs. Av) across different cue combinations were standardized using an econometric normalization procedure36,50 (Figure 5C; STAR Methods). We then compared cumulative choice distributions between pre- and post-infusion trials from all DCZ sessions (MK#1, n = 5; MK#2, n = 7). The results revealed a slight increase in Ap choices near the decision boundary under high air-puff conditions, but no consistent differences were detected across the two monkeys (Figure 5D). These findings suggest that suppression of the VS-VP pathway may slightly bias valuation toward positive outcomes, although this effect was not statistically significant.
Taken together, the overall results indicate that suppression of the VS-VP pathway had a minimal impact on goal evaluation, as choice patterns were not significantly altered; however, it robustly prevented the decline in goal-directed initiation under aversive conditions, as evidenced by reduced precue-FEs and shortened RTs. Although subtle influences on valuation or conflict processing cannot be entirely excluded, the absence of significant changes in choice patterns supports its predominant role in initiating goal-directed behavior, rather than in facilitating decision-making or evaluating the goal value.

Contrasting response patterns of VS and VP neurons to the Ap-Av and Ap-Ap tasks

Our chemogenetic results indicate that suppression of the VS-VP pathway selectively restores motivation under aversive conditions. Because the VS-VP projection is known as GABAergic,51,52 inhibition of VS neuron activity could disinhibit VP neurons, thereby increasing motivation in such contexts. To test whether VS neurons convey condition-dependent signals in response to aversive stimuli and whether VP neurons exhibit complementary activity patterns, we recorded single-unit activity from both regions while alternating between the Ap-Ap and Ap-Av tasks (Figure 6A; see also Figures S4A and S4B for recording sites with hM4Di-GFP expression).
Figure 6 Contrasting response patterns of VS and VP neurons to the Ap-Av and Ap-Ap tasks
(A) Precue-period neuronal activities were recorded from the VS (top) and VP (bottom) of monkeys MK#1 and MK#2 during the required fixation on the precue until cue onset in alternative blocks of Ap-Ap and Ap-Av tasks (middle).
(B) Distribution of precue activities recorded from the VS and VP neurons. The x axis represents the t values of the difference in mean firing rates between the Ap-Av and Ap-Ap tasks (Δfiring rate). The y axis represents the binned number of neurons. Dotted vertical lines indicate the mean Δfiring rate for each population. The mean of VS precue activities was significantly greater for the Ap-Av task than for the Ap-Ap task (p = 0.048), whereas the mean of VP precue activities was significantly greater for the Ap-Ap task (t test; p = 0.028). VS neurons with significantly higher precue activity during the Ap-Av task (t test; p < 0.05) were classified as Ap-Av type (thick black box). VP neurons with significantly higher precue activity during the Ap-Ap task (t test; p < 0.05) were classified as Ap-Ap type.
(C) The boxplots show the distribution of normalized precue activities of Ap-Av type VS neurons during four alternating blocks of Ap-Ap (blue) and Ap-Av (red) tasks. These VS neurons exhibited significant changes in their normalized firing rates after each task transition (t test; p < 0.001; p < 0.01). The white line represents the median, the edges of the box represent the 25th and 75th percentiles, and the whiskers extend to the minimum and maximum values within 1.5 × IQR. Outliers beyond this range are indicated by individual cross (+) points.
(D) Mean firing rates of Ap-Av type VS neurons before and after transitioning from Ap-Ap to Ap-Av tasks. For each trial, population activities were compared with the average firing rate for the 30 trials before the transition. Unfilled circles indicate non-significant differences, colored circles indicate significant differences (t test, p < 0.05). Black circles denote significant differences for single trials, and pink circles indicate two consecutive significant differences. Pinkish shaded areas indicate intervals during which trials exhibited significant differences for more than two trials in a row.
(E) Boxplots showing the distribution of normalized precue activities of Ap-Ap type VP neurons for four alternating blocks, as in (C). The Ap-Ap type VP neurons significantly changed their activities at the transition from Ap-Ap to Ap-Av task (t test; p < 0.01; p < 0.001).
(F) Mean firing rates of Ap-Ap type VP neurons before and after the Ap-Ap to Ap-Av transition, as in (D).
See also Figures S4S6 and S7A–S7H.
Given that precue-FEs were most strongly affected by pathway-selective suppression, analyses focused on the precue activity, defined as the interval from fixation-initiation on the precue until the cue onset (Figure S5). Analysis of 209 VS neurons (MK#1, n = 50; MK#2, n = 159) revealed significantly higher precue activity during the Ap-Av task, as compared with the Ap-Ap task (t test, p = 0.048; Figure 6B, top). Conversely, analysis of 87 VP neurons (MK#1, n = 27; MK#2, n = 60) showed significantly higher precue activity during the Ap-Ap task, as compared with the Ap-Av task (p = 0.028; Figure 6B, bottom). Together, these findings indicate a contrasting preference in VS and VP neurons, with VS neurons being more active during aversive contexts (Ap-Av task) and VP neurons being more active during non-aversive contexts (Ap-Ap task). This opposing pattern suggests a potential inhibitory influence of VS neurons on VP activity, which may play a critical role in regulating motivation under aversive conditions.
To further examine the task-specific preferences of VS and VP neurons, we analyzed changes in their precue activity during transitions between the Ap-Ap and the Ap-Av tasks (Figures 6C–6F and S6). Among the 209 VS neurons, 36 were identified as the “Ap-Av type,” showing significantly higher precue activity during the Ap-Av task than during the Ap-Ap task (t test, p < 0.05; Figure 6B, top, black box; see spatial distribution in Figure S4C). These Ap-Av type VS neurons exhibited distinct and bidirectional firing changes during task transitions: their activity increased significantly when switching from the Ap-Ap to the Ap-Av task (t test, p < 0.05) and decreased when switching back from the Ap-Av to the Ap-Ap task (t test, p < 0.05; Figure 6C). Notably, their firing rates increased immediately after the transition to the Ap-Av task and remained elevated for at least 30 trials (Figure 6D), suggesting that VS neurons rapidly and persistently encode the onset of aversive contexts to modulate downstream motivational circuits.
By contrast, among the 87 VP neurons, 15 were classified as the “Ap-Ap type,” displaying significantly higher precue activity during the Ap-Ap task (t test, p < 0.05; Figure 6B, bottom, black box; see spatial distribution in Figure S4D). These Ap-Ap type VP neurons showed a marked decrease in firing rates after transitioning to the Ap-Av task, with no significant change when returning to the Ap-Ap task (Figure 6E). This unidirectional shift suggests that VP activity may be modulated primarily through inhibitory input from VS neurons under aversive conditions. Furthermore, the Ap-Ap type VP neurons required more trials to adjust their firing after the Ap-Ap → Ap-Av transition (Figure 6F), indicating a slower adaptation to aversive contexts compared with the Ap-Av type VS neurons.
Interestingly, Ap-Ap type VP neurons showed a significant decrease in firing from the early (0%–50%) to the late (50%–100%) phase of the session, whereas Ap-Av type VS neurons did not show significant modulation with trial number (Figures S7A–S7H). This pattern suggests that VS neuron activity primarily tracks the task context (e.g., aversive vs. appetitive), whereas VP neuron activity reflects gradual changes in the motivational state over the course of the session.
In summary, VS neurons encoded aversive contexts and displayed activity patterns opposing those of VP neurons during task transitions, particularly in the precue period, which is critical for task initiation. These opposing dynamics parallel the chemogenetic findings, indicating that inhibition of VS input to the VP specifically disrupts motivational regulation under aversive conditions.

VS neurons show sustained modulation following previous-trial errors

Motivational engagement is shaped not only by current task demands but also by recent experiences of success or failure. To determine whether the VS-VP pathway integrates such recent behavioral history, we analyzed within-trial activity dynamics during the precue period. Specifically, we assessed correlations between the precue firing rate of each neuron and the outcome of the immediately preceding trial (successful or error), which served as an index of the animal’s prior engagement or failure. Errors included those occurring during the precue, cue, or response periods, as any failure in the preceding trial was assumed to influence motivational engagement in the subsequent one.
A significant proportion of VS neurons in both the Ap-Av and Ap-Ap tasks exhibited positive correlations with previous-trial errors (Figure 7A). At the population level, correlation coefficients were significantly positive in both tasks (Ap-Av, r = 0.027, Wilcoxon signed-rank test against zero, p = 0.007; Ap-Ap, r = 0.037, p < 0.001), indicating that VS neurons tended to increase precue firing following error trials. Notably, neurons showing significant differences between successful and error trials during the precue period also exhibited sustained changes during the cue period (Figure 7B).
Figure 7 VS neurons show sustained modulation following the previous-trial errors
(A) Distributions of Pearson’s correlation coefficients (r) between precue firing rate and the outcome of the previous trial (0 = successful, 1 = error) in the Ap-Av (top) and the Ap-Ap (bottom) tasks. Errors included all trial phases (precue, cue, and response), reflecting how previous failures influenced motivation in the next trial. Red and blue bars indicate neurons with significant correlations (two-sided t test, p < 0.05) in the Ap-Av and Ap-Ap tasks, respectively; gray bars indicate non-significant neurons. Dashed lines mark zero correlation, and solid vertical colored lines indicate population means (Ap-Av, r = 0.027; Ap-Ap, r = 0.037), both significantly positive (Wilcoxon signed-rank test against zero, Ap-Av, p = 0.007; Ap-Ap, p < 0.001).
(B) Population activity of VS neurons with significant positive correlations in (A) (black boxes; p < 0.05, r > 0; top, Ap-Av, n = 25; bottom, Ap-Ap, n = 24). Thick lines show the normalized mean firing rate for trials following previous errors (red, Ap-Av; blue, Ap-Ap) and those following successful trials (black). Firing rates were normalized to each neuron’s mean activity during the cue period; shaded areas represent ± SEM. Vertical black lines indicate cue onset and offset. Yellow bands mark time windows showing significant differences between error and successful conditions (two-sided t test, p < 0.05). VS neurons showing positive correlations exhibited enhanced precue activity following error trials, which persisted into the cue period in both tasks.
See also Figure S7I.
By contrast, VP neurons showed weaker and more heterogeneous relationships with previous-trial outcomes (Figure S7I). Only a small subset displayed significant correlations, and the population mean did not differ significantly from zero (Ap-Av, r = 0.003, n.s., p = 0.885; Ap-Ap, r = 0.015, n.s., p = 0.155). Thus, unlike the VS, the VP did not exhibit a consistent bias toward maintaining history-dependent activity across task epochs.
Taken together, these results indicate that the VS, but not the VP, integrates information about recent behavioral outcomes to adjust motivational engagement in upcoming trials. Sustained firing after errors suggests that VS neurons retain information about previous failure, functioning as an internal error-monitoring signal that promotes re-engagement and facilitates renewed behavioral initiation after failure, particularly under aversive conditions.

Discussion

This chemogenetic study provides causal evidence that the VS-VP pathway regulates the initiation of goal-directed behavior under aversive conditions while exerting minimal effects on outcome valuation. This dissociation supports computational models proposing that behavioral initiation and evaluative processes rely on distinct neural mechanisms.7 In particular, our findings suggest that aversive contexts can suppress initiation independently of value judgment, consistent with theories emphasizing the role of effort-cost computations in motivation.8 Our trial-history analysis further showed that modulation of initiation by the VS-VP pathway depended on recent error history rather than on goal valuation, suggesting that this pathway regulates motivation to initiate behavior in a history-dependent, context-specific manner.
The VS-VP pathway appears to be preferentially engaged in aversive contexts and to play a key role in sustaining task-driven motivation. Early in sessions, reward- and task-driven motivations likely act in parallel, but as reward-driven vigor declines with satiety, behavior becomes more dependent on task-driven motivation. Under such conditions, motivational engagement could more strongly be reduced in the Ap-Av task than in the Ap-Ap task, and chemogenetic suppression of VS input to the VP could selectively attenuate this decline. Small, non-significant increases in errors observed in the Ap-Ap task or during vehicle controls likely reflect transient effects of infusion rather than specific circuit suppression. Consistent with this view, water consumption showed no notable change with DCZ infusion, suggesting that the behavioral effects were independent of satiety. These results highlight the VS-VP pathway as a critical substrate for regulating task engagement when goal pursuit conflicts with aversive outcomes.
An important consideration is that the apparent improvement in initiation following DCZ infusion could partly reflect modulation of valuation processes. Specifically, DCZ shortened reaction times for the Ap choices, raising the possibility that the VS-VP pathway contributes not only to behavioral initiation but also to certain components of valuation. However, the absence of significant effects on overall choice behavior suggests that the primary function of this pathway is to regulate motivation rather than to facilitate decision-making or subjective valuation. Any indirect influence on valuation may instead be mediated through downstream circuits, such as dopaminergic feedback to cingulate cortical areas, including the pregenual anterior cingulate cortex (pgACC),53 which are known to play critical roles in value representation.36,37
Electrophysiological recordings revealed contrasting activity patterns in the VS and VP during transitions between appetitive-only (Ap-Ap) and conflict (Ap-Av) tasks. These opposing dynamics suggest an inhibitory interaction within the pathway, potentially mediated by the GABAergic VS-VP projection.51,54,55,56 In this model, increased VS activity inhibits VP neurons and thereby reduces initiation in aversive contexts.57 Supporting this interpretation, chemogenetic suppression of the VS-VP pathway prevented the typical decline in initiation in the Ap-Av task, presumably by reducing inhibitory VS input and disinhibiting VP neurons.56
We also observed a temporal dissociation between activity in the VS and VP: VS neurons responded rapidly to the Ap-Av context, potentially serving as an early detector of aversive salience, whereas VP neurons changed more gradually and stabilized after repeated exposure to aversive conditions. This divergence may reflect complementary functions, with the VS providing rapid, adaptive inhibition and the VP maintaining longer-term regulation. Furthermore, VS neurons exhibited sustained activity linked to previous-trial errors, with enhanced precue firing that persisted into the cue period, whereas VP neurons showed little history-dependent modulation. These findings imply that the VS integrates past performance with current task demands to dynamically regulate the motivational state across trials.
Although the electrophysiological and chemogenetic data were obtained from separate experiments and cannot demonstrate any causality at the synaptic level, together they provide convergent evidence that the VS-VP pathway mediates effort-cost computations independently of goal valuation, elucidating a circuit mechanism through which aversive contexts selectively suppress behavioral initiation.
Importantly, there is also a possibility of partial circuit specialization between the Ap-Av and Ap-Ap tasks. The Ap-Av task, which requires explicit resolution of Ap-Av conflict,58 may recruit additional cortical and subcortical circuitries beyond the VS-VP pathway.17,36,59,60 Such specialization may depend on the cellular heterogeneity of the VP,61 where GABAergic and glutamatergic populations represent the internal motivational state and differentially guide Ap and Av behaviors.22,62
Moreover, subregion-specific VS-VP projections,32 such as medial versus lateral or anterior versus posterior subregions, may differentially contribute to motivational regulation.63,64,65,66 In rodents, anterior versus posterior VS-VP projections have been linked to “wanting” and “liking,” respectively.67,68 In primates, posterior regions of the VS maintain long-term value representations that support habitual seeking even in the absence of immediate reward outcomes.66 Because our recordings were located mainly in the anterior VS, which has been associated with motivational drive and wanting-like processes, our findings provide evidence for functional specialization along the anterior-posterior axis of the VS. This functional gradient suggests that chemogenetic suppression of the anterior VS-to-VP projection in our study primarily affected “wanting”, reflected in restored trial initiation and response vigor without changes in subjective valuation. Although our findings are consistent with this functional gradient, our experiment was not designed to directly test anterior-posterior specialization in primates. Future studies employing finer anatomical targeting and large-scale neural recordings will be essential to verify subregional contributions across motivational contexts.
From a clinical perspective, these findings provide mechanistic insight into motivational deficits central to psychiatric disorders. Avolition,11,69,70,71 diminished initiation despite preserved hedonic capacity, is a core symptom in MDD and schizophrenia, reflecting a dissociation between value representation and behavioral initiation. Our results refine this view by identifying the VS-VP pathway as a key regulator of initiation under aversive conditions while leaving outcome valuation largely intact.
Emerging evidence also highlights the heterogeneity of VP subcircuits in mood regulation. Distinct VP neurons projecting to the lateral habenula or ventral tegmental area mediate different depression-like symptoms in rodents,72 implying that discrete VS-VP-LHb/VTA pathways may contribute to symptom-specific motivational impairments. Targeted modulation of these circuits could thus provide more precise treatment strategies for psychiatric disorders such as depression and schizophrenia.
Finally, comparison with rodent studies emphasizes the unique value of primate models. Rodent literature has highlighted roles of VS (including nucleus accumbens) and VP in reward-driven actions68,73,74,75 and aversive processing.25,26,76,77,78 These accounts often rest on a simplified functional dichotomy between MSNs expressing dopamine D1 receptors (D1-MSNs) and those expressing D2 receptors (D2-MSNs). However, this division is not absolute79: MSNs can co-express both receptor types,80,81 and their functions vary with context,82 projection target,83,84 and anatomical gradients.67,85,86 In primates, striatal organization is more complex and parallel,79,87,88,89 limiting direct translation from rodents. Thus, this nonhuman primate study provides a crucial bridge, combining homologous striatopallidal circuitry88 with the capacity for sophisticated behavioral paradigms.

Limitations of the study

Several limitations should be acknowledged. The experiments were conducted in two male macaques, and future work should include females to examine potential sex differences.90,91 The current manipulation targeted putative GABAergic VS-VP projections, yet the specific VP cell types affected remain unidentified. Finally, future studies should dissect upstream cortical and cingulate inputs to the VS, as well as downstream VP projections to the LHb, VTA, and mediodorsal thalamus, to determine how motivational initiation and value judgment are differentially implemented across the circuit.

Resource availability

Lead contact

Requests for further information and resources should be directed to and will be fulfilled by the lead contact, Ken-ichi Amemori (amemori.kenichi.7s@kyoto-u.ac.jp).

Materials availability

This study did not generate new, unique reagents.

Data and code availability

All data reported in this paper will be shared by the lead contact upon request.
This study did not generate any unique code.
Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Acknowledgments

We thank Drs. Masayuki Matsumoto and Spyros Goulas for critical reading and constructive suggestions on the manuscript and Dr. Akihisa Kaneko and Yuto Fukushima for helping with taking care of the animals. This research was supported by Naito Foundation (K.-i.A. and S.A.); Takeda Science Foundation (K.-i.A. and S.A.); Uehara Memorial Foundation (K.-i.A.); Japan Agency for Medical Research and Development JP24gm6910012 (K.-i.A.), JP24wm0625210 (K.-i.A.), JP21jm0210081 (K.-i.A.), and JP22dm0207077 (M.T.); and Japan Society for the Promotion of Science JP24H02163 (K.-i.A.), JP21K19428 (K.-i.A.), JP21H05169 (K.-i.A.), JP20H03555 (K.-i.A.), JP20H05063 (K.-i.A.), JP22H04998 (K.-i.A.), JP22H05157 (K.-i.I.), JP21K07259 (S.A.), JP21J40030 (S.A.), and JP19H05467 (M.T.). Schematic representation in Figure 1A was partially created with BioRender.com (agreement number: HW26WJZ21C).

Author contributions

Conceptualization, J.-m.N.O. and K.-i.A.; methodology, J.-m.N.O. and K.-i.A.; software, K.-i.A.; formal analysis, K.-i.A.; investigation, J.-m.N.O., S.A., and K.-i.A.; resources, S.A., K.-i.I., K.K., M.T., and K.-i.A.; data curation, J.-m.N.O. and K.-i.A.; writing – original draft, J.-m.N.O. and K.-i.A.; writing – review and editing, J.-m.N.O., S.A., K.-i.I., K.K., M.T., and K.-i.A.; visualization, J.-m.N.O. and K.-i.A.; supervision, M.T. and K.-i.A.; project administration, J.-m.N.O. and K.-i.A.; funding acquisition, S.A., K.-i.I., M.T., and K.-i.A.

Declaration of interests

The authors declare no competing interests.

STAR★Methods

Key resources table

REAGENT or RESOURCESOURCEIDENTIFIER
Antibodies
Rabbit anti-GFP antibodyInvitrogenCat# G10362; RRID: AB_2536526
Anti-rabbit polymer HRPGeneTexCat# GTX83399; RRID: AB_11164964
Streptavidin Alexa Fluor 488InvitrogenCat# S11223
Bacterial and virus strains
AAV2.1-CaMKIIα-hM4Di-IRES-AcGFPAvailable from Ken-ichi InoueN/A
Chemicals, peptides, and recombinant proteins
Deschloroclozapine (DCZ)MedChemExpressCat# HY-42110A
Dimethyl sulfoxide (DMSO)Nacalai TesqueCat# 13408-64; CAS# 67-68-5
Tyramide Signal Amplification (TSA) kit (includes blocking reagent & TSA-biotin)Akoya BiosciencesCat# NEL700A001KT
ProLong Diamond Antifade Mountant with DAPIInvitrogenCat# P36971
Experimental models: Organisms/strains
Macaca fuscata, male (MK#1, MK#2), 5–8 years, 7.7–11.0 kgCenter for the Evolutionary Origins of Human Behavior, Kyoto UniversityN/A
Software and algorithms
NIMH MonkeyLogicNational Institute of Mental Healthhttps://monkeylogic.nimh.nih.gov
NeuroExplorer (version 5.445)Plexonhttps://plexon.com/products/neuroexplorer/
MATLAB 2024bMathWorkshttps://www.mathworks.com/
Offline SorterPlexonhttps://plexon.com/products/offline-sorter/
Other
Diaphragm Precision Micro Injection Pump Smooth Flow Pump Q SeriesTacmina CorporationModel: QI-100-TT-P-S
Oil-free Scroll Compressor (Air compressor)Anest IwataModel: SLP-07EED
Solid State Relay (Phototriac, Applied output load: 10 A at 24–240 VAC, With zero cross function, Screw terminal)OmronModel: G3NA-210B-UTU
Injectrode (tungsten electrode–silica capillary assembly)Unique Medical CorporationN/A
Monopolar Microelectrode, stainless steel (120 mm length with PIN; 1–1.5MΩ)FHCCat# UESLGESEXN1M
ViewPoint eye Tracker (monocular; 400 Hz)Arrington ResearchModel: MCU400

Experimental model and study participant details

Two male Japanese macaque monkeys (Macaca fuscata; MK#1 and MK#2; age 5–8 years old; 7.7–11.0 kg) were used according to the Guidelines for Care and Use of Nonhuman Primates, 3rd edition, with the approval of the Animal Experimental Committee of Kyoto University (approval number 24097). All animals were in good health, with normal immune status, and had no prior experimental involvement. Male subjects were selected from the institutional pool at the Center for the Evolutionary Origins of Human Behavior, Kyoto University, prioritizing individuals older than 5 years and free of ocular or systemic health issues. To avoid potential sex-related differences in punishment sensitivity, both subjects were of the same sex. Monkeys were housed in individual stainless-steel cages under temperature- and humidity-controlled conditions (22–26°C; 40–60%) and a 12-h light/dark cycle. Daily health checks were conducted by trained animal care staff. Monkeys were fed a standard daily diet along with supplementary vegetables and fruits. Water consumption was moderately restricted following the approved procedures to use water as a reward during experiments. After each daily session, the monkeys were given sufficient water to quench their thirst (minimum 30 ml/kg), while considering its potential impact on next-day performance.

Method details

Decision-making tasks

All behavioral experiments were conducted under room temperature, and the monkeys performed the tasks in a soundproof, dark booth to minimize external noise and visual distractions. We trained the two monkeys to perform a variant of the Approach–Avoidance decision-making task (Ap–Av task, Figure 1B).36 Precue-period: To initiate each trial, the monkeys were required to make a saccade to a white square fixation point (precue) at the center of the screen and keep the fixation for the specified time in order to continue the task. The required fixation time varied across sessions, ranging from a minimum of 0.8 seconds to a maximum of 5.2 seconds. Cue-period: Following the precue-period, a visual cue consisting of red and yellow bars appeared in the same location as the precue. The length of the red bar was directly proportional to the amount of reward delivery (water or isotonic drink), while the yellow bar length was directly proportional to the duration of the air-puff delivery. In each trial, the length of both bars was independently and pseudo-randomly varied across 1 to 100 steps. The monkeys were required to start a gaze at the cue within the time limit (2.0–3.0 s) and maintain the gaze during the cue presentation period (1.0–1.5 s). Response-period: After the cue-period, two choice targets (a white cross and a white square) appeared: one on the left and the other on the right of the cue. The placement of the targets was randomly altered in each trial for MK#2. The monkeys were required to indicate their decisions by making a saccade to either target within the target presentation period (2.0–4.0 s). If they gazed at the cross (+) target for the specified time (0.3–0.5 s), the choice was considered “approach”. If they gazed at the square (□) target for the specified time (0.3–0.5 s), the choice was considered “avoidance.” Outcome-period: When “approach” was chosen, a specific amount of water indicated by the length of the red bar was given to the monkeys. After the reward delivery, a pre-indicated duration of air-puff was delivered to the monkeys’ faces based on the length of the yellow bar shown during the cue. When “avoidance” was chosen, a fixed amount of minimal reward was given in order to maintain the monkeys’ willingness to perform the task. An inter-trial interval (ITI) was inserted after each trial for the specified time. If the monkeys successfully completed each trial, the current trial was counted as a 'successful trial'. However, if the monkeys made incorrect eye movements, such as ignoring the visual guidance (omission) or not maintaining eye fixation on it (break), the trial was terminated and categorized as a 'fixation error', depending on the phase in which the error occurred. In addition, as control to establish conditions sufficient to alter decision-making, we implemented “Rew+ / Air–” control sessions in MK#2 (Figure 5B). In these sessions, the reward per unit length of the red bar increased, and the air-puff per unit length of the yellow bar decreased during post-trials compared to pre-trials.
We also trained the monkeys to perform the Approach–Approach decision-making task (Ap–Ap task, Figure 1B). The procedure of the Ap–Ap task was identical to that of the Ap–Av task, except that both the red and yellow bars represented rewards. When the monkey chose the cross (+) target, a specific amount of water indicated by the length of the red bar was delivered as a reward. When the monkey chose the square (□) target, a specific amount of water indicated by the length of the yellow bar was delivered as a reward. We adjusted the amount of reward per unit length of the red bar to be 1.0–2.0 times greater than that of the yellow bar.
A pump (Diaphragm Precision Micro Injection Pump Smooth Flow Pump Q Series; QI-100-TT-P-S, Tacmina Corporation) regulated the amount of reward delivered through the water tube. This amount was controlled proportionally to analog signals (DC 4–20 mA) sent from a computer operating the task. An oil-free scroll air compressor (SLP-07EED, Anest Iwata) produced the compressed air used for air-puff. The delivery duration of the air-puff was regulated by a solenoid valve (AB31-02-3, CKD) that was activated by a solid state relay (phototriac; applied output load: 10 A at 24–240 VAC; with zero cross function and screw terminal; G3NA-210B-UTU, Omron). This relay received analog signals of DC 5 V from the task computer, transmitted via transistor-transistor logic (TTL). Eye movements were monitored using an infrared light camera and eye-tracking software (ViewPoint Eye Tracker; Monocular; 400Hz; MCU400, Arrington research). The tasks were controlled through a NIMH-MonkeyLogic system (National Institute of Mental Health). Analog signals of the eye movements were collected with task event markers in real time through Plexon's OmniPlex system (Plexon).

Recording chamber implantation

After training the monkeys MK#1 and MK#2 in the tasks, we conducted a sterile surgery to implant a plastic recording chamber on each monkey's skull using bone cement and ceramic screws. Prior to the surgeries, we performed magnetic resonance (MR) imaging with T1-weighted turbo spin echo (0.3 Tesla, 1 mm slice thickness; 7 Tesla, 0.3 mm slice thickness) to identify stereotaxic coordinates of target brain regions. For both MR imaging and surgeries, we anesthetized the monkeys with intramuscular injections of ketamine (5–8 mg/kg) and atropine (0.005–0.025 mg/kg) with xylazine (0.2 mg/kg). Anesthesia was maintained by inhalation of 1–2.5% sevoflurane or 1–2% isoflurane with 1.5 L O2. Daily antibiotic injections were administered starting one day before surgery and continued for at least one week afterward. After full recovery, a second surgery was performed under anesthesia and aseptic conditions, using the same procedures as described above, to remove the skull over the target regions.

Electrophysiology and neuronal analysis

After the chamber implantation, we performed in vivo single-unit recordings of neuronal activities in the VS and VP while the monkeys performed both the Ap–Ap and Ap–Av tasks in alternating blocks. The transition between each block of these tasks occurred without prior notification after a preset number of successful trials. All recordings were conducted under room temperature conditions, and the monkeys performed the tasks inside a soundproof, dark booth to minimize external visual and auditory stimuli.
We carefully inserted a sterile glass-coated electrode with an impedance of 0.5–2.5 MΩ (Alpha-Omega) through a hole, spaced at 1 mm intervals, on a plastic grid placed over the recording chamber, using an oil hydraulic micromanipulator (MO-97A, Narishige) and determined its location through the aid of MR images. Digitized and amplified neural signals were collected along with task event markers in real time through the OmniPlex system (Plexon). We classified neural signals into single-unit activities by using offline Sorter (Plexon). Action potentials were aligned and classified according to task event markers, using NeuroExplorer (version 5.445; Plexon). For further detailed analyses of firing rates and spike widths, we employed MATLAB 2024b (MathWorks). Data from fixation error trials, including omissions and fixation breaks, were excluded from analysis.

Viral vector injections

We utilized a type of inhibitory DREADDs known as hM4Di (i.e., a mutated inhibitory G-protein-coupled human muscarinic receptor 4), which suppresses neuronal activity upon activation and effectively silences the targeted neurons. We produced a mosaic adeno-associated viral vector (AAV2.1-CaMKIIα-hM4Di-IRES-AcGFP) optimized for high transgene expression in the primate brain as described previously38 with the aim of introducing hM4Di receptors into GABAergic MSNs within the VS. We employed CaMKIIα (Calcium/calmodulin-dependent protein kinase II subunit α) promoter that enabled hM4Di receptors to express specifically in MSNs of the VS.39 We conducted a surgery for the viral vector injections under anesthesia and sterile conditions in a fully equipped operating room. We injected AAV2.1-CaMKIIα-hM4Di-IRES-AcGFP (2.0 x 1013 gc/ml for monkey MK#1; 2.5 x 1013 gc/ml for monkey MK#2) bilaterally into the VS (Figure 1A; Table S1). The target regions were estimated using MR imaging and in vivo single-unit recordings, based on the relative position of holes on the recording grid. For monkey MK#1, the injections were manually performed into the right and left hemispheres through the grid holes, using a beveled needle of a 10 μL Hamilton microsyringe. For monkey MK#2, the injections were conducted into the right and left hemispheres through a stainless cannula connected to a 25 μL Hamilton microsyringe via fused silica tubing, controlled by a syringe pump (KD Scientific; WPI). The total amount of viral aliquots injected for monkey MK#1 was 7 μL into the left hemisphere and 8.2 μL into the right hemisphere (1.0–2.2 μL per track, four tracks for each hemisphere). Immunohistochemical analysis of hM4Di-AcGFP expression in MK#1 revealed that several injection tracks did not result in detectable receptor expression (Figures 1A and S4). Despite this partial expression, chemogenetic manipulation produced clear behavioral effects in MK#1. Therefore, in MK#2, we adjusted the total injection volume to 3 μL per hemisphere (1.5 μL per track, two tracks for each hemisphere) in order to achieve a comparable level of receptor expression. We waited at minimum 10 weeks after the viral vector injections to ensure valid expression of the hM4Di receptors.92

Local infusions of drugs

In order to silence the hM4Di receptors, we employed deschloroclozapine (DCZ), a novel and highly selective actuator.43 We dissolved DCZ (HY-42110A, MedChemExpress) in dimethyl sulfoxide (DMSO; 13408-64; CAS# 67-68-5, Nacalai Tesque) at a concentration of 1 mg DCZ per mL of DMSO. These stock solutions were stored in a –80°C deep freezer. We prepared a fresh working solution on the day of usage by diluting the stock solution in saline to concentrations of 342.02 or 684.04 nM. We used an injectrode (tungsten electrode–silica capillary assembly; Unique Medical Corporation) consisting of a tungsten electrode (diameter, 200 μm; 0.2–1.5 MΩ) and a silica capillary tube (outer diameter, 150 μm), enclosed in a polyimide tube (outer diameter, 480 μm) and connected to a 25 μL Hamilton microsyringe controlled by a syringe pump (KD Scientific; WPI). This injectrode enabled us to infuse DCZ into the VP while recording neuronal activities from this region. We placed the injectrode in the VP, 1.0–1.5 mm deeper than the ventral edge of the anterior commissure (AC–VP border) (Figures S1A–S1C). We maintained a consistent task type throughout the week for the monkeys to perform, either the Ap–Av or Ap–Ap task, and infused DCZ, or vehicle, once a week when their performance consistently stabilized over two consecutive days. After the monkeys completed a set number of pre-trials (150 for MK#1; 100 for MK#2), we locally infused the working solution of DCZ into the VP. For monkey MK#1, the DCZ infusion was performed bilaterally, delivering 1.5–2.0 μL per hemisphere at a rate of 0.15–0.2 μL/min (Table S2). However, immunohistochemical analysis of hM4Di-AcGFP expression in MK#1 revealed asymmetric expression between hemispheres, raising the possibility that the observed behavioral effects in MK#1 may have resulted predominantly from unilateral modulation. To examine the impact of unilateral manipulation, DCZ infusion in MK#2 was carried out unilaterally, with 0.7–1.0 μL delivered into a single hemisphere at a rate of 0.1 μL/min. The task resumed approximately 5–13 minutes after the infusion was completed. As previously reported,43 we also verified that DCZ administration alone was not associated with significant alterations in goal-directed behavior in the non-DREADD control monkey (n = 1; MK#2), in which bilateral DCZ infusion into the VP was performed prior to viral vector injection (342.02 nM, 2.0 μL per hemisphere; Figures S1D–S1F).

Histology and immunostaining

For histological identification, we inserted a monopolar stainless steel electrode (120 mm length with PIN; 1–1.5MΩ; UESLGESEXN1M, FHC) to make iron deposits in the VP region of monkey MK#1, where DCZ had been infused via the injectrode (Figure S1C). Using the same procedure as in in vivo single-unit recordings, we positioned the electrode at the AC–VP border. Electrolytic deposition of iron ions occurred at the electrode tip by applying a negative current of 10 μA for 10 s. Following the deposition, we waited approximately for 3 min. On the day following the deposition, monkey MK#1 was administered a deep anesthesia using an overdose of secobarbital sodium (24 mg/kg, i.v.). Subsequently, transcardial perfusion was performed, first using 0.1 M phosphate-buffered saline (PBS; pH 7.4), followed by 10% formalin in 0.1 M PBS. Following an overnight immersion in 10% formalin at 4°C, the brain underwent a gradual saturation process with sucrose in 0.02% sodium azide (Sigma) in 0.1 M PBS. This process consisted of increasing sucrose concentration from 10% over five nights, then to 20% over four nights, and finally to 30% over ten nights. The brain was frozen with dry ice and sliced into 40-μm-thick coronal sections using a freezing sliding microtome. Sections were preserved in 0.02% sodium azide in 0.1 M PBS.
To visualize immunoreactive signals of hM4Di-AcGFP, sections underwent a series of steps. After rinsing 3 times for 5 min in 0.01 M PBS, sections were pre-treated with 3% H2O2 for 20 min. After 5 times rinse in 0.01 M PBS containing 0.2% Triton X-100, the sections were pre-treated in tyramide signal amplification (TSA) blocking reagent (NEL700A001KT, AKOYA Biosciences) in 0.01 M PBS for 30 min, and then incubated with rabbit anti-GFP antibody (1:1000; G10362; AB_2536526, Invitrogen) in the same fresh medium for 60 min at RT and subsequently overnight at 4°C. After 5 times rinse in PBS, the sections were incubated with anti-rabbit polymer HRP (GTX83399, GeneTex) solution for 1 h. After rinsing 5 times, the sections were treated in TSA biotin working solution (NEL700A001KT, Akoya Biosciences) in TSA blocking reagent for 15 min. The sections were rinsed 5 times and then incubated in TSA blocking reagent containing Streptavidin Alexa Fluor 488 (1:1000; S11223, Invitrogen) for 60 min. We rinsed the sections 3 times in 0.01 M PBS, mounted onto glass slides coated with gelatin, and air-dried. We then coverslipped the sections using ProLong Diamond Antifade Mountant with DAPI (P36971, Invitrogen). Immunoreactive signals in the regions of interest were observed and digitally imaged with a confocal fluorescence microscope (BZ-X800; Keyence Corp., IL, USA) fitted with 2×, 4×, 10×, and 40× objectives. The images were viewed and processed using BZ-X analyzer software (Keyence Corporation, IL, USA) and exported as TIFF images.

Quantification and statistical analysis

Experimental design and data analysis

Behavioral and electrophysiological experiments were replicated in two monkeys (MK#1 and MK#2) across multiple sessions. Because the experimenter performed real-time electrophysiological recordings and DCZ infusions, blinding during data collection was not feasible. However, behavioral and neuronal data were analyzed using automated scripts without reference to condition identity to minimize bias. Sample sizes were not predetermined by formal power analysis. Numbers of animals (two macaques), sessions, and recorded units were determined based on feasibility in nonhuman primate experiments. The within-subject, session-paired design increased statistical power, and all key effects were replicated across both animals and multiple sessions.

Fixation errors during the precue period

All statistical analyses were performed at the session level. The experiments followed a weekly paradigm in which each DCZ session was counterbalanced and flanked by control sessions conducted on the preceding and following days. Because task performance was relatively stable from day to day but exhibited slower fluctuations across weeks, each DCZ session was paired with the immediately preceding control session to minimize week-to-week drift (within-subject, matched task conditions).
Given the modest number of sessions and the possibility that the assumption of normality might not hold, both parametric and nonparametric tests were performed. Paired t-tests were used for within-subject comparisons between control and DCZ sessions, and two-sample t-tests were used for independent comparisons. In parallel, nonparametric Wilcoxon signed-rank tests were applied to confirm the robustness of these results, and Wilcoxon rank-sum tests were used for between-session summary statistics where appropriate. All tests were two-sided, and exact p-values are reported. Statistical significance was defined as P < 0.05.
To quantify motivational changes induced by VS–VP pathway manipulation, we analyzed fixation errors occurring during the precue period (precue-FEs; Figures 2B–2E, S2A, and S2B), which reflect failures to initiate goal-directed behavior. Because session lengths (total trial numbers) varied across control and DCZ sessions, the accumulated number of precue-FEs in each session was computed up to the minimum trial count of each control–DCZ pair to ensure matched session lengths. For each monkey, the total accumulated precue-FEs were then compared between control and DCZ sessions (Figures 2C and 2E, left).
To examine how the effect of DCZ evolved over time within a session, trial progression was normalized to a common 0–100% scale, where 0% and 100% corresponded to the first and last trials up to the pairwise minimum trial count. This truncation and normalization procedure aligned data across sessions and minimized potential influences of differences in session length or total trial number. The cumulative number of precue-FEs was then plotted as a function of normalized trial progression, and trajectories were compared between control and DCZ sessions (Figures 2C and 2E, right). See also Figures S2C and S2D for cue-FEs, response-FEs, and total number of FEs.

Trial-history effects on initiation

To determine whether behavioral initiation was influenced by previous trial history, we analyzed how initiation failures (i.e., precue-FEs) depended on the outcome and valuation of the preceding trial (lag-1). For each trial, the following predictors were extracted from the immediately previous trial: (i) previous error (binary: 1 = error, 0 = successful), (ii) previous reward magnitude, and (iii) previous aversive magnitude. Cues were pseudo-randomized, ensuring that these variables were statistically independent of the offer of the current trial.
Within each session, the probability of an initiation failure on trial t was modeled using a logistic regression including the three lag-1 predictors:
P(failuret)=11+e(β0+β1·prev_errort1+β2·prev_rewardt1+β3·prev_aversivet1).
Statistical significance for each coefficient was assessed with a two-sided Wald test (P < 0.05). To provide a model-free summary of the serial-error effect, we computed the difference in conditional probabilities between consecutive trials:
ΔP=P(failuret|prev=error)P(failuret|prev=complete),
which quantifies how often initiation failures tended to repeat.
Finally, repetitive errors (consecutive error-to-error sequences) were counted for each session and compared between control and DCZ conditions using Wilcoxon signed-rank tests. All analyses were conducted separately for the Ap–Av and Ap–Ap tasks, and results were pooled from the two monkeys for population summaries (Figure 3).

Reaction time analyses

Reaction time (RT) during the response period was defined as the latency from target onset to target selection. Each DCZ session was paired with a corresponding control session recorded within the same week and under the same task condition (Ap–Av or Ap–Ap).
For the choice-dependent RT analysis (Figures 4A and 4B), RTs were categorized according to the chosen target: in the Ap–Av task, selection of the cross (+) target indicated acceptance of the combined offer (approach, Ap), whereas selection of the square (□) target indicated rejection of the offer (avoidance, Av). In the Ap–Ap task, cross and square targets corresponded to the reward magnitudes indicated by the red and yellow bar lengths, respectively. For each session, mean RTs were calculated separately for each choice type. The effects of DCZ infusion were evaluated by comparing the mean RTs for each choice type between DCZ and control sessions using the Wilcoxon signed-rank test (two-sided). This procedure was conducted separately for the Ap–Av and Ap–Ap tasks to determine whether VS–VP pathway suppression differentially affected approach- and avoidance-related responding.
To examine how RT modulation depended on cue offers, RTs were visualized as two-dimensional offer matrices (Figures 4C and 4D). Matrix analyses were performed both for individual monkeys and on pooled data, to characterize how the pattern of RT modulation varied across cue offers. In the Ap–Av task, the x- and y-axes represented the offered reward amount and air-puff duration, respectively, whereas in the Ap–Ap task, they represented the two offered reward magnitudes associated with the cross and square targets. After pooling all trial-wise RT data across sessions, we computed local mean RTs on a two-dimensional offer grid by convolving the data with a two-dimensional square kernel (20% × 20% window). This procedure effectively smoothed the data and reduced sampling noise, providing a continuous map of RT modulation across all offered conditions. For each offer cell, two-sample t-tests were applied to compare pre- versus post-infusion trials. Regions showing significant differences (P < 0.05) were highlighted on the heatmaps. The decision boundary estimated from the behavioral choice model (see STAR Methods section choice model) was overlaid on the Ap–Av RT maps to relate RT modulation to cue combinations that favored Av choices (left of the boundary) or Ap choices (right of the boundary). The boundary itself was derived independently of the RT data. All analyses were implemented in MATLAB 2024b (MathWorks).

Pupil diameter analysis

Pupil diameter was recorded together with eye position using an infrared eye-tracking system (ViewPoint Eye Tracker; Monocular; 400 Hz; MCU400, Arrington Research). Analyses focused on the precue period, defined as the interval from the onset of precue fixation to cue presentation.
Preprocessing
For each session, raw pupil size data were segmented into individual trials. Within each trial, a fixed precue window was extracted, and a five-sample median filter was applied to suppress transient artifacts caused by blinks or noise. Missing samples were linearly interpolated, and traces were smoothed with a 100-ms moving average.
Statistical analysis
For each session, the injection trial index was identified, and the change from pre- to post-injection was quantified as an independent-samples t statistic by comparing trial means between pre-injection and post-injection blocks, excluding the earliest and latest trials to avoid unstable estimates. For paired sessions, t statistics were computed for both DCZ and the matched control sessions. Group-level comparisons were conducted using both parametric and nonparametric tests: unpaired comparisons (DCZ vs. control) used independent-samples t tests and Wilcoxon rank-sum tests, whereas paired comparisons used paired t tests and Wilcoxon signed-rank tests. Analyses were performed both within individual monkeys and across all sessions combined, with nonparametric tests considered primary (Figures S3A–S3E).

Quantifying changes in decisions

The change in Ap and Av choices between the pre- and post-trials after DCZ infusion was quantified using the following method (Figure 5A). Each decision was mapped onto a decision matrix and convolved with a 20 × 20 point square window. After convolution, each choice datum was stacked in each cell of the 100 × 100 point decision matrix. We calculated the t-statistics for each point using the stacked choice data and employed Fisher's exact test to determine statistically significant differences (P < 0.05 at each point) between the two blocks. The increase in Av choices between the pre- and post-trials was represented by the total area of points in the decision matrix showing a significant increase in Av (%ΔAv). Similarly, the increase in Ap choices was represented by the total area of points exhibiting a significant increase in Ap (%ΔAp).
To characterize the choice pattern, we performed econometric modeling (Figure 5C). Specifically, the probability of choosing the cross target (pAP) was calculated using the logistic function pAP = 1 / (1 + exp(–f(x, y))). The function was parameterized as f(x, y) = ax + by + c, where x and y represent the lengths of the red and yellow bars, respectively. The coefficients a, b, and c were determined through generalized linear regression fitted to the behavioral choices in the pre-trials. We performed a normalization procedure on the choice data in both the pre- and post-trials. For the dataset that produced the decision boundary ax + by + c = 0, we applied the procedure described previously36,50 to align it with the standard decision boundary a0x + b0y + c0 = 0. The standard decision boundary was set through the points (20, 0) and (50, 100). For data to the left of the decision boundary, x was resized as x' = x Xa / Xb, where Xa = –(c + by) / a and Xb = –(c0 + b0y) / a0. For data to the right of the decision boundary, x was resized as x' = 100 – (100 – x) X"a / X"b, where X"a = 100 + (c + by) / a and X"b = 100 + (c0 + b0y) / a0. This procedure ensured that the decision boundary derived from the transformed data corresponded to the standard decision boundary. We applied this normalization procedure to the choice data in the pre- and post-trials independently. After normalization, session data were pooled per monkey, mapped onto the decision matrix, and convolved with a 20 × 20 point square. We conducted a t-test at each time point to evaluate the differences between pre- and post-trials. The resulting t-values reflect the magnitude of change in Ap and Av choices (Figure 5D).

Neuronal activity during the precue period

To assess neuronal correlates of motivational initiation, analyses of VS and VP activity focused on the precue period, during which monkeys were required to maintain fixation on the central precue until cue presentation (Figures 6A and S5). Each recording session consisted of alternating blocks of Ap–Ap and Ap–Av tasks, allowing the same neuron to be recorded over multiple unsignaled task transitions. Only stably isolated units from sessions containing at least two unsignaled task transitions were included, enabling within-cell adaptation analysis.
For each neuron, the mean firing rates during the Ap–Av and Ap–Ap tasks were compared. Differences were expressed as t-values and plotted as histograms to visualize population distributions (Figure 6B). The population mean of t-values was tested against zero to determine whether overall activity showed systematic task-related bias. For classification, the same trial sets used for population analyses were compared, and the task preference of each neuron was expressed as the t-value of the firing rate difference (Ap–Av – Ap–Ap). Neurons with significantly higher firing during Ap–Av trials were defined as “Ap–Av type,” and those with higher firing during Ap–Ap trials as “Ap–Ap type” (t-test, P < 0.05; Figure S6). Neuron counts and recording locations for significantly biased populations are shown in Figures S4C and S4D. See Figures S7A–S7H for firing activity of these task-specific biased populations across variables: trial number (low, 0–50%; high, 50–100%), reward magnitude (low, 0–50%; high, 50–100%), air-puff magnitude (low, 0–50%; high, 50–100%), and choice target.
For each task-specific group, trial-by-trial firing rates were ratio-normalized by dividing activity by the mean firing rate of that neuron across all trials, allowing comparison across neurons with different baseline activity levels. Normalized precue activities were computed for each of the four alternating task blocks and visualized as box plots showing the median, interquartile range, and outliers beyond 1.5 × IQR (Figures 6C and 6E).
To characterize adaptation dynamics during block transitions, trial-wise mean firing rates were computed for each neuron (Figures 6D and 6F). For each post-transition trial, firing rates were compared with the mean of the 30 pre-transition trials (t-test, P < 0.05). Trials showing significant differences were marked individually (black circles), and consecutive significant trials were clustered (two consecutive, pink; ≥3 consecutive, pink-shaded intervals).

History-dependent neuronal activity

To examine how neural activity reflected previous-trial outcomes, we analyzed correlations between each neuron’s firing rate during the precue period and the outcome of the immediately preceding trial (0 = successful, 1 = error; Figures 7 and S7I). Errors included those occurring during the precue, cue, and response periods; similar results were obtained when the analysis was restricted to precue fixation errors. For each neuron, the mean firing rate within the precue period (from fixation onset to cue onset) was computed across all trials and correlated with the binary outcome variable using Pearson’s correlation coefficient (r). Statistical significance was determined with a two-sided test (P < 0.05). The distribution of r values across neurons was tested against zero using a Wilcoxon signed-rank test.
For neurons showing significant correlations, time-resolved firing activity was further analyzed by aligning spike trains to cue onset and computing normalized firing rates in sliding 100 ms bins (50 ms step). Firing rates at each time bin were normalized by dividing by the mean firing rate during the cue period. This normalization enabled comparison of relative modulations across neurons with different baseline activity levels and facilitated the detection of sustained effects extending into the cue period. Population averages were then computed across neurons within each condition, separately for trials following error and compete outcomes. Periods showing significant differences the two conditions were identified using two-sided t-tests (P < 0.05) and are indicated by shaded yellow bands in Figure 7.

Supplemental information (2)

Document S1. Figures S1–S7 and Tables S1 and S2
Document S2. Article plus supplemental information

References

1.
Seymour, B. ∙ Singer, T. ∙ Dolan, R.
The neurobiology of punishment
Nat. Rev. Neurosci. 2007; 8:300-311
2.
Adams, C.D. ∙ Dickinson, A.
Instrumental responding following reinforcer devaluation
Q. J. Exp. Psychol. B. 1981; 33:109-121
3.
Colwill, R.M. ∙ Rescorla, R.A.
Postconditioning devaluation of a reinforcer affects instrumental responding
J. Exp. Psychol.: Anim. Behav. Processes. 1985; 11:120-132
4.
Dickinson, A. ∙ Balleine, B.
Motivational control of goal-directed action
Anim. Learn. Behav. 1994; 22:1-18
5.
Yin, H.H. ∙ Knowlton, B.J.
The role of the basal ganglia in habit formation
Nat. Rev. Neurosci. 2006; 7:464-476
6.
Meyniel, F. ∙ Goodwin, G.M. ∙ Deakin, J.W. ...
A specific role for serotonin in overcoming effort cost
eLife. 2016; 5, e17282
7.
Pessiglione, M. ∙ Vinckier, F. ∙ Bouret, S. ...
Why not try harder? Computational approach to motivation deficits in neuro-psychiatric diseases
Brain. 2018; 141:629-650
8.
Meyniel, F. ∙ Safra, L. ∙ Pessiglione, M.
How the brain decides when to work and when to rest: dissociation of implicit-reactive from explicit-predictive computational processes
PLoS Comput. Biol. 2014; 10, e1003584
9.
Allen, T.A. ∙ Lam, R.W. ∙ Milev, R. ...
Early change in reward and punishment sensitivity as a predictor of response to antidepressant treatment for major depressive disorder: a CAN-BIND-1 report
Psychol. Med. 2019; 49:1629-1638
10.
Eshel, N. ∙ Roiser, J.P.
Reward and punishment processing in depression
Biol. Psychiatry. 2010; 68:118-124
11.
Bylsma, L.M. ∙ Morris, B.H. ∙ Rottenberg, J.
A meta-analysis of emotional reactivity in major depressive disorder
Clin. Psychol. Rev. 2008; 28:676-691
12.
Barch, D.M. ∙ Dowd, E.C.
Goal representations and motivational drive in schizophrenia: the role of prefrontal–striatal interactions
Schizophr. Bull. 2010; 36:919-934
13.
Hollerman, J.R. ∙ Tremblay, L. ∙ Schultz, W.
Influence of reward expectation on behavior-related neuronal activity in primate striatum
J. Neurophysiol. 1998; 80:947-963
14.
Kelley, A.E.
Ventral striatal control of appetitive motivation: role in ingestive behavior and reward-related learning
Neurosci. Biobehav. Rev. 2004; 27:765-776
15.
Natsubori, A. ∙ Tsutsui-Kimura, I. ∙ Nishida, H. ...
Ventrolateral striatal medium spiny neurons positively regulate food-incentive, goal-directed behavior independently of D1 and D2 selectivity
J. Neurosci. 2017; 37:2723-2733
16.
Pizzagalli, D.A. ∙ Holmes, A.J. ∙ Dillon, D.G. ...
Reduced caudate and nucleus accumbens response to rewards in unmedicated individuals with major depressive disorder
Am. J. Psychiatry. 2009; 166:702-710
17.
Ironside, M. ∙ Amemori, K.I. ∙ McGrath, C.L. ...
Approach-avoidance conflict in major depressive disorder: congruent neural findings in humans and nonhuman primates
Biol. Psychiatry. 2020; 87:399-408
18.
Malone, D.A. ∙ Dougherty, D.D. ∙ Rezai, A.R. ...
Deep brain stimulation of the ventral capsule/ventral striatum for treatment-resistant depression
Biol. Psychiatry. 2009; 65:267-275
19.
Smith, K.S. ∙ Berridge, K.C.
The ventral pallidum and hedonic reward: neurochemical maps of sucrose “liking” and food intake
J. Neurosci. 2005; 25:8637-8649
20.
Tachibana, Y. ∙ Hikosaka, O.
The primate ventral pallidum encodes expected reward value and regulates motor action
Neuron. 2012; 76:826-837
21.
Root, D.H. ∙ Melendez, R.I. ∙ Zaborszky, L. ...
The ventral pallidum: Subregion-specific functional anatomy and roles in motivated behaviors
Prog. Neurobiol. 2015; 130:29-70
22.
Fujimoto, A. ∙ Hori, Y. ∙ Nagai, Y. ...
Signaling incentive and drive in the primate ventral pallidum for motivational control of goal-directed action
J. Neurosci. 2019; 39:1793-1804
23.
Jensen, J. ∙ McIntosh, A.R. ∙ Crawley, A.P. ...
Direct activation of the ventral striatum in anticipation of aversive stimuli
Neuron. 2003; 40:1251-1257
24.
Wulff, A.B. ∙ Tooley, J. ∙ Marconi, L.J. ...
Ventral pallidal modulation of aversion processing
Brain Res. 2019; 1713:62-69
25.
Reynolds, S.M. ∙ Berridge, K.C.
Positive and negative motivation in nucleus accumbens shell: bivalent rostrocaudal gradients for GABA-elicited eating, taste “liking”/“disliking” reactions, place preference/avoidance, and fear
J. Neurosci. 2002; 22:7308-7320
26.
Cromwell, H.C. ∙ Berridge, K.C.
Where does damage lead to enhanced food aversion: the ventral pallidum/substantia innominata or lateral hypothalamus?
Brain Res. 1993; 624:1-10
27.
Saga, Y. ∙ Richard, A. ∙ Sgambato-Faure, V. ...
Ventral pallidum encodes contextual information and controls aversive behaviors
Cereb. Cortex. 2017; 27:2528-2543
28.
Schouppe, N. ∙ Demanet, J. ∙ Boehler, C.N. ...
The role of the striatum in effort-based decision-making in the absence of reward
J. Neurosci. 2014; 34:2148-2154
29.
Hauber, W. ∙ Sommer, S.
Prefrontostriatal circuitry regulates effort-related decision making
Cereb. Cortex. 2009; 19:2240-2247
30.
Farrar, A.M. ∙ Font, L. ∙ Pereira, M. ...
Forebrain circuitry involved in effort-related choice: Injections of the GABAA agonist muscimol into ventral pallidum alter response allocation in food-seeking behavior
Neuroscience. 2008; 152:321-330
31.
Salamone, J.D. ∙ Correa, M.
The neurobiology of activational aspects of motivation: exertion of effort, effort-based decision making, and the role of dopamine
Annu. Rev. Psychol. 2024; 75:1-32
32.
Haber, S.N. ∙ Lynd, E. ∙ Klein, C. ...
Topographic organization of the ventral striatal efferent projections in the rhesus monkey: an anterograde tracing study
J. Comp. Neurol. 1990; 293:282-298
33.
Grabli, D. ∙ McCairn, K. ∙ Hirsch, E.C. ...
Behavioural disorders induced by external globus pallidus dysfunction in primates: I. Behavioural study
Brain. 2004; 127:2039-2054
34.
Worbe, Y. ∙ Baup, N. ∙ Grabli, D. ...
Behavioral and movement disorders induced by local inhibitory dysfunction in primate striatum
Cereb. Cortex. 2009; 19:1844-1856
35.
Roth, B.L.
DREADDs for neuroscientists
Neuron. 2016; 89:683-694
36.
Amemori, K. ∙ Graybiel, A.M.
Localized microstimulation of primate pregenual cingulate cortex induces negative decision-making
Nat. Neurosci. 2012; 15:776-785
37.
Amemori, K. ∙ Amemori, S. ∙ Graybiel, A.M.
Motivation and affective judgments differentially recruit neurons in the primate dorsolateral prefrontal and anterior cingulate cortex
J. Neurosci. 2015; 35:1939-1953
38.
Kimura, K. ∙ Nagai, Y. ∙ Hatanaka, G. ...
A mosaic adeno-associated virus vector as a versatile tool that exhibits high levels of transgene expression and neuron specificity in primate brain
Nat. Commun. 2023; 14, 4762
39.
Mayford, M. ∙ Bach, M.E. ∙ Huang, Y.Y. ...
Control of memory formation through regulated expression of a CaMKII transgene
Science. 1996; 274:1678-1683
40.
Haber, S.N. ∙ Mcfarland, N.R.
The concept of the ventral striatum in nonhuman primates
Ann. N. Y. Acad. Sci. 1999; 877:33-48
41.
Giménez-Amaya, J.M. ∙ McFarland, N.R. ∙ De Las Heras, S. ...
Organization of thalamic projections to the ventral striatum in the primate
J. Comp. Neurol. 1995; 354:127-149
42.
Heimer, L.
The subcortical projections of allocortex: similarities in the neuronal associations of the hippocampus, the piriform cortex and the neocortex
Raven. 1975; 173-193
43.
Nagai, Y. ∙ Miyakawa, N. ∙ Takuwa, H. ...
Deschloroclozapine, a potent and selective chemogenetic actuator enables rapid neuronal and behavioral modulations in mice and monkeys
Nat. Neurosci. 2020; 23:1157-1167
44.
Heimer, L.
A new anatomical framework for neuropsychiatric disorders and drug abuse
Am. J. Psychiatry. 2003; 160:1726-1739
45.
Basar, K. ∙ Sesia, T. ∙ Groenewegen, H. ...
Nucleus accumbens and impulsivity
Prog. Neurobiol. 2010; 92:533-557
46.
Bradley, M.M. ∙ Miccoli, L. ∙ Escrig, M.A. ...
The pupil as a measure of emotional arousal and autonomic activation
Psychophysiology. 2008; 45:602-607
47.
Lang, P.J. ∙ Bradley, M.M. ∙ Cuthbert, B.N.
Emotion, motivation, and anxiety: brain mechanisms and psychophysiology
Biol. Psychiatry. 1998; 44:1248-1263
48.
Dickinson, A. ∙ Pearce, J.M.
Inhibitory interactions between appetitive and aversive stimuli
Psychol. Bull. 1977; 84:690-711
49.
Amemori, S. ∙ Graybiel, A.M. ∙ Amemori, K.-I.
Cingulate microstimulation induces negative decision-making via reduced top-down influence on primate fronto-cingulo-striatal network
Nat. Commun. 2024; 15, 4201
50.
Amemori, K.I. ∙ Amemori, S. ∙ Gibson, D.J. ...
Striatal microstimulation induces persistent and repetitive negative decision-making predicted by striatal beta-band oscillation
Neuron. 2018; 99:829-841.e6
51.
Záborszky, L. ∙ Cullinan, W.E.
Projections from the nucleus accumbens to cholinergic neurons of the ventral pallidum: a correlated light and electron microscopic double-immunolabeling study in rat
Brain Res. 1992; 570:92-101
52.
Price, J.L.
Limbic system
Ramachandran, V.S. (Editor)
Encyclopedia of the Human Brain
Academic Press, 2002; 695-702
53.
Vander Weele, C.M.V. ∙ Siciliano, C.A. ∙ Tye, K.M.
Dopamine tunes prefrontal outputs to orchestrate aversive processing
Brain Res. 2019; 1713:16-31
54.
Chrobak, J.J. ∙ Napier, T.C.
Opioid and GABA modulation of accumbens-evoked ventral pallidal activity
J. Neural Transm. Gen. Sect. 1993; 93:123-143
55.
Smith, Y. ∙ Shink, E. ∙ Sidibé, M.
Neuronal circuitry and synaptic connectivity of the basal ganglia
Neurosurg. Clin. N. Am. 1998; 9:203-222
56.
Gallo, E.F. ∙ Meszaros, J. ∙ Sherman, J.D. ...
Accumbens dopamine D2 receptors increase motivation by decreasing inhibitory transmission to the ventral pallidum
Nat. Commun. 2018; 9, 1086
57.
Soares-Cunha, C. ∙ Heinsbroek, J.A.
Ventral pallidal regulation of motivated behaviors and reinforcement
Front. Neural Circuits. 2023; 17, 1086053
58.
Elliot, A.J.
Handbook of Approach and Avoidance Motivation
Psychology Press, 2013
59.
Friedman, A. ∙ Homma, D. ∙ Gibb, L.G. ...
A corticostriatal path targeting striosomes controls decision-making under conflict
Cell. 2015; 161:1320-1333
60.
Bryant, K.G. ∙ Barker, J.M.
Arbitration of approach-avoidance conflict by ventral hippocampus
Front. Neurosci. 2020; 14, 615337
61.
Yang, L. ∙ Fang, L.Z. ∙ Lynch, M.R. ...
Transcriptomic landscape of mammalian ventral pallidum at single-cell resolution
Sci. Adv. 2024; 10, eadq6017
62.
Stephenson-Jones, M. ∙ Bravo-Rivera, C. ∙ Ahrens, S. ...
Opposing contributions of GABAergic and glutamatergic ventral pallidal neurons to motivational behaviors
Neuron. 2020; 105:921-933.e5
63.
Iwaoki, H. ∙ Hori, Y. ∙ Hori, Y. ...
Functional heterogeneity within the primate ventral striatum for motivational regulation
J. Neurosci. 2025; 45, e2430242025
64.
Sgambato-Faure, V. ∙ Worbe, Y. ∙ Epinat, J. ...
Cortico-basal ganglia circuits involved in different motivation disorders in non-human primates
Brain Struct. Funct. 2016; 221:345-364
65.
Berridge, K.C. ∙ Robinson, T.E. ∙ Aldridge, J.W.
Dissecting components of reward:‘liking’,‘wanting’, and learning
Curr. Opin. Pharmacol. 2009; 9:65-73
66.
Kang, J. ∙ Kim, H. ∙ Hwang, S.H. ...
Primate ventral striatum maintains neural representations of the value of previously rewarded objects for habitual seeking
Nat. Commun. 2021; 12, 2100
67.
Castro, D.C. ∙ Cole, S.L. ∙ Berridge, K.C.
Lateral hypothalamus, nucleus accumbens, and ventral pallidum roles in eating and hunger: interactions between homeostatic and reward circuitry
Front. Syst. Neurosci. 2015; 9, 90
68.
Berridge, K.C.
‘Liking’ and ‘wanting’ food rewards: Brain substrates and roles in eating disorders
Physiol. Behav. 2009; 97:537-550
69.
Krynicki, C.R. ∙ Upthegrove, R. ∙ Deakin, J.F.W. ...
The relationship between negative symptoms and depression in schizophrenia: a systematic review
Acta Psychiatr. Scand. 2018; 137:380-390
70.
Trémeau, F. ∙ Nolan, K.A. ∙ Malaspina, D. ...
Behavioral validation of avolition in schizophrenia
Schizophr. Res. 2012; 138:255-261
71.
Foussias, G. ∙ Remington, G.
Negative symptoms in schizophrenia: avolition and Occam's razor
Schizophr. Bull. 2010; 36:359-369
72.
Knowland, D. ∙ Lilascharoen, V. ∙ Pacia, C.P. ...
Distinct ventral pallidal neural populations mediate separate symptoms of depression
Cell. 2017; 170:284-297.e18
73.
Smith, K.S. ∙ Tindell, A.J. ∙ Aldridge, J.W. ...
Ventral pallidum roles in reward and motivation
Behav. Brain Res. 2009; 196:155-167
74.
Richard, J.M. ∙ Ambroggi, F. ∙ Janak, P.H. ...
Ventral pallidum neurons encode incentive value and promote cue-elicited instrumental actions
Neuron. 2016; 90:1165-1173
75.
Carvalho Poyraz, F.C. ∙ Holzner, E. ∙ Bailey, M.R. ...
Decreasing striatopallidal pathway function enhances motivation by energizing the initiation of goal-directed action
J. Neurosci. 2016; 36:5988-6001
76.
Moaddab, M. ∙ Ray, M.H. ∙ McDannald, M.A.
Ventral pallidum neurons dynamically signal relative threat
Commun. Biol. 2021; 4, 43
77.
Khan, H.A. ∙ Urstadt, K.R. ∙ Mostovoi, N.A. ...
Mapping excessive “disgust” in the brain: Ventral pallidum inactivation recruits distributed circuitry to make sweetness “disgusting”
Cogn. Affect. Behav. Neurosci. 2020; 20:141-159
78.
Reynolds, S.M. ∙ Berridge, K.C.
Emotional environments retune the valence of appetitive versus fearful functions in nucleus accumbens
Nat. Neurosci. 2008; 11:423-425
79.
Soares-Cunha, C. ∙ Coimbra, B. ∙ Sousa, N. ...
Reappraising striatal D1- and D2-neurons in reward and aversion
Neurosci. Biobehav. Rev. 2016; 68:370-386
80.
Aizman, O. ∙ Brismar, H. ∙ Uhlén, P. ...
Anatomical and physiological evidence for D1 and D2 dopamine receptor colocalization in neostriatal neurons
Nat. Neurosci. 2000; 3:226-230
81.
Bonnavion, P. ∙ Varin, C. ∙ Fakhfouri, G. ...
Striatal projection neurons coexpressing dopamine D1 and D2 receptors modulate the motor function of D1-and D2-SPNs
Nat. Neurosci. 2024; 27:1783-1793
82.
Soares-Cunha, C. ∙ Domingues, A.V. ∙ Correia, R. ...
Distinct role of nucleus accumbens D2-MSN projections to ventral pallidum in different phases of motivated behavior
Cell Rep. 2022; 38, 110380
83.
Kupchik, Y.M. ∙ Brown, R.M. ∙ Heinsbroek, J.A. ...
Coding the direct/indirect pathways by D1 and D2 receptors is not valid for accumbens projections
Nat. Neurosci. 2015; 18:1230-1232
84.
Lu, X.Y. ∙ Ghasemzadeh, M.B. ∙ Kalivas, P.W.
Expression of D1 receptor, D2 receptor, substance P and enkephalin messenger RNAs in the neurons projecting from the nucleus accumbens
Neuroscience. 1998; 82:767-780
85.
Soares-Cunha, C. ∙ Coimbra, B. ∙ David-Pereira, A. ...
Activation of D2 dopamine receptor-expressing neurons in the nucleus accumbens increases motivation
Nat. Commun. 2016; 7, 11829
86.
Al-Hasani, R. ∙ McCall, J.G. ∙ Shin, G. ...
Distinct subpopulations of nucleus accumbens dynorphin neurons drive aversion and reward
Neuron. 2015; 87:1063-1077
87.
He, J. ∙ Kleyman, M. ∙ Chen, J. ...
Transcriptional and anatomical diversity of medium spiny neurons in the primate striatum
Curr. Biol. 2021; 31:5473-5486.e6
88.
Izpisua Belmonte, J.C.I. ∙ Callaway, E.M. ∙ Caddick, S.J. ...
Brains, genes, and primates
Neuron. 2015; 86:617-631
89.
Parent, A. ∙ Hazrati, L.N.
Functional anatomy of the basal ganglia. I. The cortico-basal ganglia-thalamo-cortical loop
Brain Res. Brain Res. Rev. 1995; 20:91-127
90.
Li, S.H. ∙ Graham, B.M.
Why are women so vulnerable to anxiety, trauma-related and stress-related disorders? The potential role of sex hormones
Lancet Psychiatry. 2017; 4:73-82
91.
Parker, G. ∙ Brotchie, H.
Gender differences in depression
Int. Rev. Psychiatry. 2010; 22:429-436
92.
Nagai, Y. ∙ Kikuchi, E. ∙ Lerchner, W. ...
PET imaging-guided chemogenetic silencing reveals a critical role of primate rostromedial caudate in reward evaluation
Nat. Commun. 2016; 7, 13605

Article metrics