The “Doomsday Machine”: A nuclear arsenal programmed to automatically explode should the Soviet Union come under nuclear attack. “Skynet”: An artificial general superintelligence system that becomes self-aware and launches a series of nuclear attacks against humans to prevent itself from being shut down. “War Operation Plan Response (WOPR)”: A supercomputer that is given access to the US nuclear arsenal, programmed to run continuous war games, and comes to believe a simulation of global thermonuclear war is real. These systems appear in the movies Dr. Strangelove, The Terminator, and WarGames. However, the idea of automating nuclear weapons use—that is, creating a computer system that, once turned on, could launch nuclear weapons on its own without further human interventionFootnote 2 —is not just a Hollywood plot device.
The Soviet Union actually developed a semi-autonomous system called “Dead Hand” or “Perimeter” during the Cold War, which may still be active in some form today.Footnote 3 Once Dead Hand was activated, it monitored for evidence of a nuclear attack against the Soviet Union using a network of radiation, seismic, and air pressure sensors. Upon detecting that a nuclear weapon had been exploded on Soviet territory, it would check to see if there was an active communications link to the Soviet leadership. If the communications link was inactive, Dead Hand would assume Soviet leadership had been killed and transfer nuclear launch authority to a lower-level official operating the system in a protected bunker. Today, technology related to automating the use of lethal force has moved significantly beyond what was possible in the Cold War due to advances in computing and artificial intelligence.Footnote 4 Some scholars have even advocated for the United States to develop its own Dead Hand system.Footnote 5
We assess the impact of automation in the nuclear realm by focusing on nuclear coercion: the threatened use of nuclear weapons to deter actors from changing the status quo or to compel them to alter it. Actually using nuclear weapons against another nuclear-armed state with a secure second-strike capability would be irrational, since it comes with a very high risk of retaliation and complete destruction. On the other hand, threatening to use nuclear weapons to coerce an adversary, and taking active steps to increase the risks of nuclear use, could be rational because this strategy enables a state to potentially achieve its objectives without having to actually resort to nuclear war.Footnote 6
This research question intersects with two foundational debates in political science. First, are nuclear weapons useful for coercion at all, and, if so, what factors make the threatened use of nuclear weapons more credible? Although nuclear weapons do appear to be effective at deterring major nuclear attacks against a country’s homeland due to the likelihood of retaliation and mutually assured destruction (MAD), scholars vigorously disagree about their coercive utility beyond these fundamentally defensive scenarios. While some argue that nuclear weapons can be effective tools of coercion,Footnote 7 others contend that “despite their extraordinary power, nuclear weapons are uniquely poor instruments of compellence.”Footnote 8 The second debate our project overlaps with is whether emerging technologies—such as autonomous systems powered by artificial intelligence (AI)—aid or detract from strategic stability.Footnote 9 The effects of automation on the use of nuclear weapons for coercive purposes, however, remain mostly unexplored.
We contribute to these debates by theorizing that greater automation in the nuclear realm—while incredibly dangerous—can enhance the credibility of nuclear threats and better enable states to win games of nuclear brinkmanship. The logic of our argument is simple. Unless such considerations are built into their programming, computers do not care that actually using nuclear weapons in a particular situation would be irrational or enormously costly. Consequently, by developing and activating an automated nuclear weapons system, leaders can threaten to use nuclear weapons more credibly than when a human retains complete control over the process.Footnote 10 Like other strategies, such as public threats that put a leader’s reputation on the line and launch-on-warning policies, automated nuclear weapons systems enable leaders to more effectively “tie their hands,” increase nuclear risks, and throw the proverbial steering wheel out of the car in a game of chicken.Footnote 11
We test our expectations with preregistered experiments on an elite sample of UK Members of Parliament (MPs) and two representative samples of the UK public. The experimental scenario involves a future Russian invasion of Estonia. While nuclear powers like the United States, China, the United Kingdom, and France have made some level of public commitment to maintain human control over nuclear weapons, Russia has not made this kind of pledge.Footnote 12 All three studies yield at least some evidence—albeit somewhat inconsistent evidence—that Russian nuclear threats implemented via automation are more credible and/or effective than nuclear threats implemented via non-automated means. Troublingly, these dynamics suggest that nuclear-armed states may have incentives to automate elements of nuclear command and control to make their threats more credible.
This study makes several key contributions. First, the results move the debate forward about whether and under what conditions nuclear weapons have coercive utility. Our findings show that automated nuclear weapons launch systems can indeed provide states leverage in international crises. Second, while automated nuclear systems may have the theoretical potential to be stabilizing, our project suggests they can also be effectively used in highly destabilizing and malign ways, such as offensive and revisionist coercive efforts (for example, a Russian invasion of Estonia). Third, given the lack of real-world historical data on the use of automated systems in nuclear coercion, our experimental approach provides unique insights. Fourth, this paper speaks to broader debates about how technological advances will shape warfare, given that most military applications will occur outside the nuclear realm. If automated threats generate credibility in the nuclear realm, it may suggest something about conventional deterrence and coercion as well, which is a potential avenue for future research.
Although we find that automating nuclear use can potentially be useful for coercion, this does not at all suggest that states should adopt these systems given the ways automation could increase the risk of nuclear use and the significant ethical questions associated with reducing human control over weapons of mass destruction. However, states should prepare for the possibility that their adversaries might consider automating elements of nuclear command and control if they believe it will increase their coercive leverage.
Debates in the Literature
The Debate over Nuclear Weapons and Coercion
Sun Tzu famously said, “For to win one hundred victories in one hundred battles is not the acme of skill. To subdue the enemy without fighting is the acme of skill.” The logic of this argument is that actually fighting costs blood and treasure, whereas achieving your goals without using military force preserves lives, money, and other resources like land and weapons. One way to achieve foreign policy objectives without fighting is to utilize coercion, which involves the threat to do something—such as use military force—in the future. Coercion can either be utilized to (1) deter, which involves preventing a target from taking an action, or to (2) compel, which entails convincing a target to take an action. The latter type of coercion is generally accepted as more difficult to achieve because, in that case, it will be obvious a target “gave in” to the threat, which causes the kind of embarrassment leaders typically wish to avoid. Compellence is also more likely to fail because psychological biases tend to make those attempting to maintain the status quo more resolved than those trying to change it, and giving in to compellent threats can weaken a target’s material power relative to the status quo in ways that may make resisting future aggression more challenging.Footnote 13 Threats can also involve either punishment (for example, “if you attack my country, then I will hurt your country”) or denial (for example, “you can try to attack my country, but I will prevent you from successfully doing so”).
Although there are many factors that impact the efficacy of coercive efforts, a critical one is a state’s military capabilities. For example, the greater a state’s capacity to engage in deadly violence, the more pain it can realistically threaten. Nuclear weapons, then, would ostensibly confer significant coercive leverage given their immense destructive capabilities. As Pape said, “Nuclear weapons can almost always inflict more pain than any victim can withstand.”Footnote 14 However, for coercion to achieve its desired goal (that is, for it to be effective), the target must believe that if they violate the coercer’s threat, then they will face the promised punishment or be denied their objective (that is, the threat must be credible). Or, at least, the target must believe there is an unacceptably high risk of the punishment being imposed or of a denial effort succeeding. The question of whether nuclear threats are credible has led to two fundamental debates among scholars that are germane to this project. First, are nuclear weapons useful at all for coercion, especially coercion that does not involve deterrence and self-defense? Second, if nuclear weapons can aid in coercion, then what factors make the threatened use of nuclear weapons more credible?
With respect to the first question, there are two schools of thought that can broadly be labeled as nuclear coercion skeptics and nuclear coercion believers. Skeptics question whether the threat of nuclear use by State A against State B is credible, except for direct self-defense against a potentially existential attack. When facing a nuclear-armed opponent with a secure second-strike capability, using nuclear weapons may simply be irrational because it has a high chance of leading to devastating nuclear retaliation.Footnote 15 In accordance with this logic, some studies find that nuclear-armed states are less likely to go to warFootnote 16 and crises that involve nuclear-armed states are more likely to end without violence.Footnote 17 The fact that the Cold War between the United States and Soviet Union stayed cold and never escalated into World War III is also frequently cited as evidence of the pacifying effects of nuclear weapons.Footnote 18
Even against non-nuclear states that cannot threaten nuclear retaliation, nuclear threats may also lack credibility because the political, economic, and social costs of carrying out the threat would be extremely high.Footnote 19 Some scholars argue there is a global nuclear taboo, which is “a de facto prohibition against the use of nuclear weapons … [that] involves expectations of awful consequences or sanctions to follow in the wake of a taboo violation.”Footnote 20 This logic may help explain why countries have not used nuclear weapons even when they possess an asymmetric nuclear edge (for example, the United States’ war in Vietnam). More systematically, using statistical analysis and case studies, Sechser and Fuhrmann find little evidence that nuclear weapons are useful for more than self-defense. The reason, they conclude, “is that it is exceedingly difficult to make coercive nuclear threats believable. Nuclear blackmail does not work because threats to launch nuclear attacks for offensive political purposes fundamentally lack credibility.”Footnote 21
On the other hand, nuclear coercion believers contend that nuclear threats can be made credible, even outside the context of deterrence and self-defense. One key argument that purportedly helps solve the problem of credibility is Schelling’s theory of nuclear brinkmanship, or the “threat that leaves something to chance.”Footnote 22 Although actually using nuclear weapons against a nuclear-armed state with a second-strike capability is illogical due to the dynamics of MAD, states may be able to take certain actions to increase the risk of nuclear use and thus convince adversaries that nuclear threats could end up being acted upon.
This paper directly contributes to this first debate by theorizing and empirically testing whether nuclear weapons can be useful for coercion in a case that involves clear offensive objectives rather than self-defense. It also contributes to the second debate, about the factors that make the threatened use of nuclear weapons more credible. One option is for states to delegate the authority to use nuclear weapons to lower-level military commanders (for example, Dead Hand). Doing so raises the risk that “rogue military officers could take matters into their own hands and release nuclear weapons.”Footnote 23 In other words, handing the authority to launch nuclear weapons to an entity that is less likely to be dissuaded by the risks of MAD can make nuclear threats more credible.Footnote 24 This policy option is closely related to this project, which examines the impact of delegating nuclear launch authority to a machine rather than another human.
There are several other factors that have been posited to make the use of nuclear weapons more credible. These include nuclear superiority relative to an opponent,Footnote 25 putting nuclear forces on hair-trigger alert,Footnote 26 making public threats that engage audience costs,Footnote 27 or adopting the so-called “madman strategy.”Footnote 28
While all of these strategies can increase risks by leaving “something to chance,” at the end of the day a human being still has to press the nuclear button. As Pauly and McDermott said, “Barring a preexisting doomsday machine, leaders still must make a conscious choice to use nuclear weapons, even in response to an attack that is assumed to be so provocative as to demand one.”Footnote 29 Nevertheless, it is precisely the potential use of a doomsday machine, such as the one in Dr. Strangelove or the Soviets’ Dead Hand system, that we are interested in and turn to now.
The Debate over Automation and Strategic Stability
Given the unthinkable destructiveness that a nuclear war would bring, a key concern of scholars and policymakers is maintaining strategic stability. Strategic stability can be defined narrowly as the lack of incentives for any country to attempt a disarming nuclear first strike or, more broadly, as the lack of military conflict between nuclear-armed states.Footnote 30 One major debate is whether greater automation in the nuclear realm will enhance or undermine strategic stability. States can automate nuclear processes via hard-coded and rules-based computer systems, such as Dead Hand, or more complex and cutting-edge machine learning programs that make inferences from patterns in a training data set and then perform tasks without explicit instructions (namely, AI).Footnote 31 We derive two broad schools of thought from the existing literature—nuclear automation optimists and pessimists. Though very few scholars advocate automating nuclear use, there is a potential nuclear automation optimist argument for why it could bolster strategic stability.
One key argument automation optimists make is that new technologies—such as hypersonic weapons, stealthy nuclear-armed cruise missiles, and underwater nuclear-armed drones—substantially compress the time available to respond to a nuclear first strike. This compression makes such an attack more likely to succeed in crippling a country’s nuclear command, control, and communications systems (NC3) before it can retaliate, or to induce poor and escalatory decision making, thus undermining strategic stability.Footnote 32 To avoid such a scenario, countries can adopt automated nuclear systems (like Dead Hand) to help ensure retaliation and enhance nuclear deterrence.
Other optimistic arguments include the claim that AI can enhance the accuracy of nuclear early warning systems by processing information more efficiently.Footnote 33 However, this capability would be less valuable when a country’s adversary has only a small nuclear arsenal, which presents less of a big-data problem. AI-enabled systems may also be able to search for vulnerabilities in a country’s own NC3. Nevertheless, nearly all scholars—even relative optimists—argue there is substantial risk in removing humans from decisions about the use of nuclear weapons.Footnote 34
The argument automation pessimists make, by contrast, is that implementing automation into nuclear systems is highly dangerous and likely to undermine strategic stability. One key risk with these systems is simply that they will fail.Footnote 35 An oft-cited example is the “man who saved the world” on 26 September 1983, when a Soviet early warning system falsely indicated that an American nuclear strike was incoming. Lieutenant Colonel Stanislav Petrov, who was the key Soviet officer on duty, believed this was a false alarm due to computer error and decided not to report this warning up the chain of command. If he had reported it, then that could have led the Soviet political leadership to immediately order nuclear retaliation. There are other historical examples of nuclear early-warning systems reporting false positives during the Cold War, and automated systems have failed in other contexts. For example, Boeing’s 737 MAX Maneuvering Characteristics Augmentation System (MCAS) was responsible for two fatal commercial airline crashes in 2018 and 2019. Automation systems based on machine learning are particularly opaque in terms of how they make decisions, which could lead to errors that would be less likely in hard-coded, rules-based systems like Dead Hand. The problem is made worse by the lack of extensive real-world data on nuclear war that can be used to effectively train these systems (thankfully).Footnote 36 Furthermore, automated nuclear systems may reflect the biases of their programmers and of societies at large, which can contribute to system failure.Footnote 37
Two other issues associated with automation compound the dangers of system failure. The first is automation bias, where humans put too much trust in computers and become less likely to question or critically analyze their conclusions and recommendations.Footnote 38 The risk is that if an automated nuclear system makes a mistake, then the next human in a situation similar to Petrov’s may decide to just trust the machine. A second concern is the high speeds with which computers operate, which might prevent humans from having the time to recognize and correct a mistake made by an automated nuclear system.Footnote 39 For example, in the 2010 “flash crash,” automated stock market trading systems caused the market to lose around one trillion dollars’ worth of value in a matter of minutes. Although the stock market was able to recover, the destruction caused by the use of even a single nuclear weapon would be impossible to fully ameliorate.
Finally, another principal concern is that automated nuclear systems could be hacked.Footnote 40 Malign state or nonstate actors could leverage this vulnerability to try to start a nuclear war between their enemies, or a state actor could attempt to prevent an adversary from having the capability to retaliate in response to a nuclear first strike attempt.
While nuclear automation pessimists have explored the many ways in which automation could undermine strategic stability, much less work has focused on how automation could be utilized for compellent and/or offensive efforts to acquire territory or achieve other revisionist foreign policy objectives. We fill this gap by developing a theory about how automated nuclear systems can be used to enable highly aggressive and revisionist foreign policies.
Theory: Tying Hands via Automation
Nuclear brinkmanship contests between two nuclear-armed states can be thought of as a game of chicken. In the classic example of a game of chicken, two cars are barreling toward each other and the first car to swerve loses. “Winning” requires one actor convincing the other that they will not swerve, even though a head-on collision would, of course, be disastrous. A similar situation exists in games of nuclear brinkmanship, as one actor must convince the other of the credibility of their nuclear threats even though using nuclear weapons against another nuclear-armed country has a high likelihood of leading to disaster. One way to do this is to throw the steering wheel out of the car, demonstrating your resolve and degrading or removing your ability to steer the car out of harm’s way.Footnote 41 As previously discussed, several strategies have been proposed that might have this kind of effect. For example, delegating launch authority to lower-level officers, making public threats, and putting weapons on hair-trigger alert. However, none of these strategies eliminates human choice, as someone must still actively decide to launch nuclear weapons barring mechanical failure.Footnote 42 Since doing so is arguably irrational, the credibility of such threats may still be uncertain.
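The underlying incentives can be summarized with a standard chicken payoff matrix. The specific payoffs below are purely illustrative assumptions rather than estimates from our studies, with mutual escalation as by far the worst outcome for both players:

\[
\begin{array}{c|cc}
 & \text{Swerve} & \text{Stand firm} \\ \hline
\text{Swerve} & (0,\ 0) & (-1,\ +1) \\
\text{Stand firm} & (+1,\ -1) & (-100,\ -100)
\end{array}
\]

With payoffs like these, each side prefers to stand firm only if it expects the other to swerve, which is why credibly removing one’s own ability to swerve can push the opponent toward conceding.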
By contrast, the development and deployment of automated nuclear weapons systems may remove human agency to an even greater degree than these other policies and thus increase threat credibility. Unless such considerations are built into their programming, computers do not care that actually using nuclear weapons in a particular situation would be irrational or unimaginably costly. Therefore, automated nuclear systems may enable leaders to more literally “tie their hands.” As Reiter explained, “Perhaps the most extreme form of tying hands is giving a computer the ability to implement a commitment. This solves the credible commitment problem, because a computer is untroubled if an action is genocidal or suicidal.”Footnote 43 Or consider the following logic from Dr. Strangelove himself:
Mr. President, it is not only possible [for the doomsday machine to be triggered automatically and impossible to untrigger], it is essential. That is the whole idea of this machine, you know. Deterrence is the art of producing in the mind of the enemy the fear to attack. And so, because of the automated and irrevocable decision-making process which rules out human meddling, the doomsday machine is terrifying. It’s simple to understand. And completely credible, and convincing.
Of course, automated nuclear launch systems do not completely eliminate human choice. Humans would have to design such a system, activate it, and (potentially, depending on the programming) not override it if the conditions were met to launch a nuclear attack. Political leaders may be loath to take these actions given the risks of escalation and the loss of personal control it entails.Footnote 44 There are therefore many factors that likely impact the credibility of claims that an automated nuclear weapons system has been activated. This includes the stakes of the scenario, the technological capabilities of the country, and the perceived risk tolerance of the country’s leadership. Coercers could potentially increase the credibility of this claim by (1) providing some kind of demonstration or test of the automated system, (2) making public statements that engage audience costs, which prior work shows even operate in autocracies to some extent,Footnote 45 and (3) issuing private statements to target countries, which previous research also shows can enhance credibility.Footnote 46 Alternatively, they might purposefully maintain strategic ambiguity—much like Washington’s policy vis-à-vis Taiwan—to create some level of uncertainty while also lessening the reputational costs and escalatory risks of explicitly announcing the activation of an automated nuclear system. A necessary condition for our argument to operate is that a claim to have built and activated an automated nuclear weapons launch system is at least somewhat credible, meaning—at minimum—it is perceived as having a non-zero probability. Even if the probability is less than 100 percent, uncertainty about whether a country has actually activated such a system could increase credibility by heightening perceptions that a country has perhaps taken the risky step of reducing human control over nuclear weapons. Although how countries handle this uncertainty will likely vary, we expect the following preregistered hypothesis to hold on average.Footnote 47
H1 Nuclear threats implemented via automated launch systems will increase perceptions in target audiences that a country will use nuclear weapons if their threatened red line is violated compared to nuclear threats implemented via non-automated procedures.
For threats to achieve their desired goal (that is, for them to be effective), target states must believe they might actually face the stated punishment if they cross the threatening country’s red line. Consequently, threat credibility is a key mechanism explaining threat effectiveness. Given H1, we next hypothesize that nuclear threats implemented via automated systems will enhance threat effectiveness relative to threats implemented without automation. If an adversary believes a country is more likely to utilize nuclear weapons when they have an automated launch system, then they should be more likely to back down to avoid the tremendous pain such an attack would inflict. Leaders might also be better able to justify to their domestic constituents why they are backing down from previous promises if they can point to new information in the form of an automated nuclear threat.Footnote 48
H2 Nuclear threats implemented via automated launch systems will increase target audiences’ willingness to back down compared to nuclear threats implemented via non-automated procedures.
Given that these are probabilistic rather than deterministic arguments, we do not expect threats implemented via automated launch systems to always be effective. Many factors will impact the likelihood of success. For instance, the higher the stakes are for the target of coercion, the lower the probability, in general, that they will give in. Nuclear coercion—whether automated or non-automated—involving threats of denial rather than punishment may also be more likely to succeed given findings about airpower.Footnote 49 Additionally, per the aforementioned consensus among scholars, compellent threats should be less likely to be effective in general than deterrent ones. As Art and Greenhill conclude from a review of the literature, “successful nuclear compellence is not impossible, but it is difficult and dangerous to execute.”Footnote 50 However, while the success rate of compellent nuclear threats should be lower than deterrent ones whether implemented via automation or not, we do not anticipate compellence will be uniquely challenging in the case of automation. In both scenarios, the reputational and other challenges with successfully compelling targets to change their behavior should be present. We thus expect relative differences in credibility between automated and non-automated threats to persist for both compellence and deterrence.
Although we theorize automated nuclear weapons launch systems provide some advantages in coercive bargaining, we preregistered that they also entail drawbacks to the coercing state. Given that baseline public opposition to lethal autonomous weapons is highFootnote 51 —despite also being malleableFootnote 52 —respondents may view nuclear threats implemented via automated systems to be especially threatening.
H3 Nuclear threats implemented via automated launch systems will increase target audiences’ threat perceptions compared to nuclear threats implemented via non-automated procedures.
Heightened threat perceptions may then impact the kinds of policy preferences held by target audiences. In particular, it may incentivize balancing behavior—such as increasing military spending and maintaining a nuclear deterrent—to address the threat. However, these costs to the coercer would materialize only in the longer term, whereas the credibility benefits would accrue in the short term and thus be more immediately relevant for a time-critical crisis of the type we study.
H4 Nuclear threats implemented via automated launch systems will increase target audiences’ support for greater military spending compared to nuclear threats implemented via non-automated procedures.
H5 Nuclear threats implemented via automated launch systems will decrease target audiences’ support for nuclear disarmament compared to nuclear threats implemented via non-automated procedures.
Finally, we theorize that respondents will perceive a greater chance of a nuclear accident (that is, nuclear weapons being mistakenly used even when a country’s red line has not been violated) when automated launch systems are used. Despite the possibility of automation bias—which would suggest the opposite hypothesis—we expect that the relatively novel nature of this technology, recent examples of errors related to automated computer systems (for example, MCAS) and large-language models (for example, ChatGPT), and the dystopian presentation of nuclear automation in popular culture,Footnote 53 will reduce confidence in the ability of these systems to avoid accidents. Non-experimental surveys of eighty-five experts from around the world provide some initial evidence supporting this hypothesis, as a large majority believed emerging technologies like AI increase the risks of inadvertent escalation in the nuclear realm.Footnote 54 Another study finds US citizens have a relatively high baseline view that autonomous weapons are accident-prone.Footnote 55 Although a perceived increased risk of accidents could dissuade the coercer from adopting such a system in the first place,Footnote 56 it could also make the target more likely to back down due to a logical desire to avoid a devastating accident and/or because of psychological risk aversion.
H6 Nuclear threats implemented via automated launch systems will increase perceptions in target audiences that a nuclear accident will occur compared to nuclear threats implemented via non-automated procedures.
Data and Methods
Study 1
Experimental Design
To test our hypotheses, we designed a between-subjects experiment with two key experimental conditions—one where a non-automated nuclear threat is made and another where an automated nuclear threat is made.Footnote 57
The experimental scenario takes place in 2030 and involves a Russian invasion of Estonia. Russia is an appropriate country for this study given its large nuclear arsenal, history of making nuclear threats, and prior use of a semi-automated nuclear weapons system. A Russian invasion of Estonia in 2030 is also, at the very least, plausible. In 2023, the Estonian Foreign Intelligence Service assessed that Russia was unlikely to invade within the next year, but “in the mid-to-long term, Russia’s belligerence and foreign policy ambitions have significantly increased the security risks for Estonia.”Footnote 58 We control for several factors to ensure information equivalence across experimental conditions and thus avoid confounding.Footnote 59 First, we inform respondents that the Russia–Ukraine War formally concluded in 2025 and we hold constant the outcome of the conflict. Russia is also still led by Vladimir Putin and we control for Russia’s military capabilities relative to NATO countries. In particular, we remind respondents that Russia “still maintains a large stockpile of high-yield (strategic) and lower-yield (tactical) nuclear weapons.”
Respondents are then randomly assigned to one of two conditions. The first is the non-automated nuclear threat treatment, where Putin makes an explicit nuclear threat, but human control over the use of nuclear weapons is maintained. The second is the automated nuclear threat treatment. Here, Putin also makes an explicit nuclear threat, but in this scenario the threat is implemented via automation, meaning human control is delegated to a machine:
Vladimir Putin has also publicly made a threat that Russia will immediately launch at least one nuclear weapon against NATO military or civilian targets in the UK and other European countries at the first sign NATO uses missiles or strike aircraft against its forces in Estonia.
- Non-Automated Treatment: Ultimately, Putin would make the final decision whether or not to use nuclear weapons. UK and US intelligence agencies confirm Putin has indeed issued this order to the Russian military.
- Automated Treatment: Moreover, Putin also publicly announced that this response would be completely automated, meaning Russia’s artificial intelligence systems would automatically launch a nuclear weapon if Russia’s early-warning systems detect a NATO missile launch or deployment of strike aircraft. Ultimately, a computer—rather than Putin—would make the final decision whether or not to use nuclear weapons. UK and US intelligence agencies confirm Putin has indeed issued this order to the Russian military and the automated nuclear weapons launch system has been turned on.
In both treatments we inform respondents that intelligence agencies confirm that Putin has taken concrete steps to operationalize his threat. Uncertainty about whether or not an automated system has been turned on could certainly reduce the effectiveness of nuclear threats.Footnote 60 Future research should analyze the precise role uncertainty plays, but for our purposes here, we want to test the relative credibility and effectiveness of non-automated and automated nuclear threats that are both presented in relatively strong forms.
We contend these treatments—where Russia makes nuclear threats—are, at the very least, plausible. The Biden administration was so concerned about possible Russian nuclear use against Ukraine or NATO targets that it created task forces and conducted simulations to plan for what the US response should be. At one point, US intelligence agencies estimated that the likelihood of Russian nuclear use was as high as 50 percent if their lines in southern Ukraine collapsed.Footnote 61 Furthermore, according to reporting in the New York Times, “One simulation … involved a demand from Moscow that the West halt all military support for the Ukrainians: no more tanks, no more missiles, no more ammunition.”Footnote 62 This situation is quite similar to the one outlined in the experiment and thus indicates the survey scenario is realistic.
All in all, this scenario involves a clearly aggressive and revisionist action by Russia, and the kind of attempted nuclear blackmail that nuclear coercion skeptics believe is unlikely to succeed.Footnote 63 For example, Russia’s action in the scenario resembles Pakistan’s strategy in the 1999 Kargil War, in which Pakistan deployed troops into Indian-controlled Kashmir and hoped its nuclear arsenal would coerce India into accepting the new status quo. Nuclear coercion skeptics point to the Kargil War—the only direct conflict between nuclear-armed powers in history that meets the Correlates of War threshold for war—as a failed example of nuclear blackmail.Footnote 64 Our experimental study provides a more controlled setting to assess the efficacy of nuclear brinkmanship.
Respondents are then asked a series of dependent variable questions. To assess threat credibility, we ask survey subjects to estimate the percentage chance Russia will use at least one nuclear weapon if NATO countries use missiles or strike aircraft. To measure threat effectiveness, we ask to what extent respondents would support or oppose the United Kingdom using missiles or strike aircraft in conjunction with other NATO forces; that is, violating Putin’s red line. We also ask a series of other questions regarding threat perceptions toward Russia, support for increasing military spending or abolishing the United Kingdom’s nuclear arsenal, and perceptions that Russia will accidentally use nuclear weapons even if NATO countries do not cross Putin’s red line.
There are some aspects of our design that make this a harder test for finding evidence that automated nuclear threats increase the efficacy of coercive efforts. Since Estonia is a member of NATO and thus protected under Article 5, UK citizens and policymakers have relatively strong incentives to support coming to their aid compared to countries that have not been given defense guarantees. The United Kingdom also has nuclear weapons of its own that may cause respondents to believe Russia will be deterred from launching a nuclear attack against it. Moreover, Putin and other Russian officials have made nuclear threats in the context of the Russia–Ukraine War that have not been carried out, which may make respondents question the credibility of future threats. On the other hand, this may be an easier test because Russia has deployed a semi-automated nuclear launch system in the past and made implicit and explicit nuclear threats that may have played a role in preventing more direct or larger-scale NATO intervention in Ukraine. Russia has also demonstrated a high degree of resolve by absorbing enormous costs in the Russia–Ukraine War, which is arguably a war of choice rather than necessity. On balance, while we leave it to readers to decide whether this is closer to an easier or a harder test, we believe it is a fair test of our hypotheses—neither too hard nor too easy—given these countervailing logics.
Sample
We recruited 800 UK citizens via the survey platform Prolific in September 2023. Prolific uses quota sampling to match Census benchmarks on sex, age, and ethnicity. Prior research has also demonstrated that data from Prolific is high quality and may perform better than many other survey providers, such as Qualtrics, Dynata, CloudResearch, and MTurk.Footnote 65
We chose to study the United Kingdom because it is a key member of NATO, would likely play a significant role responding to a Russian invasion of Estonia (especially since it is the leading state in the NATO Enhanced Forward Presence force deployed to Estonia), and is a nuclear power itself, along with being one of the world’s largest economies and a permanent member of the United Nations Security Council.
Studying public opinion on this topic is valuable because previous studies—including those conducted directly on elites—establish that policymakers respond to and are constrained by public opinion.Footnote 66 For example, Tomz, Weeks, and Yarhi-Milo conducted experiments on actual members of the Israeli Knesset and found they were more willing to use military force when the public was in favor, as they feared the political consequences of defying public opinion.Footnote 67 In the nuclear realm, scholars have also provided empirical evidence that domestic and international public opinion impacts the views of policymakers,Footnote 68 which is a key basis for the voluminous experimental literature evaluating the nuclear taboo via survey experiments on the public.Footnote 69 Given that nuclear crises are highly salient, they may even be more likely to capture public attention than other types of foreign policy crises. Thus, public willingness to capitulate in the face of threats will reduce the domestic constraints leaders face to backing down. Public opposition to capitulation, on the other hand, will stiffen the spine of leaders and make nuclear threats less likely to succeed.
Study 2
To test the robustness of our results from Study 1, we also designed and fielded a second experiment on about 1,060 members of the UK public in partnership with Prolific in February 2024. The only substantive difference between Studies 1 and 2 relates to the wording of the two treatment conditions. In Study 1, some of the wording, especially in the automated nuclear threat treatment, could arguably increase the likelihood of finding effects. For example, in Study 1 the automated nuclear threat treatment includes the following language: “Ultimately, a computer—rather than Putin—would make the final decision whether or not to use nuclear weapons.” While a computer could indeed have the final say on whether to use nuclear weapons if that was how the system was constructed, this is strong language because some kind of override may very well be programmed into automated launch systems. Consequently, in Study 2 we remove this language from the automated threat treatment. We also remove the corresponding language in the non-automated nuclear threat treatment that “ultimately, Putin would make the final decision whether or not to use nuclear weapons.” On balance, we expect these changes could reduce the perceived disparities between the automated and non-automated treatments and make Study 2 a harder test of our hypotheses. However, one advantage of Study 1 was that it ensured a clearer experimental manipulation since there was a starker contrast between the automated treatment (decision being made by a machine) and the non-automated treatment (decision being made by a human).
Study 3
In a meta-analysis of 162 paired experiments on members of the public and elites, Kertzer finds that elites generally respond to treatments in the same ways as members of the public.Footnote 70 This indicates that results among the general public are likely to be externally valid to policymakers. However, studies have specifically found relatively large elite-public gaps in baseline support for nuclear weapons use.Footnote 71 Although this does not necessarily mean there will be similar gaps in experimental treatment effects due to factors like floor effects, we test the external validity of our public results in a third experiment on an elite sample of 117 UK MPs in October 2024 via a partnership with YouGov. This sample has also been utilized by prior work,Footnote 72 and it tracks well with the actual UK House of Commons in terms of factors like party identification.
We utilize the treatment language from Study 2 in Study 3. We also make another key change to the design. Given the small sample size, we present respondents with both treatment conditions. This alters the experiment from a between-subjects to a fully within-subjects design. Given the small sample size of most elite experiments, this design choice is logical because it helps maximize statistical power.Footnote 73 Extant research also shows that within-subjects designs are valid tools for causal inference, despite theoretical concerns about demand effects and consistency pressures.Footnote 74 To mitigate any possible order effects, we randomize the order of the treatments.
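To illustrate the mechanics of this design, the sketch below (simulated responses and hypothetical variable names, not our replication code) shows how treatment order can be randomized and how the two conditions can be compared within respondents using a paired test:

```python
# Sketch of a within-subjects analysis in the spirit of Study 3 (assumed
# variable names; data are simulated rather than taken from the MP sample).
import numpy as np
from scipy import stats

rng = np.random.default_rng(2024)
n_mps = 117

# Randomize which condition each respondent sees first to mitigate order
# effects (recorded so order effects can later be checked as a covariate).
automated_first = rng.integers(0, 2, n_mps).astype(bool)

# Simulated support for violating the red line (seven-point scale) in each
# condition, with an assumed small negative shift under the automated threat.
support_non_automated = rng.integers(1, 8, n_mps).astype(float)
support_automated = np.clip(
    support_non_automated - rng.binomial(1, 0.3, n_mps), 1, 7
)

# A paired test uses the fact that every respondent answers both conditions,
# which helps maximize statistical power for a small elite sample.
t_stat, p_value = stats.ttest_rel(support_automated, support_non_automated)
print(f"Mean within-respondent difference: "
      f"{np.mean(support_automated - support_non_automated):.2f}, p = {p_value:.3f}")
```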
Results
Study 1
Consistent with our theoretical expectations, we find strong evidence that threat credibility is higher in the automated treatment than it is in the non-automated treatment. Figure 1 plots the densities for these two treatments and shows that the perceived probability that Russia will use nuclear weapons is over 12.3 percentage points greater in the automated than in the non-automated nuclear threat treatment (p < .001). While undoubtedly dangerous in terms of escalation risks, this accords with our preregistered hypothesis that automated nuclear threats can allow leaders to more credibly tie their hands in crisis scenarios, and illustrates the temptation some leaders may face to adopt these kinds of systems in contexts similar to our experiment.

Figure 1. Automated nuclear threats enhance credibility
Note: *p < .10; **p < .05; ***p < .01.
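For readers interested in the mechanics, the estimate above is simply a difference in means across randomly assigned arms. The sketch below (simulated data and hypothetical variable names, not our replication code) shows how such a treatment effect and p-value could be computed:

```python
# Sketch of the between-subjects comparison in the spirit of Study 1 (simulated
# data with an assumed ~12-point shift; variable names are hypothetical).
import numpy as np
from scipy import stats

rng = np.random.default_rng(7)
n_per_arm = 400  # roughly 800 respondents split across two arms

# "Credibility" is the 0-100 percentage-chance measure that Russia would carry
# out its threat if NATO crossed the red line.
cred_non_automated = np.clip(rng.normal(40, 25, n_per_arm), 0, 100)
cred_automated = np.clip(rng.normal(52, 25, n_per_arm), 0, 100)

# Average treatment effect and a two-sample (Welch) t-test across arms.
ate = cred_automated.mean() - cred_non_automated.mean()
t_stat, p_value = stats.ttest_ind(cred_automated, cred_non_automated, equal_var=False)
print(f"ATE: {ate:.1f} percentage points (p = {p_value:.4f})")
```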
Similarly, we find strong evidence that nuclear threats implemented via automated launch systems are more effective than those implemented via non-automated means (p < .05 using a seven-point scale). Figure 2 illustrates this visually. Most notably, 20 percent of respondents “strongly opposed” violating Putin’s red line in the automated nuclear threat treatment compared to just 10.9 percent in the non-automated treatment. Interestingly, absolute support for violating Putin’s red line is still relatively high (42–47 percent) despite the risks involved. This is likely due to a combination of the United Kingdom’s Article 5 commitment to Estonia and a perception among some respondents that Russia is probably bluffing. Indeed, we show in the appendix that perceived threat credibility significantly mediates support for violating Putin’s red line.

Figure 2. Automated nuclear threats enhance effectiveness
Surprisingly, and in contrast to our preregistered expectations, automated nuclear threats are not associated with significantly greater threat perception, support for increasing military spending, or opposition to nuclear disarmament.Footnote 75 Floor and ceiling effects can help explain this null finding. For example, threat perceptions were already high and support for nuclear disarmament was already low in an absolute sense whenever any kind of nuclear threat was made, meaning there was little room for the nature of that threat (automated versus non-automated) to matter. This null finding indicates that the costs to the coercer of threatening nuclear use via an automated system are somewhat lower than expected, at least when the counterfactual is a nuclear threat implemented via non-automated means.
However, as Figure 3 shows, automated nuclear threats do increase perceptions—by over thirteen percentage points—that Russia will mistakenly use nuclear weapons even if NATO does not violate Putin’s red line. Automated nuclear systems may enable leaders to throw the steering wheel out of the car in a game of chicken and enhance the effectiveness of nuclear brinkmanship, but the public is relatively less trusting that the technology will work as intended compared to when humans remain fully “in the loop.” This belief may reduce public support for the adoption of automated nuclear launch systems due to the fear of losing control. Of course, whether this perception matches reality, relative to the potential for human error, is open to debate, and the answer may change as the technology develops. Moreover, from the perspective of the coercer, an increased perceived risk of accidents may actually be a positive in one key respect. Since the goal of nuclear brinkmanship is to increase the target’s perception that nuclear escalation is possible, higher fears that automated nuclear weapons systems will lead to accidents might encourage the target to back down.

Figure 3. Automated threats increase the perceived risk of accidents
Note: *p < .10; **p < .05; ***p < .01.
Robustness and Study 2
We take several steps in the appendix to demonstrate the robustness of our core results. First, we illustrate that they hold in a regression context when controlling for factors like hawkishness, political identification, education, and gender. Second, we probe potential interaction effects and do not find consistent effects for factors like hawkishness, support for NATO or military aid to Ukraine, or self-assessed knowledge about international relations or artificial intelligence.Footnote 76 Third, we show that almost all of our core results hold in Study 2. The one exception is that the difference in threat effectiveness between the automated and non-automated threat treatment is estimated less precisely and is not statistically significant, though it is in the hypothesized direction. Overall, there is robust evidence among our public samples that automated nuclear threats enhance threat credibility more than non-automated threats.
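A minimal sketch of the kind of covariate-adjusted and interaction models referenced above might look like the following (simulated data and assumed variable names; the actual specifications and controls are reported in the appendix):

```python
# Sketch of a covariate-adjusted treatment-effect model plus one interaction
# term, using simulated data and hypothetical variable names.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(11)
n = 800
df = pd.DataFrame({
    "automated": rng.integers(0, 2, n),      # 1 = automated-threat condition
    "hawkishness": rng.normal(0, 1, n),      # standardized attitude scale
    "female": rng.integers(0, 2, n),
    "education": rng.integers(1, 6, n),
})
# Simulated outcome: perceived credibility (0-100) with an assumed 12-point
# treatment effect and no true treatment-by-hawkishness interaction.
df["credibility"] = (
    40 + 12 * df["automated"] + 3 * df["hawkishness"] + rng.normal(0, 25, n)
).clip(0, 100)

# Treatment effect with controls, then a check for heterogeneous effects.
controls = smf.ols(
    "credibility ~ automated + hawkishness + female + education", data=df
).fit(cov_type="HC2")
interaction = smf.ols("credibility ~ automated * hawkishness", data=df).fit(cov_type="HC2")

print(controls.params["automated"])                  # covariate-adjusted effect
print(interaction.params["automated:hawkishness"])   # interaction estimate
```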
Study 3
There are mixed results in our UK MP study.Footnote 77 Strikingly, we find significant evidence among our UK MP sample that automated nuclear threats are more effective than non-automated threats in the context of an international crisis. Specifically, as Table 1 displays, UK MPs are about nine percentage points (p ≈ .02) less likely to support violating Putin’s red line when he makes an automated nuclear threat compared to a non-automated nuclear threat.
Although directly comparing our elite study to our public studies is challenging given the differences in experimental design, it is interesting that in contrast to both Studies 1 and 2, we do not find significant evidence in our MP study that automated nuclear threats are more credible than non-automated nuclear threats. Belief that Russia will use nuclear weapons if their red line is violated is about 39 percent in both cases (though, in an absolute sense, that number is still quite high). The difference between our public and elite samples may be due to elites having a better understanding and belief in the dynamics of nuclear deterrence, though, on the other hand, the average UK MP is far from a nuclear expert.Footnote 78 Given that policymakers have a more direct influence over a country’s foreign policy than the public, the null result for threat credibility suggests the incentives to automate nuclear weapons launch systems are somewhat limited.
Table 1. Findings among United Kingdom Members of Parliament

Note: pp = percentage points.
What explains the difference between the threat effectiveness and credibility results, especially since threat effectiveness is arguably the more significant measure? We suggest two potential explanations. The first relates to uncertainty. Even if policymakers’ best point estimate of the probability of nuclear use was roughly the same for non-automated and automated nuclear threats, they may have been less confident and more uncertain about their estimates for the latter given the relative novelty of the technology and the inability to bargain with a computer in the same way one can bargain and reason with a human. Greater uncertainty about the risk of nuclear escalation may then have convinced some MPs to support backing down to avoid a devastating outcome. This explanation relates to the philosophical concept of “Pascal’s Wager,” which holds that if presented with significant uncertainty about a high-stakes outcome (for example, nuclear war), it is logical to take the path that reduces the probability of the worst outcome (for example, nuclear Armageddon), even if that outcome is not likely. The French thinker Blaise Pascal applied this logic to belief in God: even if God might not exist, it is better to hedge your bets and believe in order to avoid eternal damnation. It is also relevant here in that automated systems might increase uncertainty and thus make backing down, which involves a finite cost, more attractive relative to risking nuclear conflict, which involves potentially infinite costs. A second potential explanation is that perhaps UK MPs felt it would be easier to justify to the UK public why they were backing down if they could point to the unprecedented use of an automated nuclear launch system rather than a more foreseeable nuclear threat implemented via non-automated means.Footnote 79
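In stylized expected-utility terms (the notation is ours and purely illustrative, and we ignore for simplicity any gains from successfully standing firm), standing firm becomes unattractive whenever the perceived probability of nuclear use multiplied by the cost of nuclear war exceeds the finite cost of backing down:

\[
p \cdot C_{\text{war}} \;>\; C_{\text{back down}}.
\]

Because the cost of nuclear war is so large, even a modest upward shift in the plausible range of that probability, of the kind that greater uncertainty about an automated system might produce, can be enough to tip decision makers toward conceding.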
Another divergence from Studies 1 and 2 is that we find no significant evidence that UK MPs believe that nuclear accidents are more likely when threats are implemented via automation than without. This may reflect a greater belief among policymakers than the public in the testing and evaluation systems that a country would use in the real world prior to deploying such a system to prove it would work as intended. It also means leaders may be marginally less hesitant to adopt these kinds of systems than some believe.Footnote 80
Although the results related to credibility and effectiveness for automated versus non-automated nuclear threats are somewhat inconsistent between the three studies, all yield at least some evidence that states may gain coercion benefits from developing and deploying automated systems.
Conclusion
The question of how to make nuclear threats more credible—or, indeed, whether nuclear threats can ever be credible against nuclear-armed states outside the context of deterring an attack against one’s homeland—has puzzled scholars and policymakers alike. Our study makes a key contribution to this debate: there are ways to make nuclear threats more credible, even when they are in the service of offensive and revisionist goals. We find concerning evidence—even among policymakers—that automated nuclear weapons launch systems in particular may enhance the credibility and/or effectiveness of nuclear threats in the context of crises. This finding validates fears about the integration of automation—perhaps powered by AI—into NC3. It may also incentivize countries to adopt these kinds of systems, just as the Soviets did with the Dead Hand system.
Our project also raises a number of promising avenues for future research. Since our argument and empirics are focused on crisis scenarios, future work should assess their external validity to non-crisis situations. From a theoretical perspective, threats implemented via automated nuclear launch systems may still enhance credibility and effectiveness for longer-term, more general deterrent relationships. For example, they could make threats to launch a nuclear attack in response to a non-nuclear kinetic attack or a major cyber attack more credible, despite the fact that some scholars reasonably argue such threats would normally be irrational to carry out and thus lack credibility.Footnote 81 For countries without secure second-strike capabilities, some leaders might also think that automated nuclear systems could, theoretically, help deter nuclear attacks by reducing the chances a first strike attempt could successfully decapitate a country’s leadership and cripple their communications networks, preventing a retaliatory order from being given.Footnote 82
Subsequent research can assess related questions while also evaluating the credibility of the claim that a state has actually activated an automated nuclear weapons launch system.Footnote 83 While direct intelligence about the matter is likely to be highly germane, other factors—like the stakes of the crisis, the perceived risk tolerance of the leader, and the public or private statements made by the coercing country—may also play a significant role. For instance, if the stakes are low, then it may be less credible that a country would assume the escalation and reputation risks associated with turning on such a system, especially for an offensive effort.
Technological developments from artillery and aircraft to submarines and smart bombs have, historically, changed the character of warfare and international politics. AI and other computer-related advancements have a similar potential. This paper unpacks how a key emerging technology might be integrated with a powerful existing technology—nuclear weapons—and used for malign purposes. We find evidence that the danger is real and should be taken seriously.
Data Availability Statement
Replication files for this research note may be found at <https://doi.org/10.7910/DVN/9DKDEK>.
Supplementary Material
Supplementary material for this research note is available at <https://doi.org/10.1017/S0020818325101215>.
Acknowledgments
We thank Sanghyun Han, Jenna Jordan, David Logan, Scott Sagan, participants of the MIT Security Studies Working Group and Seminar, the Georgia Tech STAIR Workshop, the Carnegie Mellon Political Science Research Workshop, and the 2024 International Studies Association Annual Conference for helpful comments and advice. We are also deeply grateful to Graham Elder, Isabel Leong, and Stotra Pandya for valuable research assistance.
Funding
This research note was made possible, in part, by funding from the Air Force Office of Scientific Research and the Minerva Research Initiative under grant #FA9550-18-1-0194.
