A Taxonomy and Comprehensive Survey of Scenario Generation for Autonomous Driving: Methods, Challenges, and Emerging Trends in Safety-Critical Testing

08 September 2025, Version 1
This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

Achieving the safety of autonomous vehicles (AVs) requires testing methodologies that are rigorous, scalable, and capable of addressing both routine and safety-critical situations. Traditional evaluation through mileage accumulation often fails to expose autonomous driving systems (ADS) to rare but high-impact events. Scenario-based testing has therefore emerged as a key paradigm, enabling the systematic construction of targeted and repeatable test cases. Central to this paradigm is scenario generation, which provides the essential means of populating simulations with realistic, diverse, and safety-relevant traffic situations. This survey presents a taxonomy and critical review of scenario generation methods for autonomous driving. We classify existing approaches into three paradigms—rule-based, data-driven, and learning-based—and analyze their methodologies, representative techniques, and supporting tools such as simulation platforms and scenario description languages. We also summarize evaluation metrics used to measure realism, diversity, and criticality, highlighting trade-offs between interpretability, scalability, and generalizability. Despite rapid progress, several challenges remain. Key issues include bridging the “reality gap” between virtual and real traffic, improving robustness under limited or biased datasets, and effectively modeling rare but safety-critical events. Emerging research directions are also explored, including language-driven generation, hybrid frameworks that integrate symbolic rules with generative learning, and standardized open scenario repositories to enhance reproducibility and regulatory acceptance. By consolidating diverse research efforts into a unified framework, this work provides structured insights for both researchers and practitioners. It aims to support the development of scalable and certifiable testing pipelines that are essential for the safe deployment of autonomous driving technologies.

Keywords

Autonomous Driving
autonomous vehicles
Simulation
safety testing

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.
Comment number 1, Marat Fazulzyanov: Dec 24, 2025, 07:55

Данный обзор представляет собой своевременную и методологически выверенную работу, которая систематизирует быстро развивающуюся область генерации сценариев для тестирования автономного транспорта. Авторы точно определяют ключевую проблему перехода от пассивного накопления пробега к активному, целеполагающему тестированию на основе сценариев, что является единственным реалистичным путем к сертификации САД. Главная ценность работы — в предложенной четкой таксономии (правила, данные, машинное обучение), которая позволяет не просто каталогизировать методы, но и понять фундаментальные компромиссы между интерпретируемостью (сила логических правил) и масштабируемостью/обобщаемостью (сила ML-подходов). Анализ метрик оценки реалистичности, разнообразия и критичности прямо указывает на "узкие места" в валидации самих методов генерации. Критически важным выводом является выделение нерешенных проблем, особенно "разрыва в реальности" и сложности генерации критически важных, но статистически редких событий (edge cases). Это указывает на то, что текущие методы, основанные преимущественно на экстраполяции существующих данных, по своей сути ограничены и могут не охватить истинно неизвестные опасные ситуации.