As generative AI technologies continue to advance at a rapid pace, they are fundamentally transforming the dynamics of human–AI interaction and collaboration, a phenomenon that was once relegated to the realm of science fiction. These developments not only present unprecedented opportunities but also introduce a range of complex challenges. Key factors such as trust, transparency, and cultural sensitivity have emerged as essential considerations in the successful adoption and efficacy of these systems. Furthermore, the intricate balance between human and AI contributions, the optimization of algorithms to accommodate diverse user needs, and the ethical implications of AI’s role in society pose significant challenges that require careful navigation. This chapter will delve into these multifaceted issues, analyzing both user-level concerns and the underlying technical and psychological dynamics that are critical to fostering effective human–AI interaction and collaboration.
The last decade has seen an exponential increase in the development and adoption of language technologies, from personal assistants such as Siri and Alexa, through automatic translation, to chatbots like ChatGPT. Yet questions remain about what we stand to lose or gain when we rely on them in our everyday lives. As a non-native English speaker living in an English-speaking country, Vered Shwartz has experienced both amusing and frustrating moments using language technologies: from relying on inaccurate automatic translation, to failing to activate personal assistants with her foreign accent. English is the world's foremost go-to language for communication, and mastering it past the point of literal translation requires acquiring not only vocabulary and grammar rules, but also figurative language, cultural references, and nonverbal communication. Will language technologies aid us in the quest to master foreign languages and better understand one another, or will they make language learning obsolete?
AI is evolving rapidly and is poised to have far-reaching societal and global impacts, including in the military domain. AI offers cognitive reasoning and learning about problem domains – processing large quantities of data to develop situational awareness, generate solution goals, recommend courses of action, and provide robotic systems with the means for sense-making, guidance, actions, and autonomy. This chapter explores metacognition – an emerging and revolutionary technology that enables AI to become self-aware, to think and reason about its own cognition – and its applications in the military domain, focusing on four areas: (1) improving human interaction with AI systems, (2) providing safe and ethical AI behavior, (3) enabling autonomous systems, and (4) improving automated decision aids. The chapter begins with an overview of foundational AI and metacognition concepts, followed by a discussion of the potential contribution of metacognition to improving military operations. The chapter concludes with speculations concerning the more distant future of metacognition and its implications for AI systems and warfare.
We study the performance of a commercially available large language model (LLM) known as ChatGPT on math word problems (MWPs) from the dataset DRAW-1K. To our knowledge, this is the first independent evaluation of ChatGPT. We found that ChatGPT’s performance changes dramatically based on the requirement to show its work, failing 20% of the time when it provides work compared with 84% when it does not. Further, we identify several factors of MWPs, relating to the number of unknowns and the number of operations, that lead to a higher probability of failure when compared with the prior; in particular, across all experiments, the probability of failure increases linearly with the number of addition and subtraction operations. We have also released the dataset of ChatGPT’s responses to the MWPs to support further work on the characterization of LLM performance, and we present baseline machine learning models to predict whether ChatGPT can correctly answer an MWP.
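As a rough illustration of what such a baseline predictor can look like, the sketch below fits a logistic regression on hypothetical MWP features (number of unknowns, number of addition/subtraction operations, number of multiplication/division operations); the features, data, and coefficients are synthetic stand-ins, not the released dataset or the chapter's models.

```python
# A minimal sketch (not the authors' released code) of a baseline failure
# predictor: logistic regression over simple, hypothetical MWP features.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)

# Hypothetical feature matrix: [num_unknowns, num_add_sub_ops, num_mul_div_ops]
X = rng.integers(low=0, high=6, size=(500, 3))
# Hypothetical labels: 1 = ChatGPT failed the problem. Failure probability here
# grows with the number of addition/subtraction operations, mirroring the
# linear trend reported in the chapter.
p_fail = np.clip(0.1 + 0.12 * X[:, 1], 0, 1)
y = rng.binomial(1, p_fail)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
clf = LogisticRegression().fit(X_train, y_train)
print("held-out accuracy:", accuracy_score(y_test, clf.predict(X_test)))
print("coefficient on add/sub count:", clf.coef_[0][1])
```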
This chapter introduces the concept of metacognition from a cognitive perspective, where it refers to knowledge and mental processes that operate on one’s own cognition. We review different forms of metacognition that involve distinct types of explicit reasoning and automatic processes, as well as various measures and functional benefits. We articulate four conjectures regarding the nature of metacognition in the specific context of the ACT-R cognitive architecture: (1) it involves extracting information about processes in cognitive modules; (2) the information is quantitative and approximate rather than symbolic; (3) the metacognitive information is available in working memory for cognitive processing; and (4) general cognitive processes are sufficient to respond to a situation detected by metacognitive monitoring. We illustrate these principles with examples of past work involving neuro-symbolic models of perception and introspection into declarative models of decision-making. Finally, we situate this approach within the context of theories such as predictive coding and the Common Model of Cognition encompassing other cognitive architectures.
Metacognitive AI is closely connected to certifiable AI and trustworthy AI, two areas focused on equipping AI with trustworthiness guarantees in high-stakes domains. This chapter provides a systematic overview, tutorial, and discussion of the certified approaches in trustworthy deep learning. The chapter introduces essential terminologies, core methodologies, and representative applications of certified approaches. We believe that certified approaches, as a prerequisite for deploying AI in high-stakes and safety-critical applications, will be an essential tool in metacognitive AI, and we hope that this chapter can inspire readers to further advance the field of certifiable trustworthiness for metacognitive AI.
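To make the notion of a certified approach concrete, here is a minimal sketch of one representative technique, randomized smoothing; the base classifier, noise level, and sample count are illustrative assumptions, and a real pipeline would use far more samples and a statistically sound lower bound on the top-class probability.

```python
# A minimal sketch of randomized smoothing: a base classifier is queried on
# Gaussian-perturbed copies of the input, and the vote margin yields a radius
# within which the smoothed classifier's prediction is provably constant.
import numpy as np
from scipy.stats import norm

def base_classifier(x: np.ndarray) -> int:
    """Stand-in classifier: two classes separated by a linear boundary."""
    return int(x[0] + 0.5 * x[1] > 0.3)

def smoothed_prediction(x, sigma=0.25, n=2000, rng=np.random.default_rng(0)):
    votes = np.array([base_classifier(x + sigma * rng.normal(size=x.shape))
                      for _ in range(n)])
    p_top = max(votes.mean(), 1 - votes.mean())  # empirical top-class probability
    radius = sigma * norm.ppf(p_top)             # certified L2 radius (illustrative)
    return int(round(votes.mean())), radius

x = np.array([0.8, 0.4])
cls, radius = smoothed_prediction(x)
print(f"smoothed class = {cls}, certified L2 radius ~ {radius:.3f}")
```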
This chapter presents a metacognitive AI approach via formal verification and repair of neural networks (NNs). We observe that neural network repair is a form of metacognition, where trained AI systems relearn until specifications hold. We detail Veritex, a tool for reachability analysis and repair of deep NNs (DNNs). Veritex includes methods for exact and over-approximative reachability analysis of DNNs. The exact methods can compute the exact output reachable domain, as well as the exact unsafe input space that causes safety violations of DNNs. Based on the exact unsafe input–output reachable domain, Veritex can repair unsafe DNNs on multiple safety properties with negligible performance degradation, by updating the DNN parameters via retraining. Veritex primarily addresses the synthesis of provably safe DNNs, which has not yet been significantly addressed in the literature. Veritex is evaluated for safety verification and DNN repair. Benchmarks for verification include ACAS Xu, and benchmarks for repair include an unsafe ACAS Xu network and an unsafe agent trained with deep reinforcement learning (DRL); in both cases Veritex is able to modify the NNs until safety is proven.
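The following is a minimal, hypothetical sketch of a verify-then-repair loop in this spirit; it does not use Veritex's actual API or its exact reachability methods. Interval propagation gives an over-approximate output bound, and if a toy safety property fails, the network is retrained to push the reachable set back into the safe region while staying close to its original behavior.

```python
# A hypothetical sketch of verification-driven repair (not Veritex's API):
# over-approximate the reachable output set by interval propagation, then
# retrain until the safety property is proven on the bound.
import torch
import torch.nn as nn

torch.manual_seed(0)
net = nn.Sequential(nn.Linear(2, 8), nn.ReLU(), nn.Linear(8, 1))

def output_upper_bound(model, lo, hi):
    """Differentiable interval propagation; returns an upper bound on the output."""
    for layer in model:
        if isinstance(layer, nn.Linear):
            center, radius = (lo + hi) / 2, (hi - lo) / 2
            c = layer(center)
            r = radius @ layer.weight.abs().t()
            lo, hi = c - r, c + r
        else:  # ReLU
            lo, hi = lo.clamp(min=0), hi.clamp(min=0)
    return hi

# Toy safety property: for every input in [-1, 1]^2 the output must stay below 2.0.
lo, hi = torch.tensor([[-1.0, -1.0]]), torch.tensor([[1.0, 1.0]])
ref_in = torch.rand(256, 2) * 2 - 1
ref_out = net(ref_in).detach()                     # remember nominal behaviour

opt = torch.optim.Adam(net.parameters(), lr=1e-2)
for step in range(500):
    ub = output_upper_bound(net, lo, hi)
    violation = torch.relu(ub - 2.0).sum()         # how far the bound is unsafe
    if violation.item() == 0.0:
        print(f"property proven safe after {step} repair steps")
        break
    drift = ((net(ref_in) - ref_out) ** 2).mean()  # stay close to original behaviour
    opt.zero_grad()
    (violation + 0.1 * drift).backward()
    opt.step()
```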
In this chapter, we use task failure as a trigger to engage in metacognitive processes. We present a procedure by which an agent may exploit failure in the zero-shot outputs of LLMs as a trigger to investigate alternative solutions to the problem using object interactions and knowledge of the object semantics. We additionally propose a method through which knowledge gained from the object interactions can be distilled back into the LLM, and we outline avenues for future research.
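A minimal sketch of that failure-triggered control flow is shown below; the LLM planner, executor, and object-interaction routines are stand-in stubs, so only the loop structure (fail, explore, distil back) is meaningful.

```python
# A hypothetical sketch of failure-triggered metacognition with stubbed components.
def propose_plan_with_llm(task: str, knowledge: list[str]) -> list[str]:
    """Stub for a zero-shot LLM planner, optionally conditioned on new facts."""
    if knowledge:
        return ["open(cabinet)", "pick(cup)", "place(cup, shelf)"]
    return ["pick(cup)", "place(cup, shelf)"]

def execute(plan: list[str]) -> bool:
    """Stub executor: the task only succeeds if the cabinet is opened first."""
    return plan[0] == "open(cabinet)"

def explore_object_interactions(task: str) -> list[str]:
    """Stub exploration: interact with objects to recover missing semantics."""
    return ["fact: the shelf is reachable only after opening the cabinet"]

def solve(task: str, knowledge: list[str]) -> list[str]:
    plan = propose_plan_with_llm(task, knowledge)
    if execute(plan):
        return plan
    # Metacognitive trigger: failure starts deliberate object-level exploration.
    knowledge.extend(explore_object_interactions(task))  # distil back for reuse
    return propose_plan_with_llm(task, knowledge)

knowledge: list[str] = []
print(solve("put the cup on the shelf", knowledge))
print("distilled knowledge:", knowledge)
```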
We investigate the incorporation of metacognitive capabilities into Machine Learning Integrated with Network (MLIN) systems and develop a machine Learning Integrated with Knowledge (mLINK) stratum. This stratum is aimed at integrating knowledge obtained from multiple MLIN elements and reflecting on ML application performance outcomes in order to provide feedback on metacognitive actions aimed at ensuring performance and improving ML application robustness to Data Quality (DQ) variations. We discuss multiple use cases to show how knowledge of the interrelationships between MLIN components, DQ, and ML application performance can be generated and employed by mLINK. We elaborate on how this knowledge is integrated into mLINK to produce metaknowledge, viewed as recommendations on the adaptation actions or strategies needed. We define the process of employing these recommendations by mLINK as metacognition and describe multiple examples of utilizing these metacognitive strategies in practice, such as optimizing the data collection; reflection on DQ; DQ assurance; enhanced transfer learning; and Federated Learning for enhancing security, privacy, collaboration, and communication in MLIN.
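As a rough, hypothetical illustration of such a feedback loop, the sketch below computes simple DQ indicators, relates them to observed ML performance, and emits adaptation recommendations; the indicators, thresholds, and actions are assumptions made for illustration, not the chapter's mLINK design.

```python
# A hypothetical metacognitive feedback loop: monitor data quality, reflect on
# performance, and emit metaknowledge as adaptation recommendations.
import numpy as np

def dq_indicators(batch: np.ndarray) -> dict:
    return {
        "missing_rate": float(np.isnan(batch).mean()),
        "noise_level": float(np.nanstd(batch)),
    }

def recommend(dq: dict, accuracy: float) -> list[str]:
    actions = []
    if accuracy < 0.8 and dq["missing_rate"] > 0.1:
        actions.append("increase sampling rate / re-collect data")
    if accuracy < 0.8 and dq["noise_level"] > 2.0:
        actions.append("apply denoising or trigger transfer learning")
    return actions or ["no adaptation needed"]

batch = np.array([[1.0, np.nan, 3.0], [4.0, 5.0, np.nan]])
print(recommend(dq_indicators(batch), accuracy=0.72))
```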
To enhance understanding and collaboration with autonomous agents, it is crucial to construct a representation of their task strategies that integrates interpretability, monitoring, and formal reasoning. This dual-purpose representation fosters human comprehension and enables automated analytical processes. We achieve this balance by formalizing task strategies through temporal logic formulas. Recent trends emphasize inferring temporal logic formulas from data to explain system behaviors and assess autonomous agents’ competencies. Our methodology relies on positive and negative examples from system observations to construct a concise temporal logic formula consistent with the data. However, existing approaches often overlook real-world data’s noise and uncertainties, limiting practical deployment. Addressing this, we analyze labeled trajectories and aim to infer interpretable formulas that minimize misclassification loss. To tackle data uncertainties, we focus on labeled interval trajectories. Our algorithm maximizes the worst-case robustness margin, enhancing formula robustness and ensuring the adaptability and reliability of temporal logic inference in real-world applications.
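For one simple formula template, the sketch below illustrates the worst-case robustness idea: choosing a threshold c for the formula "always (x > c)" so that the smallest robustness margin over labelled interval trajectories is maximised. The template, trajectories, and grid search are illustrative simplifications, not the chapter's full inference algorithm.

```python
# A simplified sketch of maximising the worst-case robustness margin of the
# formula "G (x > c)" over labelled interval trajectories.
import numpy as np

# Each trajectory is an array of per-time-step intervals [lo_t, hi_t];
# positives should satisfy the formula, negatives should violate it.
positives = [np.array([[1.0, 1.2], [1.1, 1.4], [0.9, 1.3]])]
negatives = [np.array([[0.2, 0.6], [0.1, 0.5], [0.3, 0.8]])]

def worst_case_margin(c: float) -> float:
    """Smallest signed robustness margin over all labelled interval trajectories."""
    margins = []
    for traj in positives:
        # The formula must hold for every signal in the tube: use the lower envelope.
        margins.append(np.min(traj[:, 0] - c))
    for traj in negatives:
        # The formula must fail for every signal in the tube: use the upper envelope.
        margins.append(-np.min(traj[:, 1] - c))
    return min(margins)

candidates = np.linspace(0.0, 1.5, 151)
best_c = max(candidates, key=worst_case_margin)
print(f"best threshold c = {best_c:.2f}, worst-case margin = {worst_case_margin(best_c):.2f}")
```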
One of the central aspects of metacognitive AI is the AI agent’s ability to reason about its own behavior. In particular, for AI systems to be deployed in real-world applications with high impact, it is crucial that we can reason about and guarantee their fairness and robustness. Here, we provide a probabilistic reasoning framework to audit and enforce fairness of automated decision-making systems, using classifiers as the main example, while being robust to uncertainties and noise in the distribution.
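A minimal, illustrative audit in this spirit might look like the sketch below: estimate the demographic parity gap of a classifier, then probe how much that estimate can move when a fraction of the sensitive attributes are noisy. The data, classifier, and noise model are all hypothetical, and this empirical probe is not the chapter's probabilistic reasoning framework.

```python
# A hypothetical fairness audit with a noise-robustness probe.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 4))
group = rng.integers(0, 2, size=2000)          # sensitive attribute
y = (X[:, 0] + 0.5 * group + rng.normal(scale=0.5, size=2000) > 0).astype(int)

clf = LogisticRegression().fit(np.c_[X, group], y)
pred = clf.predict(np.c_[X, group])

rates = [pred[group == g].mean() for g in (0, 1)]
gap = abs(rates[0] - rates[1])                 # demographic parity gap

# Probe robustness: randomly flip a fraction eps of group attributes and
# record how much the audited gap can move.
eps, trials, worst = 0.05, 200, gap
for _ in range(trials):
    noisy = group.copy()
    flip = rng.random(len(noisy)) < eps
    noisy[flip] = 1 - noisy[flip]
    noisy_rates = [pred[noisy == g].mean() for g in (0, 1)]
    worst = max(worst, abs(noisy_rates[0] - noisy_rates[1]))
print(f"demographic parity gap = {gap:.3f}; worst gap over noisy audits = {worst:.3f}")
```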
By integrating hard constraints into neural network outputs, we not only improve the reliability of AI systems but also pave the way for meta-cognitive capabilities that ensure the alignment of predictions with domain-specific knowledge. This topic has received a lot of attention; however, existing methods either impose the constraints in a “weak” form at training time, with no guarantees at inference, or fail to provide a general framework that supports different tasks and constraint types. We tackle this open problem from a neuro-symbolic perspective, developing a pipeline that enhances a conventional neural predictor with a symbolic reasoning module capable of correcting structured prediction errors and a neural attention module that learns to direct the reasoning effort toward potential prediction errors while keeping other outputs unchanged. This framework provides an appealing trade-off between the efficiency of constraint-free neural inference and the prohibitive cost of exhaustive reasoning at inference time, while satisfying the rigorous demands of meta-cognitive assurance.
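A toy sketch of that division of labor is given below: a hypothetical neural predictor emits per-position probabilities, a confidence-based attention step flags the least certain position as the likely error, and a symbolic step repairs only that position so a hard constraint (here, digits summing to a known total) holds, leaving everything else unchanged. The probabilities and constraint are illustrative, not from the chapter.

```python
# A toy neuro-symbolic correction pipeline: attention focuses the symbolic
# repair on the least confident output position only.
import numpy as np

# Hypothetical per-position softmax outputs over digits 0-9 (3 positions).
probs = np.array([
    [0.01, 0.02, 0.90, 0.02, 0.01, 0.01, 0.01, 0.01, 0.005, 0.005],  # confident "2"
    [0.02, 0.05, 0.05, 0.40, 0.35, 0.05, 0.03, 0.02, 0.02, 0.01],    # uncertain 3 vs 4
    [0.01, 0.01, 0.01, 0.01, 0.02, 0.90, 0.01, 0.01, 0.01, 0.01],    # confident "5"
])
target_sum = 11  # hard constraint on the structured output

pred = probs.argmax(axis=1)                      # unconstrained neural prediction
if pred.sum() != target_sum:
    # Attention: direct the reasoning effort to the least confident position.
    focus = probs.max(axis=1).argmin()
    # Symbolic repair: choose the value at `focus` that restores the constraint.
    needed = target_sum - (pred.sum() - pred[focus])
    if 0 <= needed <= 9:
        pred[focus] = needed
print("constrained prediction:", pred, "sum:", pred.sum())
```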
Text-to-image (T2I) diffusion models require large-scale training data to achieve their impressive performance. Still, they seem to lack a common understanding of semantics such as spatial composition, and they exhibit spurious correlations that raise ethical concerns. Scaling data and model size does not lead to better semantics; instead, it seems to hurt the model. Recent works have shown the few-shot concept learning abilities of T2I models on simple concepts like cat or dog. Following this line of research, in this chapter we introduce the use of Concept Algebra for learning new concepts in a resource-efficient way. To do so, we present three works focusing on concept learning to show its effectiveness: (1) creating a benchmark for large-scale evaluations of concept learning methodologies, (2) reducing ethical biases through few-shot concept learning with Concept Algebra, and (3) learning spatial relationships via few-shot concept adaptation. Through this research, we describe our efforts to create few-shot synthetic data that is robust and reduces the biases present in various forms.
AI systems have struggled to be deployed in safety-critical applications where the consequences of incorrect predictions are severe. In complex applications and environments, like autonomous driving, it is often impossible or impractical to curate a dataset or simulator that sufficiently spans the entire input space, making it improbable that a perfect agent can be trained offline. Metacognitive AI represents an approach to design agents that continue safely learning and adapting as they encounter new or uncertain scenarios in the environment, which improves their performance over time. A key component to achieve this behavior is quantifying the AI agent’s prediction uncertainty to enable the agent to understand when it is operating in a previously unseen scenario. In this chapter, we discuss a framework for creating a metacognitive agent and delve deeper into Meta Modeling, a method for augmenting existing neural networks with uncertainty quantification. Our approach provides a first step toward realizing a metacognitive AI agent.
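As a generic, minimal illustration of augmenting a trained predictor with an auxiliary uncertainty signal (not the chapter's exact Meta Modeling construction), the sketch below pairs a base regressor with a nearest-neighbour distance score: inputs far from anything seen in training receive a high score, flagging that the agent is operating in an unfamiliar scenario.

```python
# A generic sketch of uncertainty augmentation: base predictor plus a simple
# distance-based unfamiliarity score over the training data.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
X = rng.uniform(-1, 1, size=(2000, 2))
y = np.sin(3 * X[:, 0]) + 0.5 * X[:, 1] ** 2

base = MLPRegressor(hidden_layer_sizes=(32,), max_iter=2000, random_state=0).fit(X, y)
novelty = NearestNeighbors(n_neighbors=10).fit(X)   # auxiliary "meta" component

def predict_with_uncertainty(x: np.ndarray):
    dist, _ = novelty.kneighbors(x)
    return base.predict(x), dist.mean(axis=1)        # prediction + unfamiliarity score

for name, x in [("in-distribution", [[0.1, -0.2]]), ("unseen scenario", [[4.0, 4.0]])]:
    pred, unc = predict_with_uncertainty(np.array(x))
    print(f"{name}: prediction={pred[0]:.2f}, uncertainty score={unc[0]:.2f}")
```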