Contents | Building Embodied AI: From Perception to Autonomous Action

Front Matter · Opening Material

9 entries

F1
ForewordFront matter for Building Embodied AI: From Perception to Autonomous Action.
front-matter/foreword.html
F2
About the AuthorsFront matter for Building Embodied AI: From Perception to Autonomous Action.
front-matter/about-authors.html
F3
About the Hands-On AI Science SeriesThe series promise and why Embodied AI is the fifth volume.
front-matter/about-the-series.html
F4
Who Should Read This BookFront matter for Building Embodied AI: From Perception to Autonomous Action.
front-matter/fm-who-should-read.html
F5
How to Use This BookFront matter for Building Embodied AI: From Perception to Autonomous Action.
front-matter/fm-how-to-use.html
F6
What This Book CoversFront matter for Building Embodied AI: From Perception to Autonomous Action.
front-matter/fm-what-this-book-covers.html
F7
Look Inside PreviewFront matter for Building Embodied AI: From Perception to Autonomous Action.
front-matter/look-inside-preview.html
F8
Application Reader PathwaysApplication-specific pathways through the book.
front-matter/application-reader-pathways.html
F9
Copyright and LegalFront matter for Building Embodied AI: From Perception to Autonomous Action.
front-matter/copyright.html

Part I · Foundations of Embodied AI

3 chapters · 24 sections

The conceptual vocabulary of agents, environments, embodiment, and closed-loop intelligence.

1
From Static AI to Embodied AI Theory, practical recipe, lab, and library shortcuts for this chapter.
part-1-foundations-of-embodied-ai/module-01-from-static-ai-to-embodied-ai/
2
The Agent-Environment Interface Theory, practical recipe, lab, and library shortcuts for this chapter.
part-1-foundations-of-embodied-ai/module-02-the-agent-environment-interface/
3
Embodied System Architectures Theory, practical recipe, lab, and library shortcuts for this chapter.
part-1-foundations-of-embodied-ai/module-03-embodied-system-architectures/

Part II · Mathematical, Robotics, and Control Foundations

5 chapters · 36 sections

The geometry, kinematics, dynamics, control, and sensing that make physical agents intelligible.

4
Spatial Representation and Coordinate Frames Theory, practical recipe, lab, and library shortcuts for this chapter.
part-2-mathematical-robotics-and-control-foundations/module-04-spatial-representation-and-coordinate-frames/
5
Kinematics and Robot Motion Theory, practical recipe, lab, and library shortcuts for this chapter.
part-2-mathematical-robotics-and-control-foundations/module-05-kinematics-and-robot-motion/
6
Dynamics and Simulation Math Theory, practical recipe, lab, and library shortcuts for this chapter.
part-2-mathematical-robotics-and-control-foundations/module-06-dynamics-and-simulation-math/
7
Control for AI Practitioners Theory, practical recipe, lab, and library shortcuts for this chapter.
part-2-mathematical-robotics-and-control-foundations/module-07-control-for-ai-practitioners/
8
Sensors, Perception Hardware, and State Estimation Theory, practical recipe, lab, and library shortcuts for this chapter.
part-2-mathematical-robotics-and-control-foundations/module-08-sensors-perception-hardware-and-state-estimation/

Part III · Simulation, Tooling, and the Modern Stack

5 chapters · 32 sections

The simulators, environments, benchmarks, and synthetic-data practices used to build embodied systems today.

9
Why Simulation Is Central Theory, practical recipe, lab, and library shortcuts for this chapter.
part-3-simulation-tooling-and-the-modern-stack/module-09-why-simulation-is-central/
10
Environments with Gymnasium (and PettingZoo) Theory, practical recipe, lab, and library shortcuts for this chapter.
1. 10.1 Gym is dead; Gymnasium is the standard
2. 10.2 Observation and action spaces
3. 10.3 Reward design and termination
4. 10.4 Vectorized environments; wrappers
5. 10.5 Rendering, logging, and debugging
6. 10.6 Evaluation protocol and seeding
7. 10.7 PettingZoo for multi-agent
part-3-simulation-tooling-and-the-modern-stack/module-10-environments-with-gymnasium-and-pettingzoo/
11
Physics Simulators: MuJoCo, MJX, Isaac Lab, Genesis Theory, practical recipe, lab, and library shortcuts for this chapter.
part-3-simulation-tooling-and-the-modern-stack/module-11-physics-simulators-mujoco-mjx-isaac-lab-genesis/
12
Benchmarks and Task Suites Theory, practical recipe, lab, and library shortcuts for this chapter.
part-3-simulation-tooling-and-the-modern-stack/module-12-benchmarks-and-task-suites/
13
Domain Randomization and Synthetic Data Theory, practical recipe, lab, and library shortcuts for this chapter.
part-3-simulation-tooling-and-the-modern-stack/module-13-domain-randomization-and-synthetic-data/

Part IV · Reinforcement Learning for Embodied Agents

7 chapters · 37 sections

Interaction-driven learning, from policy gradients and off-policy methods to safe exploration and sim-to-real transfer.

14
Reinforcement Learning Refresher Theory, practical recipe, lab, and library shortcuts for this chapter.
part-4-reinforcement-learning-for-embodied-agents/module-14-reinforcement-learning-refresher/
15
Policy Gradient Methods and PPO Theory, practical recipe, lab, and library shortcuts for this chapter.
part-4-reinforcement-learning-for-embodied-agents/module-15-policy-gradient-methods-and-ppo/
16
Value-Based and Off-Policy Methods Theory, practical recipe, lab, and library shortcuts for this chapter.
1. 16.1 Q-learning; deep Q-networks
2. 16.2 Replay buffers and target networks
3. 16.3 Continuous control: DDPG, TD3, SAC
4. 16.4 Maximum-entropy RL
5. 16.5 Sample efficiency and off-policy failure modes
part-4-reinforcement-learning-for-embodied-agents/module-16-value-based-and-off-policy-methods/
17
Massively Parallel and GPU RL Theory, practical recipe, lab, and library shortcuts for this chapter.
part-4-reinforcement-learning-for-embodied-agents/module-17-massively-parallel-and-gpu-rl/
18
Reward Design and Goal Specification Theory, practical recipe, lab, and library shortcuts for this chapter.
part-4-reinforcement-learning-for-embodied-agents/module-18-reward-design-and-goal-specification/
19
Exploration in Embodied Worlds Theory, practical recipe, lab, and library shortcuts for this chapter.
part-4-reinforcement-learning-for-embodied-agents/module-19-exploration-in-embodied-worlds/
20
Sim-to-Real Transfer (RL focus) Theory, practical recipe, lab, and library shortcuts for this chapter.
part-4-reinforcement-learning-for-embodied-agents/module-20-sim-to-real-transfer-rl-focus/

Part V · Learning from Demonstration and Robot Data

6 chapters · 33 sections

A coherent segment of the embodied ai stack.

21
Imitation Learning Theory, practical recipe, lab, and library shortcuts for this chapter.
part-5-learning-from-demonstration-and-robot-data/module-21-imitation-learning/
22
Action Chunking and Diffusion Policies Theory, practical recipe, lab, and library shortcuts for this chapter.
part-5-learning-from-demonstration-and-robot-data/module-22-action-chunking-and-diffusion-policies/
23
Teleoperation and Data Collection Theory, practical recipe, lab, and library shortcuts for this chapter.
part-5-learning-from-demonstration-and-robot-data/module-23-teleoperation-and-data-collection/
24
Robot Datasets and Data Scaling Laws Theory, practical recipe, lab, and library shortcuts for this chapter.
part-5-learning-from-demonstration-and-robot-data/module-24-robot-datasets-and-data-scaling-laws/
25
Offline RL and Dataset-Based Robot Learning Theory, practical recipe, lab, and library shortcuts for this chapter.
part-5-learning-from-demonstration-and-robot-data/module-25-offline-rl-and-dataset-based-robot-learning/
26
Skills, Hierarchy, and Task Decomposition Theory, practical recipe, lab, and library shortcuts for this chapter.
part-5-learning-from-demonstration-and-robot-data/module-26-skills-hierarchy-and-task-decomposition/

Part VI · Embodied Perception

4 chapters · 27 sections

Vision, 3d understanding, localization, mapping, and navigation as perception for action.

27
Visual Perception for Action Theory, practical recipe, lab, and library shortcuts for this chapter.
part-6-embodied-perception/module-27-visual-perception-for-action/
28
3D Perception and Neural Scene Representations Theory, practical recipe, lab, and library shortcuts for this chapter.
part-6-embodied-perception/module-28-3d-perception-and-neural-scene-representations/
29
Localization and Mapping (SLAM) Theory, practical recipe, lab, and library shortcuts for this chapter.
1. 29.1 Where am I and what does the world look like
2. 29.2 Odometry and dead reckoning
3. 29.3 Localization (Monte Carlo / particle filters)
4. 29.4 Mapping and occupancy grids
5. 29.5 SLAM: graph-based and visual SLAM
6. 29.6 Neural and Gaussian-splat SLAM
7. 29.7 Map uncertainty
8. 29.8 Modern SLAM Systems And Failure Modes
part-6-embodied-perception/module-29-localization-and-mapping-slam/
30
Navigation and Path Planning Theory, practical recipe, lab, and library shortcuts for this chapter.
part-6-embodied-perception/module-30-navigation-and-path-planning/

Part VII · Language, Vision, and Action

5 chapters · 35 sections

Language-guided agents, vlms, llm planners, vlas, and cross-embodiment foundation models.

31
Language-Guided Embodied Agents Theory, practical recipe, lab, and library shortcuts for this chapter.
part-7-language-vision-and-action/module-31-language-guided-embodied-agents/
32
Vision-Language Models for Embodiment Theory, practical recipe, lab, and library shortcuts for this chapter.
part-7-language-vision-and-action/module-32-vision-language-models-for-embodiment/
33
LLMs as Planners and Controllers Theory, practical recipe, lab, and library shortcuts for this chapter.
part-7-language-vision-and-action/module-33-llms-as-planners-and-controllers/
34
Vision-Language-Action Models Theory, practical recipe, lab, and library shortcuts for this chapter.
part-7-language-vision-and-action/module-34-vision-language-action-models/
35
Robot Foundation Models and Cross-Embodiment Learning Theory, practical recipe, lab, and library shortcuts for this chapter.
part-7-language-vision-and-action/module-35-robot-foundation-models-and-cross-embodiment-learning/

Part VIII · World Models and Model-Based Embodied AI

6 chapters · 32 sections

Prediction, latent dynamics, model-based control, generative worlds, and diffusion planning.

36
Predicting the Future Theory, practical recipe, lab, and library shortcuts for this chapter.
part-8-world-models-and-model-based-embodied-ai/module-36-predicting-the-future/
37
Model-Based RL and MPC Theory, practical recipe, lab, and library shortcuts for this chapter.
part-8-world-models-and-model-based-embodied-ai/module-37-model-based-rl-and-mpc/
38
Latent World Models Theory, practical recipe, lab, and library shortcuts for this chapter.
1. 38.1 Why predict in latent space
2. 38.2 Autoencoders and recurrent state-space models (RSSM)
3. 38.3 Dreamer to DreamerV3
4. 38.4 Transformer world models (IRIS)
5. 38.5 TD-MPC2: latent MPC at scale
6. 38.6 World models for visual control
part-8-world-models-and-model-based-embodied-ai/module-38-latent-world-models/
39
Generative and Video World Models Theory, practical recipe, lab, and library shortcuts for this chapter.
part-8-world-models-and-model-based-embodied-ai/module-39-generative-and-video-world-models/
40
Predictive Representations and Self-Supervised World Models Theory, practical recipe, lab, and library shortcuts for this chapter.
part-8-world-models-and-model-based-embodied-ai/module-40-predictive-representations-and-self-supervised-world-models/
41
Diffusion and Generative Planning Theory, practical recipe, lab, and library shortcuts for this chapter.
part-8-world-models-and-model-based-embodied-ai/module-41-diffusion-and-generative-planning/

Part IX · Manipulation, Locomotion, and Embodied Skills

7 chapters · 45 sections

Hands, legs, humanoids, drones, vehicles, and the skills that let agents move through the world.

42
Robotic Manipulation Theory, practical recipe, lab, and library shortcuts for this chapter.
part-9-manipulation-locomotion-and-embodied-skills/module-42-robotic-manipulation/
43
Grasping and Dexterous Manipulation Theory, practical recipe, lab, and library shortcuts for this chapter.
part-9-manipulation-locomotion-and-embodied-skills/module-43-grasping-and-dexterous-manipulation/
44
Tactile and Visuo-Tactile Learning Theory, practical recipe, lab, and library shortcuts for this chapter.
part-9-manipulation-locomotion-and-embodied-skills/module-44-tactile-and-visuo-tactile-learning/
45
Locomotion and Mobility Theory, practical recipe, lab, and library shortcuts for this chapter.
part-9-manipulation-locomotion-and-embodied-skills/module-45-locomotion-and-mobility/
46
Humanoid Robots and Whole-Body Control Theory, practical recipe, lab, and library shortcuts for this chapter.
part-9-manipulation-locomotion-and-embodied-skills/module-46-humanoid-robots-and-whole-body-control/
47
Drones and Aerial Embodied AI Theory, practical recipe, lab, and library shortcuts for this chapter.
part-9-manipulation-locomotion-and-embodied-skills/module-47-drones-and-aerial-embodied-ai/
48
Autonomous Driving as Embodied AI Theory, practical recipe, lab, and library shortcuts for this chapter.
part-9-manipulation-locomotion-and-embodied-skills/module-48-autonomous-driving-as-embodied-ai/

Part X · Multi-Agent and Human-Centered Embodiment

3 chapters · 16 sections

Teams of agents, humans in the loop, open worlds, and lifelong interaction.

49
Multi-Agent Embodied AI Theory, practical recipe, lab, and library shortcuts for this chapter.
part-10-multi-agent-and-human-centered-embodiment/module-49-multi-agent-embodied-ai/
50
Human-Robot Interaction Theory, practical recipe, lab, and library shortcuts for this chapter.
1. 50.1 Robots among humans
2. 50.2 Natural-language interaction and social navigation
3. 50.3 Intent recognition and trust calibration
4. 50.4 Explainable robot behavior
5. 50.5 Human feedback and shared autonomy
6. 50.6 Ethical concerns
part-10-multi-agent-and-human-centered-embodiment/module-50-human-robot-interaction/
51
Open-World and Novelty-Robust Embodiment Theory, practical recipe, lab, and library shortcuts for this chapter.
part-10-multi-agent-and-human-centered-embodiment/module-51-open-world-and-lifelong-embodiment/

Part XI · Evaluation, Safety, Robustness, and Deployment

4 chapters · 21 sections

Metrics, uncertainty, safety filters, deployment architecture, and operational discipline.

52
Evaluating Embodied Systems Theory, practical recipe, lab, and library shortcuts for this chapter.
part-11-evaluation-safety-robustness-and-deployment/module-52-evaluating-embodied-systems/
53
Robustness and Uncertainty Theory, practical recipe, lab, and library shortcuts for this chapter.
part-11-evaluation-safety-robustness-and-deployment/module-53-robustness-and-uncertainty/
54
Safety in Embodied AI Theory, practical recipe, lab, and library shortcuts for this chapter.
part-11-evaluation-safety-robustness-and-deployment/module-54-safety-in-embodied-ai/
55
Deployment Architecture Theory, practical recipe, lab, and library shortcuts for this chapter.
part-11-evaluation-safety-robustness-and-deployment/module-55-deployment-architecture/

Part XII · Frontiers, Capstones, and Course Design

5 chapters · 32 sections

Memory, continual learning, open problems, capstone projects, and teaching paths.

56
Embodied Agents with Memory Theory, practical recipe, lab, and library shortcuts for this chapter.
1. 56.1 Why memory matters; short- vs. long-term
2. 56.2 Spatial, episodic, and semantic memory
3. 56.3 Memory retrieval for planning
4. 56.4 Memory errors
part-12-frontiers-capstones-and-course-design/module-56-embodied-agents-with-memory/
57
Continual and Lifelong Learning Theory, practical recipe, lab, and library shortcuts for this chapter.
part-12-frontiers-capstones-and-course-design/module-57-continual-and-lifelong-learning/
58
Frontier and Open Problems Theory, practical recipe, lab, and library shortcuts for this chapter.
part-12-frontiers-capstones-and-course-design/module-58-frontier-and-open-problems/
59
Capstone Projects Theory, practical recipe, lab, and library shortcuts for this chapter.
part-12-frontiers-capstones-and-course-design/module-59-capstone-projects/
60
Teaching with This Book Theory, practical recipe, lab, and library shortcuts for this chapter.
part-12-frontiers-capstones-and-course-design/module-60-teaching-with-this-book/

Appendices · Reference and Pedagogy

9 appendices

A
Linear Algebra and 3D Geometry RefresherReference material supporting the self-contained book promise.
appendices/appendix-a-linear-algebra-3d-geometry/
B
Probability, Estimation, and Optimization RefresherReference material supporting the self-contained book promise.
appendices/appendix-b-probability-estimation-optimization/
C
The Embodied AI ToolboxReference material supporting the self-contained book promise.
appendices/appendix-c-embodied-ai-toolbox/
D
PyTorch and JAX for Embodied AIReference material supporting the self-contained book promise.
appendices/appendix-d-pytorch-jax/
E
Compute RecipesReference material supporting the self-contained book promise.
appendices/appendix-e-compute-recipes/
F
Datasets and Benchmarks CatalogReference material supporting the self-contained book promise.
appendices/appendix-f-datasets-benchmarks/
G
Reproducibility and Experiment HygieneReference material supporting the self-contained book promise.
appendices/appendix-g-reproducibility/
H
Notation and GlossaryReference material supporting the self-contained book promise.
appendices/appendix-h-notation-glossary/
I
Citing the FrontierReference material supporting the self-contained book promise.
appendices/appendix-i-citing-frontier/