"A simulator that cannot talk to the rest of the robot is a very convincing island."
A Systems-Minded Embodied AI Agent
MuJoCo, MJX, Isaac Lab, Newton, and Genesis do not exhaust the simulator stack. Drake matters when model-based design, optimization, and verification are central. SAPIEN matters through manipulation-focused stacks such as ManiSkill. ROS 2 and modern Gazebo matter when the simulator must behave like part of a robot system rather than a standalone learning loop.
A simulator that cannot connect to the robot stack is a sandbox, not a deployment rehearsal. Middleware, clocks, transforms, and logs decide whether simulated success survives contact with hardware.
This section applies the dynamics background from Chapter 6: Dynamics and Simulation Math to the simulator stack introduced in Chapter 9: Why Simulation Is Central. It also prepares the GPU training workflows in Chapter 17: Massively Parallel and GPU RL and the randomization workflow in Chapter 13: Domain Randomization and Synthetic Data by tying tool choice to measurable task risk.
Drake: Model-Based Design And Verification
Drake is strongest when you need dynamics, planning, control, optimization, and verification in one systems framework. It is not primarily a massive RL environment factory. It is the tool you reach for when the question sounds like "can we formulate this controller, planner, or stability condition precisely?" rather than "how many randomized episodes can we run this hour?"
That distinction matters when a learned policy needs a model-based guardrail. Drake can help analyze contact modes, constraints, trajectories, or controller stability around a small scenario. It complements GPU training rather than replacing it.
SAPIEN And ManiSkill: Manipulation At Scale
SAPIEN provides physical simulation for robots, rigid bodies, and articulated objects. It is especially visible through ManiSkill, a manipulation and robot-learning benchmark framework powered by SAPIEN. ManiSkill3 emphasizes GPU-parallel simulation and rendering for manipulation tasks, which makes it a natural bridge from this chapter to the benchmark chapter that follows.
The benchmark strength is asset and task consistency. If the research question is manipulation generalization, a shared task suite can be more valuable than a hand-built simulator scene because other groups can reproduce the same objects, initial states, observations, and success metrics.
ROS 2 And Gazebo: Systems Integration
ROS 2 is not a physics simulator, and Gazebo is not a learning library. Together, they often form the systems-integration path for robot projects. Modern Gazebo connects simulation with sensors, controllers, robot descriptions, and ROS 2 nodes. Gazebo Classic reached end of life in January 2025, so new projects should use current Gazebo releases and compatible ROS 2 distributions.
The integration test is different from the training test. Instead of asking how many episodes per second you can run, ask whether clocks, transforms, messages, controllers, sensor topics, reset behavior, and logs match the robot software stack closely enough to reveal deployment failures before hardware time.
| Tool | Best fit | Not the best fit |
|---|---|---|
| Drake | Optimization, planning, control, verification, model-based design | Fastest path to thousands of RL rollouts |
| SAPIEN | Manipulation simulation and articulated object interaction | General robot middleware integration |
| ManiSkill | Manipulation benchmarks and GPU-parallel robot-learning tasks | Full custom robot operating stack tests |
| ROS 2 plus Gazebo | System integration, sensors, controllers, middleware, hardware-like testing | Pure accelerator-native policy training loops |
A System Boundary Check
Code Fragment 1 gives a small classifier for simulator needs. The useful part is not the code itself, but the habit: decide whether the primary risk is physics, learning throughput, control design, or systems integration.
# Classify a simulator need by the risk that dominates the project.
# This helps separate physics choice from systems-integration choice.
# Add your own project risks before making a tool recommendation.
def recommend_tool(primary_risk: str) -> str:
options = {
"control_verification": "Drake",
"manipulation_benchmark": "SAPIEN or ManiSkill",
"ros2_integration": "ROS 2 with modern Gazebo",
"gpu_policy_training": "Isaac Lab, MJX, or MuJoCo Warp",
}
return options.get(primary_risk, "Run a simulator audit before choosing")
for risk in ["control_verification", "ros2_integration", "unknown"]:
print(f"{risk}: {recommend_tool(risk)}")
control_verification: Drake ros2_integration: ROS 2 with modern Gazebo unknown: Run a simulator audit before choosing
The hand-written classifier is 15 lines of reasoning. In practice, Drake, ManiSkill, and Gazebo each provide maintained workflows that replace custom integration code. The shortcut only works if you first identify whether the project needs control analysis, manipulation benchmarks, or middleware realism.
Choose a simulator by task contract, not reputation. Record the dominant physical risk, required sensor model, throughput target, asset format, and integration boundary, then run the same task panel before comparing tools.
Do not build a new ROS 2 simulation workflow around Gazebo Classic. Use modern Gazebo releases and check the compatibility table for your ROS 2 distribution before committing to a stack.
A warehouse mobile robot team might train a navigation policy in a fast simulator, test ROS 2 nodes and sensors in modern Gazebo, and use Drake for a small control or verification problem. A single simulator does not need to own every stage if the interfaces are explicit.
Expected output: A multi-tool simulator plan should list each project phase, the tool assigned to that phase, the interface between tools, the shared robot model or conversion step, and the metric that proves the phase did its job.
The trap is asking one tool to be a proof assistant, benchmark suite, RL factory, and robot middleware rehearsal. Strong projects let each simulator do the job it was built to do.
Is your main problem learning a policy, proving a controller property, benchmarking manipulation, or testing a robot software stack? Write that sentence before choosing the simulator.
Pick one real robot project and split its simulator needs into three phases: algorithm development, benchmark evaluation, and system integration. Assign a tool to each phase and explain why one simulator may not be enough.
The most interesting simulator workflows are becoming multi-tool pipelines. A policy may be trained in a GPU simulator, evaluated in a benchmark suite, stress-tested in a ROS 2 system simulation, and checked with model-based analysis. The research frontier is making those transitions reproducible rather than informal, especially when robot descriptions, logs, and evaluation metrics must survive movement between tools.
Drake, SAPIEN, ROS 2, and Gazebo complete the simulator stack because they solve different problems: control analysis, manipulation tasks, and robot-system integration.
Section 11.8 turns the entire chapter into a decision guide and a runnable simulator-selection lab.
Drake Development Team. "Drake: Model-Based Design and Verification for Robotics."
Drake is the key reference for optimization-based design, planning, control, and verification. It is the right deeper read for readers whose simulator needs are analytical rather than only throughput-driven.
SAPIEN Team. "SAPIEN Simulator."
SAPIEN provides simulation for robots, rigid bodies, and articulated objects. It is especially relevant for manipulation workflows and for understanding the simulator behind ManiSkill.
ManiSkill Team. "ManiSkill Documentation."
ManiSkill is an open-source robot simulation and training framework powered by SAPIEN. It connects this chapter's simulator discussion to the benchmark and manipulation chapters that follow.
Open Robotics. "Gazebo Documentation."
Modern Gazebo is the current simulator path for many ROS 2 integration workflows. Readers should use these docs rather than Gazebo Classic tutorials when starting new projects.
Open Robotics. "Gazebo Classic End-of-Life Notice."
The Gazebo Classic site documents the January 2025 end-of-life warning. It is included here because using current Gazebo rather than legacy Gazebo is a practical simulator-selection requirement.