Section 8.6: Bayesian filtering: Kalman, EKF, particle filters | Building Embodied AI: From Perception to Autonomous Action

A Careful Control Loop

Big Picture

Bayesian filtering: Kalman, EKF, particle filters is one lens on sensors, perception hardware, and state estimation. We study it because an embodied agent needs decisions that survive contact with noisy sensors, delayed effects, and changing environments.

This section develops the technical contract for Bayesian filtering: Kalman, EKF, particle filters into a usable mental model. First we define the object of study, then we connect it to the agent loop, then we test it with a compact implementation.

The key question in Bayesian filtering: Kalman, EKF, particle filters is practical: what must the agent know, what can it observe, what action is available, and what evidence shows that the action worked under the stated conditions?

Action Is The Test

A representation earns its place when it changes the measurable action interface. In Bayesian filtering: Kalman, EKF, particle filters, the reader should keep asking which decision becomes easier, safer, or more reliable.

Theory

For Bayesian filtering: Kalman, EKF, particle filters, the practical design rule is to make the interface inspectable before optimization begins: inputs, outputs, units, latency, bounds, and failure labels should all be visible in the saved artifact.

Mechanism

The mechanism in Bayesian filtering: Kalman, EKF, particle filters is the contract between representation and action. Name what enters the module, what leaves it, which assumptions make that transformation valid, and which log would reveal a bad handoff.

The Kalman Filter

The Kalman filter is the exact recursive solution to state estimation when the dynamics and the measurement are linear and all noise is Gaussian. It keeps a belief about the state as a mean vector $\hat{x}$ and a covariance matrix $P$, and it alternates two steps every time step: a predict step that pushes the belief forward through the motion model, and an update step that pulls it back toward a fresh measurement.

Let the state evolve as $x_k = A_k x_{k-1} + B_k u_k + w_k$ with process noise $w_k \sim \mathcal{N}(0, Q_k)$, and let the sensor report $z_k = H_k x_k + v_k$ with measurement noise $v_k \sim \mathcal{N}(0, R_k)$. The predict step is

$$\hat{x}^-_k = A_k \hat{x}_{k-1} + B_k u_k$$ $$P^-_k = A_k P_{k-1} A_k^T + Q_k$$

The superscript minus marks the prior belief, before the measurement is seen. Notice that $P$ always grows in the predict step: pushing the belief through an imperfect model can only add uncertainty. The update step then incorporates the measurement $z_k$:

$$K_k = P^-_k H_k^T (H_k P^-_k H_k^T + R_k)^{-1}$$ $$\hat{x}_k = \hat{x}^-_k + K_k (z_k - H_k \hat{x}^-_k)$$ $$P_k = (I - K_k H_k) P^-_k$$

The matrix $K_k$ is the Kalman gain, and it is the heart of the filter. Read it as a trust dial. When $K_k \to 0$ the filter ignores the measurement and rides on the prediction; when $K_k \to I$ (in the scalar sense) it throws away the prediction and snaps to the sensor. The gain is computed automatically from the relative sizes of the prior covariance $P^-_k$ and the measurement noise $R_k$: a confident prediction or a noisy sensor lowers the gain, and a vague prediction or a sharp sensor raises it.

The term $z_k - H_k \hat{x}^-_k$ is the innovation, the difference between what the sensor actually reported and what the prediction expected it to report. A filter that is working well produces an innovation that looks like zero-mean white noise; a biased or growing innovation is the clearest signal that the model or the noise settings are wrong.

Q And R Are The Two Knobs

$Q_k$ is the process noise: raise it to tell the filter to trust the motion model less and the sensor more. $R_k$ is the measurement noise: raise it to tell the filter to trust the sensor less and the model more. Almost all hand tuning of a Kalman filter is choosing the ratio between these two.

Worked Example: a 1D Constant-Velocity Tracker

The cleanest way to feel the predict-update loop is to track a target that moves at roughly constant velocity while measuring only its position. The state is $x = [\text{position}, \text{velocity}]^T$. The motion model says position advances by velocity times $\Delta t$ and velocity stays put, so $A = \begin{bmatrix} 1 & \Delta t \\ 0 & 1 \end{bmatrix}$. The sensor sees position only, so $H = [1\ \ 0]$. Velocity is never measured directly; the filter infers it from how position changes, which is exactly what makes the example instructive.

# 1D constant-velocity Kalman filter.
# State x = [position, velocity]^T. We measure noisy position only.
import numpy as np

dt = 1.0
A = np.array([[1.0, dt],      # position += velocity * dt
              [0.0, 1.0]])    # velocity constant
H = np.array([[1.0, 0.0]])    # measure position only
Q = np.array([[0.01, 0.0],    # process noise: the model is not perfect
              [0.0, 0.01]])
R = np.array([[1.0]])         # measurement noise variance (sigma^2 = 1)

# Ground truth: starts at 0, moves at 1.0 unit/step.
rng = np.random.default_rng(7)
steps = 20
true_pos = np.arange(steps) * 1.0
meas = true_pos + rng.normal(0.0, 1.0, size=steps)

x = np.array([[0.0], [0.0]])  # initial state guess
P = np.eye(2) * 5.0           # initial uncertainty (deliberately high)

print(" k  meas   est_pos  est_vel   P_pos")
for k in range(steps):
    # Predict
    x = A @ x
    P = A @ P @ A.T + Q
    # Update
    z = np.array([[meas[k]]])
    S = H @ P @ H.T + R
    Kk = P @ H.T @ np.linalg.inv(S)        # Kalman gain
    x = x + Kk @ (z - H @ x)               # innovation correction
    P = (np.eye(2) - Kk @ H) @ P
    print(f"{k:2d} {meas[k]:6.2f}  {x[0,0]:7.2f}  {x[1,0]:6.2f}  {P[0,0]:6.3f}")

print(f"\nfinal velocity estimate = {x[1,0]:.3f} (truth = 1.000)")
print(f"final position variance = {P[0,0]:.3f} (started at 5.0)")

Code Fragment 8.6.1: a complete 1D constant-velocity Kalman tracker. Over 20 steps the velocity estimate converges toward the true 1.0 unit/step even though velocity is never measured, and the position variance $P_{00}$ contracts from 5.0 to about 0.37 as the filter accumulates evidence.

Running this prints a converging trace. The velocity estimate climbs from the initial guess of 0 toward roughly 0.8 to 1.0 within a handful of steps, and the position variance shrinks monotonically as repeated measurements add information. The filter recovers an unmeasured quantity (velocity) purely from the structure of the motion model and the time-correlation of position measurements.

Nonlinear Beliefs: EKF and Particle Filters

Real robots rarely have linear dynamics. A differential-drive robot turning by a heading angle, a range-bearing landmark measurement, and a camera projection are all nonlinear. The extended Kalman filter (EKF) handles this by linearizing the nonlinear functions around the current estimate. If the motion model is $x_k = f(x_{k-1}, u_k)$, the EKF replaces $A_k$ with the Jacobian

$$F_k = \frac{\partial f}{\partial x}\Big|_{\hat{x}_{k-1}}$$

evaluated at the current estimate, and similarly replaces $H_k$ with the Jacobian of the measurement function. The covariance equations then run exactly as before but with these local linear approximations. The EKF is the workhorse of classical robot localization and SLAM, but it inherits one weakness: if the function is strongly curved near the estimate, or if the estimate is far from the truth, the linearization point is poor and the filter can diverge while still reporting a confident covariance.

When the belief itself is non-Gaussian (multimodal, sharply bounded, or shaped by discrete hypotheses), no single mean-and-covariance pair describes it. The particle filter abandons the Gaussian assumption entirely and represents the belief as a cloud of weighted samples (particles). Each particle is a hypothesis about the state; the predict step pushes every particle through the (possibly nonlinear) motion model with sampled noise, the update step reweights each particle by how well it explains the measurement, and a resampling step concentrates particles in high-probability regions. This Monte Carlo approach is the basis of AMCL, the standard ROS localization stack, and it shines exactly where the Kalman family struggles: the kidnapped-robot problem, ambiguous symmetric environments, and heavy-tailed sensor noise. The cost is compute, which grows with the number of particles, and the risk is particle impoverishment, where resampling discards the rare particle that happened to be correct.

Library Shortcut

The fragment should expose prediction, innovation, gain or weights, covariance, and accepted measurements. FilterPy is useful for teaching; robot_localization and AMCL carry the pattern into ROS systems.

Practical Recipe

Write the observation, action, and success metric before choosing a model.
Build a baseline that is simple enough to debug by inspection.
Add the library implementation only after the baseline behavior is understood.
Record failures as structured cases: perception error, state error, planning error, control error, or evaluation error.
Run at least one perturbation test before trusting the result.

Common Failure Mode

The common mistake in Bayesian filtering: Kalman, EKF, particle filters is to celebrate the component score before checking the closed-loop handoff. The failure usually appears at the boundary: stale state, wrong frame, delayed action, saturated actuator, or metric that ignores the real task cost.

Practical Example

A robotics team using Bayesian filtering: Kalman, EKF, particle filters should log not only final success, but intermediate observations, chosen actions, controller status, and recovery events. The logs reveal whether the method is solving the task or merely passing the easiest episodes.

Memory Hook

For bayesian filtering: kalman, ekf, particle filters, the useful test is simple: could a teammate point to the log line, plot, or trace that proves the idea changed the agent's next action?

Research Frontier

For Bayesian filtering: Kalman, EKF, particle filters, treat frontier claims as hypotheses until they expose enough detail to reproduce the result: data boundary, embodiment, controller interface, evaluation panel, and failure cases.

Self Check

Can you name the observation, state estimate, action, success metric, and most likely failure mode for Bayesian filtering: Kalman, EKF, particle filters? If not, the system boundary is still too vague.

Production Pattern

Bayesian filtering: Kalman, EKF, particle filters sits inside the Part II robotics contract: geometry defines where things are, kinematics defines what motion is possible, dynamics defines what motion costs, control defines how errors are corrected, and sensing defines what the agent can know on time.

A Bayesian filter is a predict-update loop whose covariance is part of the output, not a decoration. This makes the section useful to students, builders, and researchers at the same time: the idea has an intuitive role, a formal interface, a runnable check, and a failure mode that can be reproduced.

Mechanism To Watch

For Bayesian filtering: Kalman, EKF, particle filters, state estimation converts imperfect observations into a belief usable by control. Preserve calibration, covariance, timestamp, frame, dropout behavior, and latency.

Library Choices And Verification Checks

Tool or Library	What It Handles	Verification Check
OpenCV	handles camera models, calibration, projection, and vision preprocessing	Verify intrinsics, distortion, image timestamp, and frame-to-camera transform.
ROS 2 robot_localization	fuses odometry, IMU, GPS, pose, and twist streams through ROS estimation nodes	Verify covariance, frame IDs, timestamps, and rejected measurement counts.
FilterPy	teaches and prototypes Kalman, extended Kalman, unscented, and particle filters	Verify process noise, measurement noise, innovation, and covariance growth.
Kalibr	supports practical work on Bayesian filtering: Kalman, EKF, particle filters	Verify the library output against the hand-built baseline on one small case.
Open3D	supports practical work on Bayesian filtering: Kalman, EKF, particle filters	Verify the library output against the hand-built baseline on one small case.

Use this recipe when turning Bayesian filtering: Kalman, EKF, particle filters into code, a simulator experiment, or a robot diagnostic. The point is not to use every library. The point is to keep the hand-built baseline and the maintained-tool path comparable.

Define each sensor message with units, frame, timestamp source, calibration file, and covariance meaning.
Run a static test, a slow-motion test, and a dropout test before fusing streams.
Compare the hand filter with FilterPy or ROS 2 robot_localization using identical measurements and noise settings.
Log innovation, covariance, delayed messages, rejected measurements, and downstream control effect.
Treat perception output as a belief with uncertainty, not as ground truth handed to the controller.

Evidence Gate

For Bayesian filtering: Kalman, EKF, particle filters, compare methods only through one saved artifact that preserves the inputs, outputs, units, timestamps, latency budget, configuration, seed, metric definition, and failure labels relevant to this section. The comparison is meaningful only when the same script evaluates the same panel.

Exercise Extension

Extend the section exercise by adding one perturbation specific to Bayesian filtering: Kalman, EKF, particle filters and one latency or uncertainty check. Save the result in the EvidenceRecord schema, then explain which library output you trust and why.

Filtering failures come from a wrong process model, bad measurement likelihood, overconfident covariance, delayed updates, or particle impoverishment. Audit prediction, update, gating, and reset before changing the policy.

Technical Core

Bayesian filtering answers a simple robotics question: after the robot moves and a noisy sensor reports something, what should the robot believe now? The Kalman filter gives the closed-form answer for linear Gaussian systems, the extended Kalman filter linearizes a nonlinear model around the current estimate, and particle filters represent belief with samples when one Gaussian is not enough. Figure 8.6.T summarizes the chain this section must preserve when moving from a teaching example to a real embodied system.

Figure 8.6.T: The technical core for Bayesian filtering: Kalman, EKF, particle filters connects assumptions, model, algorithm, evidence, and failure analysis.

Formal Object

The linear Kalman filter predicts $\hat x^-_t=F\hat x_{t-1}+Bu_t$ and $P^-_t=FP_{t-1}F^\top+Q$, then computes $K_t=P^-_tH^\top(HP^-_tH^\top+R)^{-1}$. The update is $\hat x_t=\hat x^-_t+K_t(z_t-H\hat x^-_t)$ and $P_t=(I-K_tH)P^-_t$. The term $z_t-H\hat x^-_t$ is the innovation, the part of the measurement that the prediction failed to explain.

Filter choice and diagnostic audit

Use a Kalman filter when dynamics and measurements are close to linear and noise is close to Gaussian.
Use an extended Kalman filter when nonlinear functions can be locally linearized with reliable Jacobians.
Use a particle filter when belief is multimodal, strongly nonlinear, or constrained by discrete hypotheses.
Record innovation, Kalman gain, posterior covariance, and rejected measurements at every update.
Replay delayed, missing, and outlier measurements to see whether the filter fails gracefully.

Technical Contract For Bayesian Filters

Filter	Assumption	Failure Diagnostic
Kalman filter	Linear dynamics, linear measurement model, Gaussian noise.	Innovation is biased, covariance collapses, or residuals are not white.
Extended Kalman filter	Nonlinear model is locally well approximated by a Jacobian.	Linearization point is poor, angle wrapping is mishandled, or Jacobians are wrong.
Unscented Kalman filter	Belief can stay approximately Gaussian while sigma points capture nonlinear effects.	Sigma-point spread misses discontinuities or constraints.
Particle filter	Belief can be represented by weighted samples.	Particle depletion, poor proposal distribution, or resampling removes rare true hypotheses.

Expected output is a belief trace, not only a best estimate. The mean should move toward credible measurements, the covariance should contract only when information is actually added, and the innovation should look like noise after the model explains what it can explain.

Failure Mode To Test

A filter fails when it reports high confidence after unmodeled slip, delayed measurements, a bad Jacobian, or particle impoverishment. Smooth output is not evidence of a correct belief.

Section References

Core references for Bayesian filtering: Kalman, EKF, particle filters: Modern Robotics; Murray, Li, and Sastry; Siciliano et al.; LaValle; and official documentation for Drake, MuJoCo, Pinocchio, CasADi, python-control, GTSAM, ROS 2, and OpenCV as applicable.

Use these references to check notation, frame conventions, units, solver assumptions, and maintained-library behavior.

Key Takeaway

Bayesian filtering: Kalman, EKF, particle filters is useful when it makes the perception-action loop more reliable, not when it merely adds a more impressive model name.

Exercise 8.6.1

Design a method-matched experiment for Bayesian filtering: Kalman, EKF, particle filters. Specify the environment, observations, actions, metric, one perturbation, and the library output you would compare against the hand-built baseline.