Section 60.3: Two-semester sequence

"The first term builds the parts. The second term asks them to survive each other."

A Two-Term Embodied Curriculum
Technical illustration for Section 60.3: Two-semester sequence.
Figure 60.3A: A two-semester sequence laid out as two 14-week blocks: semester one covers foundations through RL and IL, semester two covers language-action models, world models, and the capstone, with research-track students branching to a literature-seminar thread in week 20.
Big Picture

Two-semester sequence gives Teaching with This Book a concrete systems role: use the first term for foundations and the second for integrated systems and capstones. The section keeps asking what the agent observes, what it remembers or updates, which action changes, and what evidence would convince a skeptical reader.

This section develops the technical contract for two-semester sequence into a usable mental model. First we define the object of study, then we connect it to the agent loop, then we test it with a compact implementation.

The key question in Two-semester sequence is practical: what must the agent know, what can it observe, what action is available, and what evidence shows that the action worked under the stated conditions?

Action Is The Test

Two-semester sequence should be judged by the action it improves. A section claim is strong when it names the decision, the measurement, and the failure mode before a larger model or simulator is introduced.

Theory

For Two-semester sequence, the practical design rule is to make the interface inspectable before optimization begins: inputs, outputs, units, latency, bounds, and failure labels should all be visible in the saved artifact.

Mechanism

The mechanism in Two-semester sequence is the contract between representation and action. Name what enters the module, what leaves it, which assumptions make that transformation valid, and which log would reveal a bad handoff.

Worked Example

For Two-semester sequence, keep one concrete rollout in view. A sensor reading becomes an estimate, the estimate constrains an action, the action changes the world, and the next observation confirms or contradicts the assumption. The section's idea is useful only if it improves that loop.

Library Shortcut

For Two-semester sequence, the small contract exists to expose the teaching artifact before tooling takes over. Use notebooks, simulators, shared logs, rubrics, and capstone studios only when they preserve the same observation, action, metric, and failure fields.

Practical Recipe

  1. Write the observation, action, and success metric before choosing a model.
  2. Build a baseline that is simple enough to debug by inspection.
  3. Add the library implementation only after the baseline behavior is understood.
  4. Record failures as structured cases: perception error, state error, planning error, control error, or evaluation error.
  5. Run at least one perturbation test before trusting the result.
Common Failure Mode

The common mistake in Two-semester sequence is to trust a component score before checking the closed-loop interface. The failure usually appears where state, timing, authority, or evaluation context crosses a module boundary.

Practical Example

A team using Two-semester sequence starts by writing the task panel, not by picking the largest model. They keep a baseline run, a maintained-tool run, and a perturbation run in the same result folder. The comparison is accepted only when the action trace, metric, and failure labels come from one script.

Memory Hook

When two-semester sequence feels abstract, ask what would be different in the next frame of video, the next robot state, or the next safety margin.

Research Frontier

For Two-semester sequence, the open research question is not whether a larger policy can produce a better demo. The sharper question is whether the method improves reliability across new scenes, new embodiments, delayed feedback, and rare failures under an evaluation protocol that another lab can reproduce.

Self Check

For Two-semester sequence, can you name the observation, action, protected assumption, success metric, and one likely failure case? If any field is vague, rewrite the contract before adding model complexity.

Topic-Native Deepening

A two-semester sequence is where the book can fully breathe. The first term can establish the physics, estimation, simulation, and policy-learning foundations; the second can move into language, world models, safety, and longer capstones without compressing everything into one overloaded arc.

The course-design challenge is coherence across the handoff. Students should feel that the second term extends the same loop rather than starting a second disconnected subject.

Why This Section Matters

Two-semester sequence becomes teachable once the student can state the operative variables, the decision boundary, and the evidence artifact. The section should therefore be read together with Part V and Part VIII, where the same loop is developed from adjacent angles.

Formal Object

Let semester one build foundation set $F$ and semester two build extension set $E$. The sequence works when prerequisite edges form a sparse DAG from $F$ to $E$, not a tangled graph that forces constant review of forgotten assumptions.

This is why the first semester should overinvest in frames, interfaces, data cards, and evaluation discipline. Those concepts quietly support everything interesting that happens later.

Algorithm: Split the book across two terms
  1. Use term one for perception, dynamics, control, state estimation, simulation, RL, and imitation.
  2. End term one with a modest integrative project that proves students can close the loop.
  3. Open term two with a brief refresh, then add language, VLAs, planning, 3D representation, and deployment topics.
  4. Run a larger second-term capstone that can draw from both terms without re-teaching prerequisites.
  5. Keep one shared evidence schema across both semesters so artifacts remain comparable.
Recommended Two-Term Split
DimensionWhat To SpecifyWhy It Matters
Term oneFoundations, control, state estimation, simulation, RL, imitationTechnical floor and first integrative project.
Interterm artifactBaseline system plus replay and postmortemPrevents term-two amnesia.
Term twoLanguage, planning, world models, safety, deployment, frontier topicsAdvanced synthesis.
Final deliverableResearch-grade capstone with proposal and defenseUses both halves of the sequence.
def validate_sequence(payload: dict[str, object]) -> dict[str, object]:
    assert payload, "payload must not be empty"
    return payload

# Two-term sequence card.
sequence = {
    "term_one_project": "simulator-based mobile manipulation baseline",
    "term_two_project": "language-conditioned embodied capstone",
    "shared_evidence_schema": True,
}
print(validate_sequence(sequence))
{'term_one_project': 'simulator-based mobile manipulation baseline', 'term_two_project': 'language-conditioned embodied capstone', 'shared_evidence_schema': True}
Code Fragment 60.3.A summarizes the topic-specific evidence card for two-semester sequence.

The expected output should reveal continuity across terms. If the evidence schema changes between semesters, students will struggle to connect the advanced work back to the foundations.

Library Shortcut

After the from-scratch contract is clear, the practical route uses Same book stack plus course project repositories, CI, shared data cards, simulator presets. The payoff is that standard interfaces, logging, batching, and replay support move from ad hoc glue code into maintained infrastructure, while the evidence schema stays the same.

Project Or Teaching Use

The strongest two-term designs keep term-one artifacts alive as baselines for term two. That makes progress legible and reduces the temptation to discard hard-won infrastructure every semester.

Research Frontier

A frontier teaching opportunity is to let term-two students reproduce or stress-test a current research claim using the infrastructure they built in term one. That is how the sequence becomes a research pipeline instead of two classes.

Expected Output Interpretation

For Two-semester sequence, the artifact should show the course-design decision, the evidence students must produce, and the failure mode that would trigger a revised assignment or rubric.

Key Takeaway
Exercise 60.3.1

Design a method-matched experiment for Two-semester sequence. Specify the environment, observation schema, action interface, metric, and one perturbation that targets the section's core assumption.

Section References

Biggs, J. Teaching for Quality Learning at University. Open University Press, 1999.

Use for constructive alignment between learning outcomes, activities, and assessment.

Anderson, L. W. and Krathwohl, D. R. A Taxonomy for Learning, Teaching, and Assessing. Longman, 2001.

Use for designing assessments that move from recall to analysis, creation, and evaluation.

What's Next?

Next, continue with the following teaching section, where the Two-semester sequence contract becomes a concrete course-design decision.