t = 0
What it tests: train a policy at 16×16 / 4 agents (the parent), then warm-start a 32×32 / 10-agent run from it (the child) and see how far that initialisation carries — on open and rooms, two seeds, for both perception arms (base = SLAM-only, occ = occupancy belief). The 16² parents are shown alongside for context (labelled 16² parent). The interesting comparison is the warm-started 32² child against a 32² trained-from-scratch baseline.
| Stage | Arm | Map | Coverage | Connectivity | seeds |
|---|---|---|---|---|---|
| computing from the rendered rollouts… | |||||
The run list is grouped by arm, with the warm-started 32² child and its 16² parent adjacent under each. Both columns are computed live from the rendered rollouts; connectivity is the mean giant-component fraction over the rollout.