7.2. Hypothesis 2: Fast Behavior Authoring
To support our second hypothesis, we’ll show how quickly new behaviors can be authored on the real robot using the runtime edit-test loop, rather than redeploy, restart, or retrain workflows.
In Figure 7.20, we summarize measured from-scratch authoring sessions on real hardware. Each row shows normalized active authoring time on a log-second axis with internal milestone structure recovered from screen recordings. Checkmarks mark first autonomous success. Gaps between daily sessions are excluded for the multi-day Alex bring-up.

The subsections below walk through each row in Figure 7.20 with a representative still, the authoring-time table, and a link to the screen recording.
7.2.1. Scratch Right Push Opening (Nadia 2023)
This row is our earliest measured scratch door opening on Nadia, at 33 minutes. However, the behavior used ArUco marker perception, a squared up approach, and teleoperation for the final traversal steps. The session is shown in Figure 7.21. Authoring times are presented in Table 7.18.
Table 7.18 is our earliest measured scratch door session at 33 minutes. The session used ArUco perception and teleoperated traversal steps rather than full autonomy. It still documents runtime editing on the robot, but it should not be directly compared to later sessions that produced fully autonomous behaviors.
7.2.2. Right Pull Opening Loop (Unitree H1-2, 2026-01-02)
On January 2, 2026, we authored a looping standing right pull lever handle opening behavior on the Unitree H1-2 from an empty behavior tree. The session reached first autonomous looping execution in 31 minutes and 43 seconds of active authoring time. The progression is shown in Figure 7.22. Authoring times are presented in Table 7.19.
The first autonomous looping opening came at 31 minutes 43 seconds (Table 7.19). The goto loop at 31:27 immediately began the 32/32 repeated opening test in Figure 7.17. This session shows the diagnose, edit, and retest loop in one sitting, which is what hypothesis 2 claims relative to redeploy or retrain workflows.
7.2.3. First Right Pull Door Bring Up (Alex, 2026-01-20 to 24)
From January 20 through 24, 2026, we brought up the first full Alex pull lever door traversal from scratch. The measured active authoring total is 11 hours, 10 minutes, and 17 seconds. That total is cumulative across five daily sessions and excludes inactive gaps between sessions. A frame from the bring up period is shown in Figure 7.23. Authoring times are presented in Table 7.20.
The 11 hour 10 minute total spans five days of incremental trial-and-error authoring (Table 7.20). Major milestones include the first open from a staggered stance on day 3 and the hold open fallback added on day 5 after force related trouble. These Alex authoring sessions show the cost of bringing up the first door traversal behavior on a new hardware platform.
7.2.4. Scratch Left Push Door (Alex, 2026-02-22)
On February 22, 2026, we authored a left push door traversal on Alex from an empty Left Push Door sequence to first fully automatic success in 1 hour, 59 minutes, and 48 seconds of active authoring.
This is the measured scratch case used in the comparative authoring process figure.
The session is shown in Figure 7.24.
Authoring times are presented in Table 7.21.
This is our measured fully autonomous scratch case for literature comparison: 1 hour 59 minutes 48 seconds from empty sequence to first automatic success (Table 7.21). Failure at 1:50:11 from arm task space error was fixed in about 9 minutes by bringing the right arm in using jointspace for traversal. That short edit and retest loop is the kind of workflow Figure 7.31 contrasts against multi day retraining pipelines.
Across the four rows in Figure 7.20, measured active authoring ranged from 30 minutes to 11 hours. The 33 minute session from 2023 on Nadia resulted in a behavior that was only partially autonomous, while the February 22, 2026 Alex left push row is the fairest fully autonomous scratch comparison at just under 2 hours. Each session documented failure diagnosis, logic edits, and retesting on the robot without a redeploy or retraining cycle.