RQ1: Does KIL improve generalization over alternative representations?

One representative rollout per condition (in-distribution, novel objects, scene variations) for each method and task. Methods: RGB, S²-Diffusion, KIL (IM), KIL (IN), KIL (T).