Abstract—We present generative predictive control (GPC), a framework for inference-time enhancement of pretrained behavior-cloning policies. Rather than retraining or fine-tuning, GPC augments a frozen diffusion policy at deployment by coupling it with a predictive world model. Concretely, we train an action-conditioned world model on expert demonstrations and random exploration rollouts to forecast the consequences of action proposals produced by the diffusion policy, then perform lightweight online planning that ranks and refines these proposals via model-based look-ahead. This combination of a generative prior with predictive foresight enables test-time adaptation. Across diverse robotic manipulation tasks—state- and vision-based, in simulation and on real hardware—GPC consistently outperforms standard behavior cloning and compares favorably to other inference-time adaptation baselines.
Keywords: world model; predictive world modeling; diffusion policy; behavior cloning; inference-time adaptation; model-based planning; online planning.
World Model Prediction in GPC-RANK.
World Model Prediction in GPC-OPT.
World Model Prediction in GPC-RANK.
World Model Prediction in GPC-OPT.
World Model Prediction in GPC-RANK.
World Model Prediction in GPC-OPT.
Baseline Test 0: Success.
GPC-RANK Test 0: Success.
GPC-OPT Test 0: Success.
Baseline Test 1: Failure.
GPC-RANK Test 1: Success.
GPC-OPT Test 1: Success.
Baseline Test 2: Success.
GPC-RANK Test 2: Success.
GPC-OPT Test 2: Success.
Baseline Test 3: Failure.
GPC-RANK Test 3: Failure.
GPC-OPT Test 3: Success.
Baseline Test 4: Failure.
GPC-RANK Test 4: Success.
GPC-OPT Test 4: Success.
Baseline Test 5: Failure.
GPC-RANK Test 5: Success.
GPC-OPT Test 5: Failure.
Baseline Test 6: Success.
GPC-RANK Test 6: Failure.
GPC-OPT Test 6: Failure.
Baseline Test 7: Failure.
GPC-RANK Test 7: Failure.
GPC-OPT Test 7: Success.
Baseline Test 8: Success.
GPC-RANK Test 8: Success.
GPC-OPT Test 8: Success.
Baseline Test 9: Success.
GPC-RANK Test 9: Success.
GPC-OPT Test 9: Failure.
Baseline Test 0: Failure.
GPC-RANK Test 0: Success.
GPC-OPT Test 0: Success.
Baseline Test 1: Success.
GPC-RANK Test 1: Success.
GPC-OPT Test 1: Success.
Baseline Test 2: Failure.
GPC-RANK Test 2: Success.
GPC-OPT Test 2: Failure.
Baseline Test 3: Failure.
GPC-RANK Test 3: Failure.
GPC-OPT Test 3: Failure.
Baseline Test 4: Success.
GPC-RANK Test 4: Success.
GPC-OPT Test 4: Success.
Baseline Test 0: Success.
GPC-RANK Test 0: Success.
GPC-OPT Test 0: Success.
Baseline Test 1: Success.
GPC-RANK Test 1: Success.
GPC-OPT Test 1: Success.
Baseline Test 2: Failure.
GPC-RANK Test 2: Success.
GPC-OPT Test 2: Success.
Baseline Test 3: Failure.
GPC-RANK Test 3: Failure.
GPC-OPT Test 3: Failure.
Baseline Test 4: Failure.
GPC-RANK Test 4: Failure.
GPC-OPT Test 4: Success.
Baseline Test 0: Success.
GPC-RANK Test 0: Success.
GPC-OPT Test 0: Success.
Baseline Test 1: Success.
GPC-RANK Test 1: Success.
GPC-OPT Test 1: Success.
Baseline Test 2: Failure.
GPC-RANK Test 2: Success.
GPC-OPT Test 2: Success.
Baseline Test 0: Failure.
GPC-RANK Test 0: Success.
Baseline Test 1: Success.
GPC-RANK Test 1: Success.
Baseline Test 2: Failure.
GPC-RANK Test 2: Success.
Baseline Test 3: Failure.
GPC-RANK Test 3: Success.
Baseline Test 4: Failure.
GPC-RANK Test 4: Failure.
Baseline Test 5: Failure.
GPC-RANK Test 5: Success.
Baseline Test 6: Success.
GPC-RANK Test 6: Success.
Baseline Test 7: Success.
GPC-RANK Test 7: Success.
Baseline Test 8: Failure.
GPC-RANK Test 8: Failure.
Baseline Test 9: Failure.
GPC-RANK Test 9: Failure.
@article{qi25ral-gpc,
title={Inference-Time Enhancement of Generative Robot Policies via Predictive World Modeling},
note={Previously titled: Strengthening Generative Robot Policies through Predictive World Modeling},
author={Qi, Han and Yin, Haocheng and Zhu, Aris and Du, Yilun and Yang, Heng},
journal={IEEE Robotics and Automation Letters (RAL)},
year={2025}
}