Overview of the RIFT: Building on the IL pre-trained model, RIFT first performs route-level interaction analysis to identify critical background vehicles and their associated reference lines, enabling the generation of realistic and multimodal trajectories. To isolate style-level controllability from the trajectory-level realism and route-level controllability established during pre-training, only the scoring head is fine-tuned via RIFT, with the remaining components kept frozen. Specifically, RIFT computes group-relative advantages over all candidate rollouts, promoting alignment with user-preferred styles and mitigating covariate shift through RL fine-tuning.
The AV-centric traffic simulation consists of the autonomoud vehicle(AV, implemented as PDM-Lite), Critical Background Vehicles (CBVs), and background vehicles (BVs), where the AV follows a predefined global route and the CBVs may interact with it at route level.
RIFT consistently outperforms all baselines in both aspects across most settings. While supervised learning methods achieve slightly lower CPK and ORR, this improvement is primarily due to their inherently conservative behavior, derived from the expert PDM-Lite, which prioritizes safety by avoiding risky maneuvers.
RIFT produces realistic and well-structured scenarios that are effective at exposing the limitations of modern AV systems.
RIFT demonstrates higher average speed and acceleration, indicating more interactive behavior, while maintaining realistic motion profiles.
@article{chen2025riftclosedlooprlfinetuning,
title={RIFT: Closed-Loop RL Fine-Tuning for Realistic and Controllable Traffic Simulation},
author={Keyu Chen and Wenchao Sun and Hao Cheng and Sifa Zheng},
journal={arXiv preprint arXiv:2505.03344},
year={2025}
}