Inference-Time Policy Steering through Human Interactions
Paper • 2411.16627 • Published • 3
Pre-trained Diffusion Policy checkpoint used in Inference-Time Policy Steering through Human Interactions (paper, project page, code).
The model was trained on the D4RL Maze2D dataset and is intended to be loaded with the LeRobot policy classes.
Clone the inference repo, then load this checkpoint directly from the Hub:
git clone https://github.com/yanweiw/itps.git && cd itps
pip install -e .
python interact_maze2d.py -p dp --hf
Or load it programmatically:
from itps.common.policies.diffusion.modeling_diffusion import DiffusionPolicy
policy = DiffusionPolicy.from_pretrained("felixw/itps-dp")
policy.eval()
@article{wang2024itps,
title={Inference-Time Policy Steering through Human Interactions},
author={Wang, Yanwei and others},
journal={arXiv preprint arXiv:2411.16627},
year={2024}
}
MIT — see LICENSE.