Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees Paper • 2311.08384 • Published Nov 14, 2023
Harnessing Density Ratios for Online Reinforcement Learning Paper • 2401.09681 • Published Jan 18, 2024