Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics

Ming-Hong Chen1*   Kuan-Chen Pan1*   You-De Huang1*   Xi Liu2   Ping-Chun Hsieh1
1 National Yang Ming Chiao Tung University, Hsinchu, Taiwan
2 Applied Machine Learning, Meta AI, Menlo Park, CA, USA
* These authors contributed equally to this work.
Code Paper

The project page is coming soon.

Citation

@inproceedings{
chen2026cross,
title={Cross-domain policy optimization via bellman consistency and hybrid critics},
author={Ming-Hong, Chen and Kuan-Chen, Pan and You-De, Huang and Xi, Liu and Ping-Chun, Hsieh},
booktitle={The Fourteenth International Conference on Learning Representations},
year={2026},
url={https://openreview.net/forum?id=kTXRFtWHnM}
}