[ECCV 2026] An official implementation of Correlation-Weighted Multi-Reward Optimization for Compositional Generation
-
Updated
Jul 3, 2026 - Python
[ECCV 2026] An official implementation of Correlation-Weighted Multi-Reward Optimization for Compositional Generation
This repository hosts the code accompanying the NeurIPS24 paper "Multiple Rewards Best Policy Identification". Our study investigates the exploration problem in Reinforcement Learning (RL) in presence of multiple rewards.
Add a description, image, and links to the multi-reward topic page so that developers can more easily learn about it.
To associate your repository with the multi-reward topic, visit your repo's landing page and select "manage topics."