Published in International Conference on Learning Representations, 2022
Using a single value function (critic) shared over multiple tasks in Actor-Critic multi-task reinforcement learning (MTRL) can result in negative interference between tasks, which can compromise learning performance. Multi-Critic Actor Learning (MultiCriticAL) proposes instead maintaining separate critics for each task being trained while training a single multi-task actor.
Recommended citation: Mysore, S., Cheng, G., Zhao, Y., Saenko, K, & Wu, M. "Multi-Critic Actor Learning: Teaching RL Policies to Act with Style". International Conference on Learning Representations https://openreview.net/forum?id=rJvY_5OzoI