EA Play FIFA 23 F1™ 22 Madden NFL 23 Apex Legends Battlefield™ 2042 The Sims 4 Electronic Arts Home Electronics Arts Home Latest Games Coming Soon Free-To-Play EA SPORTS EA Originals Games Library EA app Deals PC PlayStation Xbox Nintendo Switch Mobile Pogo The EA app EA Play Playtesting Company Careers News Technology EA Studios EA Partners Our Commitments Positive Play People & Inclusive Culture Social Impact Environment Help EA Community Forums Player and Parental Tools Accessibility Press Investors Latest Games Coming Soon Free-To-Play EA SPORTS EA Originals Games Library EA app Deals PC PlayStation Xbox Nintendo Switch Mobile Pogo The EA app EA Play Playtesting Company Careers News Technology EA Studios EA Partners Our Commitments Positive Play People & Inclusive Culture Social Impact Environment Help EA Community Forums Player and Parental Tools Accessibility Press Investors

Multi-Critic Actor Learning: Teaching RL Policies to Act with Style

Authors: Siddharth Mysore, George Cheng, Yunqi Zhao, Kate Saenko, Meng Wu

Publication Date: 2022

Published in: International Conference on Learning Representations, 2022

Publication link: https://openreview.net/forum?id=rJvY_5OzoI

Abstract: Using a single value function (critic) shared over multiple tasks in Actor-Critic multi-task reinforcement learning (MTRL) can result in negative interference between tasks, which can compromise learning performance. Multi-Critic Actor Learning (MultiCriticAL) proposes instead maintaining separate critics for each task being trained while training a single multi-task actor. Explicitly distinguishing between tasks also eliminates the need for critics to learn to do so and mitigates interference between task-value estimates. MultiCriticAL is tested in the context of multi-style learning, a special case of MTRL where agents are trained to behave with different distinct behavior styles, and yields up to 56% performance gains over the single-critic baselines and even successfully learns behavior styles in cases where single-critic approaches may simply fail to learn. In a simulated real-world use case, MultiCriticAL enables learning policies that smoothly transition between multiple fighting styles on an experimental build of EA’s UFC game.

Related News

Procedural Terrain in EA SPORTS PGA Tour

EA Technology
May 15, 2023
How Procedural Tools are Reshaping Terrain Workflows, empowered by the Frostbite Engine. Combining non-destructive workflows with LiDAR 3D scans from drones.

Creating Need for Speed: Unbound's Signature Style

EA Technology
Apr 21, 2023
Adonis Stevenson, VFX Director at Criterion Games, explores the stylized VFX of Need for Speed: Unbound in his GDC 2023 talk.

Frostbite Presents at GDC 2023

EA Technology
Apr 19, 2023
See Frostbite's powerful terrain tools in action, learn about feedback-driven development methods, and more from this year's GDC talks.