CAIS Seminar: Sharan Vaswani
Title: A Systematic Framework for Designing Policy Gradient Methods for Reinforcement Learning
Abstract: Reinforcement learning (RL) studies sequential decision-making problems in which an agent interacts with an environment, receives feedback in the form of rewards, and aims to learn a policy that maximizes long-term performance. RL has found applications in medicine, industrial control, robotics, and, […]
