Fact — claim — Knowledge Tree

Zilong Deng, Simon Khan, and Shaofeng Zou study the sample complexity of risk-sensitive Reinforcement Learning with a generative model, specifically focusing on maximizing the Conditional Value at Risk (CVaR) with a risk tolerance level tau at each step, a problem they name Iterated CVaR.

Authors

Person: Samuel Tesfazgi, Leonhard Sprandl, Sandra Hirche Organization: AISTATS
Track: Poster Session 3 - aistats 2026

Sources

Track: Poster Session 3 - aistats 2026 virtual.aistats.org Samuel Tesfazgi, Leonhard Sprandl, Sandra Hirche · AISTATS via serper

Referenced by nodes (2)

reinforcement learning concept
generative models concept