Constrained Reinforcement Learning with Average Reward Objective

Bog

Format
Bog, paperback
Engelsk
118 sider

Indgår i serie
Foundations and Trends R in Optimization

Normalpris: kr. 899,95

Medlemspris: kr. 789,95 For at købe bogen til medlemspris skal du have et medlemskab med Shopping-fordele. Du kan prøve medlemskabet gratis i 7 dage. Medlemskabet fornyes automatisk og kan altid opsiges.

Leveringstid: 7-9 Hverdage (Sendes fra fjernlager)
Forventet levering: 25-11-2024
Kan pakkes ind og sendes som gave
Split betalingen op med

Beskrivelse

Reinforcement Learning (RL) serves as a versatile framework for sequential decision-making, finding applications across diverse domains such as robotics, autonomous driving, recommendation systems, supply chain optimization, biology, mechanics, and finance. The primary objective of these applications is to maximize the average reward. Real-world scenarios often necessitate adherence to specific constraints during the learning process.

This monograph focuses on the exploration of various model-based and model-free approaches for Constrained RL within the context of average reward Markov Decision Processes (MDPs). The investigation commences with an examination of model-based strategies, delving into two foundational methods - optimism in the face of uncertainty and posterior sampling. Subsequently, the discussion transitions to parametrized model-free approaches, where the primal dual policy gradient-based algorithm is explored as a solution for constrained MDPs.

The monograph provides regret guarantees and analyzes constraint violation for each of the discussed setups. For the above exploration, the authors assume the underlying MDP to be ergodic. Further, this monograph extends its discussion to encompass results tailored for weakly communicating MDPs, thereby broadening the scope of its findings and their relevance to a wider range of practical scenarios.

Læs hele beskrivelsen

Detaljer

SprogEngelsk
Sidetal118
Udgivelsesdato21-08-2024
ISBN139781638283966
Forlag Now Publishers Inc
FormatPaperback

Størrelse og vægt

Vægt193 g

Dybde0,7 cm

10 cm

15,6 cm

23,4 cm

Constrained Reinforcement Learning with Average Reward Objective

Findes i disse kategorier...