Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms | Agenda Bookshop Skip to content
Black Friday Sale Now On! | Buy 3 Get 1 Free on all books | Instore & Online.
Black Friday Sale Now On! | Buy 3 Get 1 Free on all books | Instore & Online.
A01=Qinbo Bai
A01=Vaneet Aggarwal
A01=Washim Uddin Mondal
Age Group_Uncategorized
Age Group_Uncategorized
Author_Qinbo Bai
Author_Vaneet Aggarwal
Author_Washim Uddin Mondal
automatic-update
Category1=Non-Fiction
Category=THR
COP=United States
Delivery_Delivery within 10-20 working days
Language_English
PA=Available
Price_€50 to €100
PS=Active
softlaunch

Constrained Reinforcement Learning with Average Reward Objective: Model-Based and Model-Free Algorithms

Reinforcement Learning (RL) serves as a versatile framework for sequential decision-making, finding applications across diverse domains such as robotics, autonomous driving, recommendation systems, supply chain optimization, biology, mechanics, and finance. The primary objective of these applications is to maximize the average reward. Real-world scenarios often necessitate adherence to specific constraints during the learning process. This monograph focuses on the exploration of various model-based and model-free approaches for Constrained RL within the context of average reward Markov Decision Processes (MDPs). The investigation commences with an examination of model-based strategies, delving into two foundational methods optimism in the face of uncertainty and posterior sampling. Subsequently, the discussion transitions to parametrized model-free approaches, where the primal dual policy gradient-based algorithm is explored as a solution for constrained MDPs. The monograph provides regret guarantees and analyzes constraint violation for each of the discussed setups. For the above exploration, the authors assume the underlying MDP to be ergodic. Further, this monograph extends its discussion to encompass results tailored for weakly communicating MDPs, thereby broadening the scope of its findings and their relevance to a wider range of practical scenarios. See more
Current price €73.14
Original price €76.99
Save 5%
A01=Qinbo BaiA01=Vaneet AggarwalA01=Washim Uddin MondalAge Group_UncategorizedAuthor_Qinbo BaiAuthor_Vaneet AggarwalAuthor_Washim Uddin Mondalautomatic-updateCategory1=Non-FictionCategory=THRCOP=United StatesDelivery_Delivery within 10-20 working daysLanguage_EnglishPA=AvailablePrice_€50 to €100PS=Activesoftlaunch
Delivery/Collection within 10-20 working days
Product Details
  • Weight: 176g
  • Dimensions: 156 x 234mm
  • Publication Date: 21 Aug 2024
  • Publisher: now publishers Inc
  • Publication City/Country: United States
  • Language: English
  • ISBN13: 9781638283966

Customer Reviews

Be the first to write a review
0%
(0)
0%
(0)
0%
(0)
0%
(0)
0%
(0)
We use cookies to ensure that we give you the best experience on our website. If you continue we'll assume that you are understand this. Learn more
Accept