Stochastic Optimization Methods for Policy Evaluation in Reinforcement Learning

Name: Stochastic Optimization Methods for Policy Evaluation in Reinforcement Learning
Brand: now publishers Inc
SKU: 9781638283706
Price: 50.34 EUR
Availability: InStock


English

By (author): Shaocong Ma, Yi Zhou

This monograph introduces value-based approaches to the policy evaluation problem in online reinforcement learning (RL): learning the value function associated with a specific policy under a single Markov decision process (MDP). The approaches differ depending on whether they are implemented in an on-policy or off-policy manner. In the on-policy setting, where the policy is evaluated using data generated by that same policy, classical techniques such as TD(0), TD(λ), and their extensions with function approximation or variance reduction are employed. For off-policy evaluation, where samples are collected under a different behavior policy, the monograph introduces gradient-based two-timescale algorithms such as GTD2, TDC, and variance-reduced TDC, which are designed to minimize the mean-squared projected Bellman error (MSPBE). The monograph also discusses finite-sample convergence upper bounds and sample complexity for these algorithms.
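For readers unfamiliar with the algorithms named in the description, the following is a minimal Python sketch of one common textbook form of TDC with linear features and importance weights. It is not taken from the monograph. The five-state random walk, the one-hot features, the step sizes, and all function names (`tdc_update`, `evaluate`, `true_values`) are assumptions made purely for illustration. When the target and behavior policies coincide, the importance weight is 1 and the run is on-policy.

```python
import random

GAMMA = 1.0      # undiscounted episodic task (illustrative choice)
N_STATES = 5     # non-terminal states 0..4; stepping off either end terminates


def features(state):
    """One-hot features; a terminal state (None) maps to the zero vector."""
    phi = [0.0] * N_STATES
    if state is not None:
        phi[state] = 1.0
    return phi


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def step(state, action):
    """Random-walk dynamics: action is -1 (left) or +1 (right).
    Reward is 1 only when the walk exits on the right."""
    nxt = state + action
    if nxt < 0:
        return None, 0.0
    if nxt >= N_STATES:
        return None, 1.0
    return nxt, 0.0


def tdc_update(theta, w, phi, phi_next, reward, rho, alpha, beta):
    """One two-timescale TDC step (a common formulation, assumed here).
    theta: value-function weights (slow timescale, step size alpha)
    w:     auxiliary weights      (fast timescale, step size beta)
    rho:   importance weight pi(a|s) / b(a|s)"""
    delta = reward + GAMMA * dot(theta, phi_next) - dot(theta, phi)  # TD error
    w_phi = dot(w, phi)
    for i in range(len(theta)):
        # TD(0) direction plus the gradient-correction term
        theta[i] += alpha * rho * (delta * phi[i] - GAMMA * w_phi * phi_next[i])
        w[i] += beta * rho * (delta - w_phi) * phi[i]


def evaluate(p_right_target, p_right_behavior,
             episodes=5000, alpha=0.01, beta=0.05, seed=0):
    """Estimate the target policy's values from behavior-policy samples."""
    rng = random.Random(seed)
    theta = [0.0] * N_STATES
    w = [0.0] * N_STATES
    for _ in range(episodes):
        state = N_STATES // 2
        while state is not None:
            go_right = rng.random() < p_right_behavior
            action = 1 if go_right else -1
            if go_right:
                rho = p_right_target / p_right_behavior
            else:
                rho = (1 - p_right_target) / (1 - p_right_behavior)
            nxt, reward = step(state, action)
            tdc_update(theta, w, features(state), features(nxt),
                       reward, rho, alpha, beta)
            state = nxt
    return theta


def true_values(p_right):
    """Exact values: probability of exiting on the right (gambler's ruin)."""
    n = N_STATES + 1
    if p_right == 0.5:
        return [(s + 1) / n for s in range(N_STATES)]
    r = (1 - p_right) / p_right
    return [(1 - r ** (s + 1)) / (1 - r ** n) for s in range(N_STATES)]
```

Calling `evaluate(0.5, 0.5)` gives an on-policy run, while `evaluate(0.7, 0.5)` evaluates a right-leaning target policy from uniformly random behavior data; either result can be compared against `true_values` for the same target policy.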

€50.34 (list price €52.99, save 5%)


Delivery/Collection within 10-20 working days

Product Details

Weight: 94g
Dimensions: 156 x 234mm
Publication Date: 11 Jul 2024
Publisher: now publishers Inc
Publication City/Country: United States
Language: English
ISBN13: 9781638283706
