VIDEO DOI: https://doi.org/10.48448/q9x9-mc69
PAPER DOI: Reinforcement Learning, Learning, Safe Policy Improvement,

technical paper

AAMAS 2020

May 11, 2020

Live on Underline

Safe Policy Improvement with an Estimated Baseline Policy

Please log in to leave a comment

Downloads

SlidesTranscript English (automatic)

Next from AAMAS 2020

Viral Vs. Effective: Utility Based Influence Maximization
technical paper

Viral Vs. Effective: Utility Based Influence Maximization

AAMAS 2020

Noam HazonAmos AzariaYael Sabato
Yael Sabato and 2 other authors

11 May 2020

Similar lecture

OPT-GAN: A Broad-Spectrum Global Optimizer for Black-box Problems by Learning Distribution
poster

OPT-GAN: A Broad-Spectrum Global Optimizer for Black-box Problems by Learning Distribution

AAAI 2023

+4Minfang LuShuangrong LiuLin Wang
Lin Wang and 6 other authors

11 February 2023