Shielded Decision-Making in MDPs.

AllImages Shopping Videos Maps News Books

Safe Reinforcement Learning via Probabilistic Shields - arXiv

Jul 16, 2018 � This paper targets the efficient construction of a safety shield for decision making in scenarios that incorporate uncertainty.

(PDF) Shielded Decision-Making in MDPs - ResearchGate

www.researchgate.net › publication › 326459531_Shielded_Decision-Maki...

Jul 16, 2018 � We present the concept of a shield that forces decision-making to provably adhere to safety requirements with high probability. Our method�...

Shielded Decision-Making in MDPs - Graz University of Technology

graz.elsevierpure.com › publications › shielded-decision-making-in-mdps

Shielded Decision-Making in MDPs. / Jansen, Nils; K�nighofer, Bettina; Junges, Sebastian et al. 2018. (arXiv.org e-Print archive). ... Jansen N, K�nighofer B,�...

Scholarly articles for Shielded Decision-Making in MDPs.

scholar.google.com › citations

Shielded decision-making in MDPs
Jansen � Cited by 53

Safe reinforcement learning via probabilistic shields
Jansen � Cited by 11

Safe reinforcement learning using probabilistic shields
Jansen � Cited by 113

Shielded Decision-Making in MDPs - Semantic Scholar

www.semanticscholar.org › paper › Shielded-Decision-Making-in-MDPs-Ja...

This work presents the concept of a shield that forces decision-making to provably adhere to safety requirements with high probability, and presents a�...

[PDF] Safe Reinforcement Learning Using Probabilistic Shields - DROPS

drops.dagstuhl.de › storage › LIPIcs.CONCUR.2020.3.pdf

We introduce the concept of a probabilistic shield that enables RL decision-making to adhere to safety constraints with high probability. We employ formal�...

[PDF] Safe Reinforcement Learning via Shielding under Partial Observability

ojs.aaai.org › index.php › AAAI › article › view

We furthermore show that a shield can be used to bootstrap state-of-the-art RL agents: they re- main safe after initial learning in a shielded setting, allowing.

Safe Reinforcement Learning Using Probabilistic Shields

graz.elsevierpure.com › publications › safe-reinforcement-learning-using-p...

We introduce the concept of a probabilistic shield that enables RL decision-making to adhere to safety constraints with high probability. We employ formal�...

Missing: Shielded | Show results with:Shielded

Online shielding for reinforcement learning - SpringerLink

link.springer.com › Innovations in Systems and Software Engineering

Sep 23, 2022 � We propose an approach for online safety shielding of RL agents. During runtime, the shield analyses the safety of each available action.

[PDF] Shielding in Resource-Constrained Goal POMDPs

ojs.aaai.org › index.php › AAAI › article › view

Shields are typically computed via formal methods approaches, and hence they can guarantee that the shielded algorithm satisfies the desired safety�...

[PDF] Shield Synthesis for Reinforcement Learning - Radboud Repository

repository.ubn.ru.nl › bitstream › handle

A shield prevents any wrong decisions from the agent. Shields fulfill the following requirements: 1. Guaranteed correctness: If the sequential decision-making�...