Google
Jul 16, 2018This paper targets the efficient construction of a safety shield for decision making in scenarios that incorporate uncertainty.
Jul 16, 2018We present the concept of a shield that forces decision-making to provably adhere to safety requirements with high probability. Our method�...
Shielded Decision-Making in MDPs. / Jansen, Nils; K�nighofer, Bettina; Junges, Sebastian et al. 2018. (arXiv.org e-Print archive). ... Jansen N, K�nighofer B,�...
This work presents the concept of a shield that forces decision-making to provably adhere to safety requirements with high probability, and presents a�...
We introduce the concept of a probabilistic shield that enables RL decision-making to adhere to safety constraints with high probability. We employ formal�...
People also ask
We furthermore show that a shield can be used to bootstrap state-of-the-art RL agents: they re- main safe after initial learning in a shielded setting, allowing.
We introduce the concept of a probabilistic shield that enables RL decision-making to adhere to safety constraints with high probability. We employ formal�...
Missing: Shielded | Show results with:Shielded
Sep 23, 2022We propose an approach for online safety shielding of RL agents. During runtime, the shield analyses the safety of each available action.
Shields are typically computed via formal methods approaches, and hence they can guarantee that the shielded algorithm satisfies the desired safety�...
A shield prevents any wrong decisions from the agent. Shields fulfill the following requirements: 1. Guaranteed correctness: If the sequential decision-making�...