Shields to Guarantee Probabilistic Safety in MDPs

By Linus Heck, Filip Macák, Roman Andriushchenko, Milan Češka, Sebastian Junges

· ArXiv · AI/CL/LG · May 11, 2026

Shields to Guarantee Probabilistic Safety extends classical shielding frameworks to probabilistic safety, proving impossibility of strong guarantees and providing offline/online shield constructions.

Categories: Research

Excerpt

Shielding is a prominent model-based technique to ensure safety of autonomous agents. Classical shielding aims to ensure that nothing bad ever happens and comes with strong guarantees about safety and maximal permissiveness. However, shielding systems for probabilistic safety, where something bad is allowed to happen with an acceptable probability, has proven to be more intricate. This paper presents a formal framework that conservatively extends classical shields to probabilistic safety. In this framework, we (i) demonstrate the impossibility of preserving the strong guarantees on safety and permissiveness, (ii) provide natural shields with weaker guarantees, and (iii) introduce offline and online shield constructions ensuring strong safety guarantees. The empirical evaluation highlights the practical advantages of the new shields, as well as their computational feasibility.

Read at source: https://arxiv.org/abs/2605.10888v1