SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests

By Tyler Payne, Will Epperson, Safoora Yousefi, Zachary Huang, Gagan Bansal, Wenyue Hua, Maya Murad, Asli Celikyilmaz, Saleema Amershi

· Microsoft Research AI · May 11, 2026

Microsoft released SocialReasoning-Bench to test whether AI agents optimize for users’ interests during task execution.

Categories: Research

Excerpt

<p>Using SocialReasoning Bench, we observed a stable pattern across models—agents execute competently, but fail to consistently improve the user’s position, even with explicit instructions to optimize for user interest.</p> <p>The post <a href="https://www.microsoft.com/en-us/research/blog/socialreasoning-bench-measuring-whether-ai-agents-act-in-users-best-interests/">SocialReasoning-Bench: Measuring whether AI agents act in users’ best interests</a> appeared first on <a href="https://www.microsoft.com/en-us/research">Microsoft Research</a>.</p>

Read at source: https://www.microsoft.com/en-us/research/blog/socialreasoning-bench-measuring-whether-ai-agents-act-in-users-best-interests/