Cybersecurity analysis: GPT-5.5 reaches a similar level of performance as Mythos Preview and is the second model to solve a multi-step cyberattack simulation (AI Security Institute)

Techmeme ·

AI Security Institute's evaluation finds GPT-5.5 matches Mythos Preview in cyber capabilities, becoming the second model to complete a multi-step cyberattack simulation.

Categories: Model Releases, Research

Excerpt

<a href="https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5-5-cyber-capabilities"><img align="RIGHT" border="0" hspace="4" src="http://www.techmeme.com/260430/i36.jpg" vspace="4" /></a> <p><a href="https://www.techmeme.com/260430/p36#a260430p36" title="Techmeme permalink"><img height="12" src="http://www.techmeme.com/img/pml.png" style="border: none; padding: 0; margin: 0;" width="11" /></a> <a href="https://www.aisi.gov.uk/">AI Security Institute</a>:<br /> <span style="font-size: 1.3em;"><b><a href="https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5-5-cyber-capabilities">Cybersecurity analysis: GPT-5.5 reaches a similar level of performance as Mythos Preview and is the second model to solve a multi-step cyberattack simulation</a></b></span>&nbsp; &mdash;&nbsp; In April, our evaluation of Anthropic's Claude Mythos Preview found that it represented a step up in cyber performance &hellip; </p>