Cybersecurity analysis: GPT-5.5 reaches a similar level of performance as Mythos Preview and is the second model to solve a multi-step cyberattack simulation (AI Security Institute)
The AI Security Institute's evaluation finds that GPT-5.5 reaches a level of cyber performance similar to Mythos Preview's and is the second model to complete a multi-step cyberattack simulation.
Excerpt
<p><a href="https://www.aisi.gov.uk/">AI Security Institute</a>:<br />
<span style="font-size: 1.3em;"><b><a href="https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5-5-cyber-capabilities">Cybersecurity analysis: GPT-5.5 reaches a similar level of performance as Mythos Preview and is the second model to solve a multi-step cyberattack simulation</a></b></span> — In April, our evaluation of Anthropic's Claude Mythos Preview found that it represented a step up in cyber performance … </p>
Read at source: https://www.techmeme.com/260430/p36#a260430p36