ainews.cx Sources

Simon Willison AI Filter

A quote from Anthropic

We used an automatic classifier which judged sycophancy by looking at whether Claude showed a willingness to push back, maintain positions when challenged, give praise proportional to the merit of …

Models

Models Anthropic Claude

Simon Willison AI Filter

Our evaluation of OpenAI’s GPT-5.5 cyber capabilities

The UK's AI Security Institute previously evaluated Claude Mythos: now they've evaluated GPT-5.5 for finding security vulnerability and found it to be comparable to Mythos, but unlike Mythos it's generally …

Models