ainews.cx Sources

Simon Willison AI Filter

A quote from Anthropic

We used an automatic classifier which judged sycophancy by looking at whether Claude showed a willingness to push back, maintain positions when challenged, give praise proportional to the merit of …

Models

Models Anthropic Claude

Simon Willison AI Filter

Our evaluation of OpenAI’s GPT-5.5 cyber capabilities

The UK's AI Security Institute previously evaluated Claude Mythos: now they've evaluated GPT-5.5 for finding security vulnerability and found it to be comparable to Mythos, but unlike Mythos it's generally …

Models

Simon Willison AI Filter

Release: llm 0.32a1

Access large language models from the command-line

Models

Simon Willison AI Filter

LLM 0.32a0 is a major backwards-compatible refactor

I just released LLM 0.32a0, an alpha release of my LLM Python library and CLI tool for accessing LLMs, with some consequential changes that I’ve been working towards for quite …

Models