Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models
Summary
Anthropic accuses three Chinese AI labs of conducting distillation attacks on its Claude chatbot, claiming they illicitly extracted capabilities to enhance their own models.
Why It Matters
This issue highlights the growing concerns around AI model security and ethical practices in the industry. Distillation attacks can undermine the integrity of AI systems, leading to potential misuse of technology. As AI companies face competitive pressures, understanding and addressing these threats is crucial for maintaining trust and safety in AI development.
Key Takeaways
- Anthropic claims three Chinese AI firms conducted distillation attacks on its Claude chatbot.
- Distillation attacks involve using responses from powerful models to enhance less capable ones.
- Anthropic linked the attacks to specific companies using IP address correlation and metadata.
- The company plans to upgrade its systems to mitigate future distillation attacks.
- This incident reflects ongoing tensions in the AI industry regarding model security and ethical standards.
Anthropic is issuing a call to action against AI "distillation attacks" after accusing three AI companies of misusing its Claude chatbot. On its website, Anthropic claimed that DeepSeek, Moonshot and MiniMax have been conducting "industrial-scale campaigns…to illicitly extract Claude’s capabilities to improve their own models."

Distillation in the AI world refers to a less capable model training itself on the responses of a more powerful one. While distillation isn't a bad thing across the board, Anthropic said these types of attacks can be used in a more nefarious way. According to Anthropic, the three Chinese AI firms were responsible for more than "16 million exchanges with Claude through approximately 24,000 fraudulent accounts." From Anthropic's perspective, the competing companies were using Claude as a shortcut to develop more advanced AI models, which could also let them circumvent certain safeguards.

Anthropic said in its post that it was able to link each of these distillation attack campaigns to a specific company with "high confidence" thanks to IP address correlation, request metadata and infrastructure indicators, along with corroboration from others in the AI industry who have noticed similar behavior.

Early last year, OpenAI made similar claims of rival firms distilling its models and banned suspected accounts in response. As for Anthropic, the company behind Claude said it would upgrade its systems to mitigate future distillation attacks.
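To make the mechanism concrete: distillation, at its simplest, means querying a stronger "teacher" model and turning its answers into supervised training data for a weaker "student" model. The sketch below is a minimal illustration of that loop; `teacher_respond` is a hypothetical stand-in for calling a real model API, not any actual Claude or vendor endpoint.

```python
def teacher_respond(prompt: str) -> str:
    """Hypothetical stand-in for querying a powerful 'teacher' model.

    In a real distillation pipeline this would be an API call to the
    stronger model whose capabilities are being extracted.
    """
    return f"Detailed answer to: {prompt}"


def build_distillation_dataset(prompts: list[str]) -> list[tuple[str, str]]:
    """Collect (prompt, teacher response) pairs.

    Each pair becomes one supervised fine-tuning example for the
    smaller 'student' model.
    """
    return [(p, teacher_respond(p)) for p in prompts]


prompts = ["Explain photosynthesis", "Summarize the French Revolution"]
dataset = build_distillation_dataset(prompts)
```

Done at the "industrial scale" Anthropic describes (millions of exchanges across thousands of accounts), this loop lets a student model absorb much of a teacher's behavior without the cost of training it from scratch, which is why providers treat it as a terms-of-service violation rather than ordinary usage.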