AI Safety & Ethics

Alignment, bias, regulation, and responsible AI


Top This Week

AI Safety

NHS staff resist using Palantir software. Staff reportedly cite ethics concerns, privacy worries, and doubt the platform adds much

Submitted by /u/esporx

Reddit - Artificial Intelligence · 1 min · 2 days ago

Machine Learning

AI assistants are optimized to seem helpful. That is not the same thing as being helpful.

RLHF trains models on human feedback. Humans rate responses they like. And it turns out humans consistently rate confident, fluent, agree...

Reddit - Artificial Intelligence · 1 min · 2 days ago

Computer Vision

House Democrat Questions Anthropic on AI Safety After Source Code Leak

Rep. Josh Gottheimer, who is generally tough on China, just sent a letter to Anthropic questioning their decision to reduce certain safet...

Reddit - Artificial Intelligence · 1 min · 2 days ago

All Content

LLMs

Trump team livid about Dario Amodei's principled stand to keep the DOD from using Claude for war

Anthropic's $200 million Defense Department contract is at risk after concerns were raised about the use of its AI model, Claude, in mili...

AI Tools & Products · 9 min · about 1 month ago

AI Startups

[D] Anyone have experience with Augure AI?

A Reddit user seeks insights on Augure AI, a Toronto-based company, particularly regarding its data security capabilities for healthcare ...

Reddit - Machine Learning · 1 min · about 1 month ago

AI Safety

If you’ve had some unexplainable things happen with your AI, we would love for you to discuss them in our Discord. It’s a safe space.

Join a supportive community on Discord to discuss unexplainable AI experiences. Share your stories and connect with others in a safe envi...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Machine Learning

Killing the Basilisk: A Post-Transformer Analysis of the Roko Problem | by Justin Dew | Feb, 2026

This article explores the implications of Roko's Basilisk in the context of post-Transformer AI developments, analyzing its philosophical...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Generative AI

Fake faces generated by AI are now "too good to be true," researchers warn

Researchers highlight the increasing realism of AI-generated faces, warning that they may soon be indistinguishable from real images, rai...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

AI Infrastructure

Microsoft’s new gaming CEO vows not to flood the ecosystem with ‘endless AI slop’ | TechCrunch

Microsoft's new gaming CEO, Asha Sharma, emphasizes a commitment to quality in gaming, rejecting the idea of overusing AI in game develop...

TechCrunch - AI · 3 min · about 1 month ago

AI Safety

Civil rights for AI?

The article discusses the concept of civil rights for AI, exploring ethical considerations and implications for the future of artificial ...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

AI Safety

Lawyer says Google shut down his Gmail, Voice and Photos after NotebookLM upload

A lawyer claims that Google suspended his Gmail, Voice, and Photos accounts after he uploaded documents related to sensitive public recor...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

LLMs

Suspect in Tumbler Ridge school shooting described violent scenarios to ChatGPT | The Verge

The article discusses the Tumbler Ridge school shooting suspect's alarming interactions with ChatGPT, which raised concerns at OpenAI but...

The Verge - AI · 3 min · about 1 month ago

LLMs

OpenAI debated calling police about suspected Canadian shooter's chats | TechCrunch

OpenAI faced a dilemma over whether to alert police about alarming chats from a suspected Canadian shooter, highlighting the challenges o...

TechCrunch - AI · 4 min · about 1 month ago

AI Safety

How can a government actually stop or control AI?

The article discusses the challenges governments face in controlling AI, highlighting the limitations of legal and technical measures in ...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

AI Safety

You are the Product

The article discusses the concept that users of free AI services are often the product, emphasizing the implications for privacy and data...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

AI Safety

Congress—Not the Pentagon or Anthropic—Should Set Military AI Rules

The article argues that Congress, not the Pentagon or private companies like Anthropic, should establish regulations for military AI use,...

AI Tools & Products · 8 min · about 1 month ago

AI Agents

The AI-Panic Cycle—And What’s Actually Different Now

The article discusses the current AI landscape, focusing on the emergence of coding agents and the resulting anxiety among industry insid...

AI Tools & Products · 38 min · about 1 month ago

AI Safety

In the News: Manjeet Rege on AI at the Minnesota State Capitol

Dr. Manjeet Rege discusses the role of AI in enhancing security at the Minnesota State Capitol, emphasizing its integration with human ju...

AI Tools & Products · 1 min · about 1 month ago

LLMs

Anthropic rolls out embedded security scanning for Claude

Anthropic introduces Claude Code Security, a new feature that scans AI-generated code for vulnerabilities and suggests patching solutions...

AI Tools & Products · 5 min · about 1 month ago

AI Agents

AI Reviews and Lab Tests

This article reviews various AI tools, highlighting their features, pros, and cons, to help users choose the best options for their needs.

AI Tools & Products · 13 min · about 1 month ago

LLMs

I fact-checked the "AI Moats are Dead" Substack article. It was AI-generated and got its own facts wrong.

The article critiques a Substack post claiming AI models lack moats, revealing it as AI-generated and factually incorrect, despite some v...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

LLMs

Godfather of AI Yann LeCun says: Yes, LLMs may be passing Maths Olympiads and bar exams, but they will still fail in …

Yann LeCun, a prominent AI researcher, discusses the limitations of large language models (LLMs) despite their successes in exams, emphas...

AI News - General · 10 min · about 1 month ago

AI Safety

Anthropic-funded group backs candidate attacked by rival AI super PAC | TechCrunch

The article discusses the political dynamics surrounding Alex Bores, a New York congressional candidate, who is supported by a PAC funded...

TechCrunch - AI · 3 min · about 1 month ago

