AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

Top This Week

Ai Safety

NHS staff resist using Palantir software. Staff reportedly cite ethics concerns, privacy worries, and doubt the platform adds much

submitted by /u/esporx [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

AI assistants are optimized to seem helpful. That is not the same thing as being helpful.

RLHF trains models on human feedback. Humans rate responses they like. And it turns out humans consistently rate confident, fluent, agree...

Reddit - Artificial Intelligence · 1 min ·
Computer Vision

House Democrat Questions Anthropic on AI Safety After Source Code Leak

Rep. Josh Gottheimer, who is generally tough on China, just sent a letter to Anthropic questioning their decision to reduce certain safet...

Reddit - Artificial Intelligence · 1 min ·

All Content

Trump team livid about Dario Amodei's principled stand to keep the DOD from using Claude for war
Llms

Trump team livid about Dario Amodei's principled stand to keep the DOD from using Claude for war

Anthropic's $200 million Defense Department contract is at risk after concerns were raised about the use of its AI model, Claude, in mili...

AI Tools & Products · 9 min ·
Ai Startups

[D] Anyone have experience with Augure AI?

A Reddit user seeks insights on Augure AI, a Toronto-based company, particularly regarding its data security capabilities for healthcare ...

Reddit - Machine Learning · 1 min ·
Ai Safety

If you’ve had some unexplainable things happen with your AI, we would love for you to discuss them in our discord. It’s a safe space..

Join a supportive community on Discord to discuss unexplainable AI experiences. Share your stories and connect with others in a safe envi...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Killing the Basilisk: A Post-Transformer Analysis of the Roko Problem | by Justin Dew | Feb, 2026

This article explores the implications of Roko's Basilisk in the context of post-Transformer AI developments, analyzing its philosophical...

Reddit - Artificial Intelligence · 1 min ·
Generative Ai

Fake faces generated by AI are now "too good to be true," researchers warn

Researchers highlight the increasing realism of AI-generated faces, warning that they may soon be indistinguishable from real images, rai...

Reddit - Artificial Intelligence · 1 min ·
Microsoft’s new gaming CEO vows not to flood the ecosystem with ‘endless AI slop’ | TechCrunch
Ai Infrastructure

Microsoft’s new gaming CEO vows not to flood the ecosystem with ‘endless AI slop’ | TechCrunch

Microsoft's new gaming CEO, Asha Sharma, emphasizes a commitment to quality in gaming, rejecting the idea of overusing AI in game develop...

TechCrunch - AI · 3 min ·
Ai Safety

Civil rights for AI?

The article discusses the concept of civil rights for AI, exploring ethical considerations and implications for the future of artificial ...

Reddit - Artificial Intelligence · 1 min ·
Ai Safety

Lawyer says Google shut down his Gmail, Voice and Photos after NotebookLM upload

A lawyer claims that Google suspended his Gmail, Voice, and Photos accounts after he uploaded documents related to sensitive public recor...

Reddit - Artificial Intelligence · 1 min ·
Suspect in Tumbler Ridge school shooting described violent scenarios to ChatGPT | The Verge
Llms

Suspect in Tumbler Ridge school shooting described violent scenarios to ChatGPT | The Verge

The article discusses the Tumbler Ridge school shooting suspect's alarming interactions with ChatGPT, which raised concerns at OpenAI but...

The Verge - AI · 3 min ·
OpenAI debated calling police about suspected Canadian shooter's chats | TechCrunch
Llms

OpenAI debated calling police about suspected Canadian shooter's chats | TechCrunch

OpenAI faced a dilemma over whether to alert police about alarming chats from a suspected Canadian shooter, highlighting the challenges o...

TechCrunch - AI · 4 min ·
Ai Safety

How can a government actually stop or control AI?

The article discusses the challenges governments face in controlling AI, highlighting the limitations of legal and technical measures in ...

Reddit - Artificial Intelligence · 1 min ·
Ai Safety

You are the Product

The article discusses the concept that users of free AI services are often the product, emphasizing the implications for privacy and data...

Reddit - Artificial Intelligence · 1 min ·
Congress—Not the Pentagon or Anthropic—Should Set Military AI Rules
Ai Safety

Congress—Not the Pentagon or Anthropic—Should Set Military AI Rules

The article argues that Congress, not the Pentagon or private companies like Anthropic, should establish regulations for military AI use,...

AI Tools & Products · 8 min ·
The AI-Panic Cycle—And What’s Actually Different Now
Ai Agents

The AI-Panic Cycle—And What’s Actually Different Now

The article discusses the current AI landscape, focusing on the emergence of coding agents and the resulting anxiety among industry insid...

AI Tools & Products · 38 min ·
In the News: Manjeet Rege on AI at the Minnesota State Capitol
Ai Safety

In the News: Manjeet Rege on AI at the Minnesota State Capitol

Dr. Manjeet Rege discusses the role of AI in enhancing security at the Minnesota State Capitol, emphasizing its integration with human ju...

AI Tools & Products · 1 min ·
Anthropic rolls out embedded security scanning for Claude
Llms

Anthropic rolls out embedded security scanning for Claude

Anthropic introduces Claude Code Security, a new feature that scans AI-generated code for vulnerabilities and suggests patching solutions...

AI Tools & Products · 5 min ·
AI Reviews and Lab Tests
Ai Agents

AI Reviews and Lab Tests

This article reviews various AI tools, highlighting their features, pros, and cons, to help users choose the best options for their needs.

AI Tools & Products · 13 min ·
Llms

I fact-checked the "AI Moats are Dead" Substack article. It was AI-generated and got its own facts wrong.

The article critiques a Substack post claiming AI models lack moats, revealing it as AI-generated and factually incorrect, despite some v...

Reddit - Artificial Intelligence · 1 min ·
Godfather of AI Yann LeCun says: Yes, LLMs may be passing Maths Olympiads and bar exams, but they will still fail in …
Llms

Godfather of AI Yann LeCun says: Yes, LLMs may be passing Maths Olympiads and bar exams, but they will still fail in …

Yann LeCun, a prominent AI researcher, discusses the limitations of large language models (LLMs) despite their successes in exams, emphas...

AI News - General · 10 min ·
Anthropic-funded group backs candidate attacked by rival AI super PAC | TechCrunch
Ai Safety

Anthropic-funded group backs candidate attacked by rival AI super PAC | TechCrunch

The article discusses the political dynamics surrounding Alex Bores, a New York congressional candidate, who is supported by a PAC funded...

TechCrunch - AI · 3 min ·
Previous Page 75 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime