AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

Top This Week

[2512.21106] Semantic Refinement with LLMs for Graph Representations
LLMs

Abstract page for arXiv paper 2512.21106: Semantic Refinement with LLMs for Graph Representations

arXiv - Machine Learning · 4 min ·
[2511.22294] Structure is Supervision: Multiview Masked Autoencoders for Radiology
Machine Learning

Abstract page for arXiv paper 2511.22294: Structure is Supervision: Multiview Masked Autoencoders for Radiology

arXiv - Machine Learning · 4 min ·
[2511.18123] Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models
LLMs

Abstract page for arXiv paper 2511.18123: Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-La...

arXiv - Machine Learning · 4 min ·

All Content

Pentagon gives AI firm ultimatum: lift military limits by Friday or lose $200M deal
AI Safety

The Pentagon has issued an ultimatum to AI firm Anthropic, demanding the removal of military use restrictions on its Claude AI by Friday ...

AI Tools & Products · 8 min ·
OpenAI defeats xAI’s trade secrets lawsuit | The Verge
AI Startups

OpenAI won dismissal of xAI's trade secrets lawsuit, with the court ruling that xAI failed to demonstrate any misconduct by OpenAI ...

The Verge - AI · 4 min ·
AI Safety

Inside Anthropic’s existential negotiations with the Pentagon

The article explores Anthropic's complex negotiations with the Pentagon regarding AI safety and ethical concerns, highlighting the implic...

Reddit - Artificial Intelligence · 1 min ·
AI Safety

Anthropic believes RSI (recursive self improvement) could arrive “as soon as early 2027”

Anthropic predicts that recursive self-improvement (RSI) in AI could arrive as soon as early 2027, highlighting significant advancements ...

Reddit - Artificial Intelligence · 1 min ·
Anthropic won’t budge as Pentagon escalates AI dispute | TechCrunch
NLP

The Pentagon is demanding that Anthropic loosen AI restrictions or face penalties, raising concerns over government control, vendor reliance, an...

TechCrunch - AI · 5 min ·
AI Safety

Hegseth warns Anthropic to let the military use the company’s AI tech as it sees fit, AP source says

Hegseth warns Anthropic to let the military use its AI technology as it sees fit, emphasizing the importance of defense applications in AI development.

Reddit - Artificial Intelligence · 1 min ·
Seedance 2.0 might be gen AI video’s next big hope, but it’s still slop | The Verge
Machine Learning

The article critiques Seedance 2.0, an AI video generation tool by ByteDance, highlighting its impressive visuals but ultimately labeling...

The Verge - AI · 8 min ·
Machine Learning

[R] 91k production agent interactions (Feb 1–23, 2026): distribution shift toward tool-chain escalation + multimodal injection — notes on multilabel detection + evaluation

This report analyzes 91,284 interactions from AI agents to assess threat detection efficacy, focusing on multilabel classification and pe...

Reddit - Machine Learning · 1 min ·
Generative AI

Teens are using AI frequently in their daily lives, and many parents aren't aware, survey finds

A recent survey reveals that teens are increasingly using AI tools in their daily lives, often without parental knowledge, highlighting a...

Reddit - Artificial Intelligence · 1 min ·
AI Safety

Hegseth and Anthropic CEO set to meet as debate intensifies over the military's use of AI

Hegseth and Anthropic's CEO are set to discuss the military's use of AI amid a growing debate over ethical implications and safety concerns su...

Reddit - Artificial Intelligence · 1 min ·
AI Safety

[P] Lattice – Track what top AI labs are publishing daily across 24 research topics

Lattice is a tool designed to help users track daily AI research publications from leading labs across 24 topics, providing summaries and...

Reddit - Machine Learning · 1 min ·
How the creator of Claude Code sees the future of AI | The Verge
LLMs

The Vergecast features Boris Cherny discussing the rise of Claude Code, an AI tool that has gained traction among developers, and explore...

The Verge - AI · 4 min ·
AI Safety

Here is what I think Indian version of xianxia may be.

The article presents a conceptual exploration of an Indian version of xianxia, blending cultural elements with a unique metaphysical fram...

Reddit - Artificial Intelligence · 1 min ·
The Download: Radioactive rhinos, and the rise and rise of peptides | MIT Technology Review
AI Safety

This edition of The Download explores the use of technology in combating wildlife poaching, the rise of peptides in wellness culture, and...

MIT Technology Review - AI · 7 min ·
AI Will Never Be Conscious | WIRED
AI Safety

Michael Pollan's article explores the implications of AI consciousness, arguing that while AI can perform tasks, it lacks true personhood...

Wired - AI · 17 min ·
Inside Anthropic’s existential negotiations with the Pentagon | The Verge
AI Startups

The article discusses Anthropic's tense negotiations with the Pentagon over a $200 million military contract, highlighting the implicatio...

The Verge - AI · 11 min ·
Machine Learning

[D] Papers with no code

The discussion highlights concerns over the prevalence of academic papers in machine learning that lack accompanying code, questioning th...

Reddit - Machine Learning · 1 min ·
Why conservationists are making rhinos radioactive | MIT Technology Review
AI Safety

Conservationists are using innovative technologies, including radioactive isotopes, to combat wildlife trafficking and protect endangered...

MIT Technology Review - AI · 15 min ·
Police HQ launches AI-AAC cell
AI Startups

The Nepal Police inaugurated an Artificial Intelligence and Advanced Analytics Cell (AI-AAC) to enhance crime investigation and national ...

AI News - General · 4 min ·
Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models
LLMs

Anthropic accuses three Chinese AI labs of conducting distillation attacks on its Claude chatbot, claiming they illicitly extracted capab...

AI Tools & Products · 2 min ·