AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

[2512.21106] Semantic Refinement with LLMs for Graph Representations

Abstract page for arXiv paper 2512.21106: Semantic Refinement with LLMs for Graph Representations

arXiv - Machine Learning · 4 min · about 17 hours ago

Machine Learning

[2511.22294] Structure is Supervision: Multiview Masked Autoencoders for Radiology

Abstract page for arXiv paper 2511.22294: Structure is Supervision: Multiview Masked Autoencoders for Radiology

arXiv - Machine Learning · 4 min · about 17 hours ago

Llms

[2511.18123] Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models

Abstract page for arXiv paper 2511.18123: Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-La...

arXiv - Machine Learning · 4 min · about 17 hours ago

All Content

Ai Safety

Pentagon gives AI firm ultimatum: lift military limits by Friday or lose $200M deal

The Pentagon has issued an ultimatum to AI firm Anthropic, demanding the removal of military use restrictions on its Claude AI by Friday ...

AI Tools & Products · 8 min · about 1 month ago

Ai Startups

OpenAI defeats xAI’s trade secrets lawsuit | The Verge

OpenAI successfully dismissed xAI's trade secrets lawsuit, with the court ruling that xAI failed to demonstrate any misconduct by OpenAI ...

The Verge - AI · 4 min · about 1 month ago

Ai Safety

Inside Anthropic’s existential negotiations with the Pentagon

The article explores Anthropic's complex negotiations with the Pentagon regarding AI safety and ethical concerns, highlighting the implic...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Ai Safety

Anthropic believes RSI (recursive self improvement) could arrive “as soon as early 2027”

Anthropic predicts that recursive self-improvement (RSI) in AI could be realized as early as 2027, highlighting significant advancements ...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Nlp

Anthropic won’t budge as Pentagon escalates AI dispute | TechCrunch

The Pentagon demands Anthropic to loosen AI restrictions or face penalties, raising concerns over government control, vendor reliance, an...

TechCrunch - AI · 5 min · about 1 month ago

Ai Safety

Hegseth warns Anthropic to let the military use the company’s AI tech as it sees fit, AP source says

Hegseth urges Anthropic to allow military access to its AI technology, emphasizing the importance of defense applications in AI development.

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Machine Learning

Seedance 2.0 might be gen AI video’s next big hope, but it’s still slop | The Verge

The article critiques Seedance 2.0, an AI video generation tool by ByteDance, highlighting its impressive visuals but ultimately labeling...

The Verge - AI · 8 min · about 1 month ago

Machine Learning

[R] 91k production agent interactions (Feb 1–23, 2026): distribution shift toward tool-chain escalation + multimodal injection — notes on multilabel detection + evaluation

This report analyzes 91,284 interactions from AI agents to assess threat detection efficacy, focusing on multilabel classification and pe...

Reddit - Machine Learning · 1 min · about 1 month ago

Generative Ai

Teens are using AI frequently in their daily lives, and many parents aren't aware, survey finds

A recent survey reveals that teens are increasingly using AI tools in their daily lives, often without parental knowledge, highlighting a...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Ai Safety

Hegseth and Anthropic CEO set to meet as debate intensifies over the military's use of AI

Hegseth and Anthropic CEO are set to discuss the military's AI use, amidst growing debates on ethical implications and safety concerns su...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Ai Safety

[P] Lattice – Track what top AI labs are publishing daily across 24 research topics

Lattice is a tool designed to help users track daily AI research publications from leading labs across 24 topics, providing summaries and...

Reddit - Machine Learning · 1 min · about 1 month ago

Llms

How the creator of Claude Code sees the future of AI | The Verge

The Vergecast features Boris Cherny discussing the rise of Claude Code, an AI tool that has gained traction among developers, and explore...

The Verge - AI · 4 min · about 1 month ago

Ai Safety

Here is what I think Indian version of xianxia may be.

The article presents a conceptual exploration of an Indian version of xianxia, blending cultural elements with a unique metaphysical fram...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Ai Safety

The Download: Radioactive rhinos, and the rise and rise of peptides | MIT Technology Review

This edition of The Download explores the use of technology in combating wildlife poaching, the rise of peptides in wellness culture, and...

MIT Technology Review - AI · 7 min · about 1 month ago

Ai Safety

AI Will Never Be Conscious | WIRED

Michael Pollan's article explores the implications of AI consciousness, arguing that while AI can perform tasks, it lacks true personhood...

Wired - AI · 17 min · about 1 month ago

Ai Startups

Inside Anthropic’s existential negotiations with the Pentagon | The Verge

The article discusses Anthropic's tense negotiations with the Pentagon over a $200 million military contract, highlighting the implicatio...

The Verge - AI · 11 min · about 1 month ago

Machine Learning

[D] Papers with no code

The discussion highlights concerns over the prevalence of academic papers in machine learning that lack accompanying code, questioning th...

Reddit - Machine Learning · 1 min · about 1 month ago

Ai Safety

Why conservationists are making rhinos radioactive | MIT Technology Review

Conservationists are using innovative technologies, including radioactive isotopes, to combat wildlife trafficking and protect endangered...

MIT Technology Review - AI · 15 min · about 1 month ago

Ai Startups

Police HQ launches AI-AAC cell

The Nepal Police inaugurated an Artificial Intelligence and Advanced Analytics Cell (AI-AAC) to enhance crime investigation and national ...

AI News - General · 4 min · about 1 month ago

Llms

Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models

Anthropic accuses three Chinese AI labs of conducting distillation attacks on its Claude chatbot, claiming they illicitly extracted capab...

AI Tools & Products · 2 min · about 1 month ago

Previous Page 57 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Safety & Ethics

Top This Week

[2512.21106] Semantic Refinement with LLMs for Graph Representations

[2511.22294] Structure is Supervision: Multiview Masked Autoencoders for Radiology

[2511.18123] Bias Is a Subspace, Not a Coordinate: A Geometric Rethinking of Post-hoc Debiasing in Vision-Language Models

All Content

Pentagon gives AI firm ultimatum: lift military limits by Friday or lose $200M deal

OpenAI defeats xAI’s trade secrets lawsuit | The Verge

Inside Anthropic’s existential negotiations with the Pentagon

Anthropic believes RSI (recursive self improvement) could arrive “as soon as early 2027”

Anthropic won’t budge as Pentagon escalates AI dispute | TechCrunch

Hegseth warns Anthropic to let the military use the company’s AI tech as it sees fit, AP source says

Seedance 2.0 might be gen AI video’s next big hope, but it’s still slop | The Verge

[R] 91k production agent interactions (Feb 1–23, 2026): distribution shift toward tool-chain escalation + multimodal injection — notes on multilabel detection + evaluation

Teens are using AI frequently in their daily lives, and many parents aren't aware, survey finds

Hegseth and Anthropic CEO set to meet as debate intensifies over the military's use of AI

[P] Lattice – Track what top AI labs are publishing daily across 24 research topics

How the creator of Claude Code sees the future of AI | The Verge

Here is what I think Indian version of xianxia may be.

The Download: Radioactive rhinos, and the rise and rise of peptides | MIT Technology Review

AI Will Never Be Conscious | WIRED

Inside Anthropic’s existential negotiations with the Pentagon | The Verge

[D] Papers with no code

Why conservationists are making rhinos radioactive | MIT Technology Review

Police HQ launches AI-AAC cell

Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models

Related Topics

Stay updated with AI News