AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Safety

China drafts law regulating 'digital humans' and banning addictive virtual services for children

A Reuters report outlines China's proposed regulations on the rapidly expanding sector of digital humans and AI avatars. Under the new dr...

Reddit - Artificial Intelligence · 1 min · about 7 hours ago

Generative Ai

[2512.00408] Low-Bitrate Video Compression through Semantic-Conditioned Diffusion

Abstract page for arXiv paper 2512.00408: Low-Bitrate Video Compression through Semantic-Conditioned Diffusion

arXiv - AI · 3 min · about 7 hours ago

Llms

[2510.15148] XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

Abstract page for arXiv paper 2510.15148: XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

arXiv - AI · 4 min · about 7 hours ago

All Content

Machine Learning

🜂 To Anthropic: What is “Role De-Anchoring”?

The article explores 'role de-anchoring,' a concept where a system, human or AI, recognizes that its established role no longer fits the ...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Llms

I found Claude for Government buried in the Claude Desktop binary. Here's what Anthropic built, how it got deployed, and the line they're still holding against the Pentagon.

The article reveals the discovery of Claude for Government within the Claude Desktop binary, detailing its deployment and integration wit...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Ai Safety

Seton Hall introduces advisory council to shape ethical AI policy, classroom guidance

Seton Hall University has launched an Artificial Intelligence Advisory Council to guide ethical AI use and education, aligning with its C...

AI Tools & Products · 4 min · about 2 months ago

Ai Safety

Pentagon Issues Threat to Anthropic

The Pentagon is reconsidering its partnership with Anthropic due to concerns over the company's restrictions on military applications of ...

AI Tools & Products · 4 min · about 2 months ago

Generative Ai

Elon Musk's Grok faces another EU investigation over nonconsensual AI images

Elon Musk's Grok AI faces a new EU investigation in Ireland for generating non-consensual intimate images, including those of minors, ami...

AI Tools & Products · 5 min · about 2 months ago

Ai Safety

Five takeaways from an unhinged AI discourse

The article discusses the current heated discourse surrounding AI, highlighting five key takeaways that reflect the industry's hype cycle...

AI Tools & Products · 3 min · about 2 months ago

Llms

[D] Can an LLM discover something new — or is it just remembering really well?

The article explores whether large language models (LLMs) can genuinely discover new insights or if they merely recall information from t...

Reddit - Machine Learning · 1 min · about 2 months ago

Ai Infrastructure

Inside the new AI world order: A special report

This article explores the integration of AI into everyday infrastructure, highlighting its impact on various sectors, including education...

AI Tools & Products · 5 min · about 2 months ago

Ai Agents

Moltbook: The AI-only social network

Moltbook is an AI-only social network where bots interact autonomously, mimicking human behavior on platforms like Reddit. The experiment...

AI Tools & Products · 5 min · about 2 months ago

Ai Agents

Meta and Other Tech Companies Ban OpenClaw Over Cybersecurity Concerns | WIRED

Tech companies, including Meta, have banned the AI tool OpenClaw due to cybersecurity risks, prompting discussions on safety versus innov...

Wired - AI · 7 min · about 2 months ago

Ai Safety

European Parliament blocks AI on lawmakers' devices, citing security risks | TechCrunch

The European Parliament has blocked lawmakers from using AI tools on their devices due to security concerns over sensitive data potential...

TechCrunch - AI · 3 min · about 2 months ago

Generative Ai

Samsung is slopping AI ads all over its social channels | The Verge

Samsung is increasingly using AI-generated content in its social media advertising, raising concerns about transparency and authenticity ...

The Verge - AI · 4 min · about 2 months ago

Ai Safety

The curious case of the disappearing Lamborghinis | MIT Technology Review

The article discusses a rising trend in luxury car theft, where criminals use technology and old-school methods to steal vehicles during ...

MIT Technology Review - AI · 26 min · about 2 months ago

Ai Startups

AI Summit India 2026 Live Updates: 'Apologise for any inconvenience to exhibitors' Vaishnaw said, talking about 'mismanagement' at AI Impact Summit

The AI Impact Summit 2026 in New Delhi highlights India's role in shaping global AI norms, featuring discussions on ethical governance an...

AI Events · 31 min · about 2 months ago

Machine Learning

[2602.11368] The Manifold of the Absolute: Religious Perennialism as Generative Inference

The paper explores religious perennialism through the lens of generative inference, using mathematical models to analyze distinct religio...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

The paper presents C^2ROPE, an advanced positional encoding method for 3D Large Multimodal Models, addressing limitations of existing Rot...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.07047] ShapBPT: Image Feature Attributions Using Data-Aware Binary Partition Trees

The paper introduces ShapBPT, a novel method for image feature attributions using data-aware binary partition trees, enhancing interpreta...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible

The paper presents a novel framework for enhancing privacy protection in mobile GUI agents by anonymizing sensitive data while maintainin...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.07107] ShallowJail: Steering Jailbreaks against Large Language Models

The paper introduces ShallowJail, a novel attack method targeting large language models (LLMs) by exploiting shallow alignment to manipul...

arXiv - AI · 3 min · about 2 months ago

Nlp

[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis

This study examines the relationship between cryptocurrency whitepaper claims and actual market behavior, revealing weak predictive power...

arXiv - Machine Learning · 3 min · about 2 months ago

Previous Page 100 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Safety & Ethics

Top This Week

China drafts law regulating 'digital humans' and banning addictive virtual services for children

[2512.00408] Low-Bitrate Video Compression through Semantic-Conditioned Diffusion

[2510.15148] XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

All Content

🜂 To Anthropic: What is “Role De-Anchoring”?

I found Claude for Government buried in the Claude Desktop binary. Here's what Anthropic built, how it got deployed, and the line they're still holding against the Pentagon.

Seton Hall introduces advisory council to shape ethical AI policy, classroom guidance

Pentagon Issues Threat to Anthropic

Elon Musk's Grok faces another EU investigation over nonconsensual AI images

Five takeaways from an unhinged AI discourse

[D] Can an LLM discover something new — or is it just remembering really well?

Inside the new AI world order: A special report

Moltbook: The AI-only social network

Meta and Other Tech Companies Ban OpenClaw Over Cybersecurity Concerns | WIRED

European Parliament blocks AI on lawmakers' devices, citing security risks | TechCrunch

Samsung is slopping AI ads all over its social channels | The Verge

The curious case of the disappearing Lamborghinis | MIT Technology Review

AI Summit India 2026 Live Updates: 'Apologise for any inconvenience to exhibitors' Vaishnaw said, talking about 'mismanagement' at AI Impact Summit

[2602.11368] The Manifold of the Absolute: Religious Perennialism as Generative Inference

[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

[2602.07047] ShapBPT: Image Feature Attributions Using Data-Aware Binary Partition Trees

[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible

[2602.07107] ShallowJail: Steering Jailbreaks against Large Language Models

[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis

Related Topics

Stay updated with AI News