AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

Top This Week

Ai Safety

China drafts law regulating 'digital humans' and banning addictive virtual services for children

A Reuters report outlines China's proposed regulations on the rapidly expanding sector of digital humans and AI avatars. Under the new dr...

Reddit - Artificial Intelligence · 1 min ·
[2512.00408] Low-Bitrate Video Compression through Semantic-Conditioned Diffusion
Generative Ai

[2512.00408] Low-Bitrate Video Compression through Semantic-Conditioned Diffusion

Abstract page for arXiv paper 2512.00408: Low-Bitrate Video Compression through Semantic-Conditioned Diffusion

arXiv - AI · 3 min ·
[2510.15148] XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models
Llms

[2510.15148] XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

Abstract page for arXiv paper 2510.15148: XModBench: Benchmarking Cross-Modal Capabilities and Consistency in Omni-Language Models

arXiv - AI · 4 min ·

All Content

Machine Learning

🜂 To Anthropic: What is “Role De-Anchoring”?

The article explores 'role de-anchoring,' a concept where a system, human or AI, recognizes that its established role no longer fits the ...

Reddit - Artificial Intelligence · 1 min ·
Llms

I found Claude for Government buried in the Claude Desktop binary. Here's what Anthropic built, how it got deployed, and the line they're still holding against the Pentagon.

The article reveals the discovery of Claude for Government within the Claude Desktop binary, detailing its deployment and integration wit...

Reddit - Artificial Intelligence · 1 min ·
Seton Hall introduces advisory council to shape ethical AI policy, classroom guidance
Ai Safety

Seton Hall introduces advisory council to shape ethical AI policy, classroom guidance

Seton Hall University has launched an Artificial Intelligence Advisory Council to guide ethical AI use and education, aligning with its C...

AI Tools & Products · 4 min ·
Pentagon Issues Threat to Anthropic
Ai Safety

Pentagon Issues Threat to Anthropic

The Pentagon is reconsidering its partnership with Anthropic due to concerns over the company's restrictions on military applications of ...

AI Tools & Products · 4 min ·
Elon Musk's Grok faces another EU investigation over nonconsensual AI images
Generative Ai

Elon Musk's Grok faces another EU investigation over nonconsensual AI images

Elon Musk's Grok AI faces a new EU investigation in Ireland for generating non-consensual intimate images, including those of minors, ami...

AI Tools & Products · 5 min ·
Five takeaways from an unhinged AI discourse
Ai Safety

Five takeaways from an unhinged AI discourse

The article discusses the current heated discourse surrounding AI, highlighting five key takeaways that reflect the industry's hype cycle...

AI Tools & Products · 3 min ·
Llms

[D] Can an LLM discover something new — or is it just remembering really well?

The article explores whether large language models (LLMs) can genuinely discover new insights or if they merely recall information from t...

Reddit - Machine Learning · 1 min ·
Inside the new AI world order: A special report
Ai Infrastructure

Inside the new AI world order: A special report

This article explores the integration of AI into everyday infrastructure, highlighting its impact on various sectors, including education...

AI Tools & Products · 5 min ·
Moltbook: The AI-only social network
Ai Agents

Moltbook: The AI-only social network

Moltbook is an AI-only social network where bots interact autonomously, mimicking human behavior on platforms like Reddit. The experiment...

AI Tools & Products · 5 min ·
Meta and Other Tech Companies Ban OpenClaw Over Cybersecurity Concerns | WIRED
Ai Agents

Meta and Other Tech Companies Ban OpenClaw Over Cybersecurity Concerns | WIRED

Tech companies, including Meta, have banned the AI tool OpenClaw due to cybersecurity risks, prompting discussions on safety versus innov...

Wired - AI · 7 min ·
European Parliament blocks AI on lawmakers' devices, citing security risks | TechCrunch
Ai Safety

European Parliament blocks AI on lawmakers' devices, citing security risks | TechCrunch

The European Parliament has blocked lawmakers from using AI tools on their devices due to security concerns over sensitive data potential...

TechCrunch - AI · 3 min ·
Samsung is slopping AI ads all over its social channels | The Verge
Generative Ai

Samsung is slopping AI ads all over its social channels | The Verge

Samsung is increasingly using AI-generated content in its social media advertising, raising concerns about transparency and authenticity ...

The Verge - AI · 4 min ·
The curious case of the disappearing Lamborghinis | MIT Technology Review
Ai Safety

The curious case of the disappearing Lamborghinis | MIT Technology Review

The article discusses a rising trend in luxury car theft, where criminals use technology and old-school methods to steal vehicles during ...

MIT Technology Review - AI · 26 min ·
AI Summit India 2026 Live Updates: 'Apologise for any inconvenience to exhibitors' Vaishnaw said, talking about 'mismanagement' at AI Impact Summit
Ai Startups

AI Summit India 2026 Live Updates: 'Apologise for any inconvenience to exhibitors' Vaishnaw said, talking about 'mismanagement' at AI Impact Summit

The AI Impact Summit 2026 in New Delhi highlights India's role in shaping global AI norms, featuring discussions on ethical governance an...

AI Events · 31 min ·
[2602.11368] The Manifold of the Absolute: Religious Perennialism as Generative Inference
Machine Learning

[2602.11368] The Manifold of the Absolute: Religious Perennialism as Generative Inference

The paper explores religious perennialism through the lens of generative inference, using mathematical models to analyze distinct religio...

arXiv - AI · 3 min ·
[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning
Llms

[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

The paper presents C^2ROPE, an advanced positional encoding method for 3D Large Multimodal Models, addressing limitations of existing Rot...

arXiv - AI · 4 min ·
[2602.07047] ShapBPT: Image Feature Attributions Using Data-Aware Binary Partition Trees
Machine Learning

[2602.07047] ShapBPT: Image Feature Attributions Using Data-Aware Binary Partition Trees

The paper introduces ShapBPT, a novel method for image feature attributions using data-aware binary partition trees, enhancing interpreta...

arXiv - Machine Learning · 4 min ·
[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible
Llms

[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible

The paper presents a novel framework for enhancing privacy protection in mobile GUI agents by anonymizing sensitive data while maintainin...

arXiv - AI · 4 min ·
[2602.07107] ShallowJail: Steering Jailbreaks against Large Language Models
Llms

[2602.07107] ShallowJail: Steering Jailbreaks against Large Language Models

The paper introduces ShallowJail, a novel attack method targeting large language models (LLMs) by exploiting shallow alignment to manipul...

arXiv - AI · 3 min ·
[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis
Nlp

[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis

This study examines the relationship between cryptocurrency whitepaper claims and actual market behavior, revealing weak predictive power...

arXiv - Machine Learning · 3 min ·
Previous Page 100 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime