AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

Top This Week

[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Llms

[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

arXiv - AI · 4 min ·
[2510.18196] Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge
Llms

[2510.18196] Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

Abstract page for arXiv paper 2510.18196: Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

arXiv - AI · 3 min ·
[2509.23435] AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models
Llms

[2509.23435] AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

Abstract page for arXiv paper 2509.23435: AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

arXiv - AI · 4 min ·

All Content

AI Papers to Read in 2025
Ai Safety

AI Papers to Read in 2025

This article presents a curated list of ten significant AI papers to read in 2025, emphasizing their contributions and relevance to the e...

AI News - General · 19 min ·
LLMs Are Great, but They're Not Everything
Llms

LLMs Are Great, but They're Not Everything

The article critiques the overreliance on LLMs for complex tasks, highlighting their limitations in structured logic and deterministic wo...

Hacker News - AI · 4 min ·
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Machine Learning

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

This article explores the strengths and limitations of Large Reasoning Models (LRMs) in AI, revealing insights into their performance acr...

AI Events · 3 min ·
How much energy will AI really consume? The good, the bad and the unknown
Ai Infrastructure

How much energy will AI really consume? The good, the bad and the unknown

The article explores the energy consumption of AI technologies, urging transparency from firms regarding their electricity demands and th...

AI News - General · 4 min ·
First-of-its-kind standard on AI and machine learning in research aims to preserve data integrity
Machine Learning

First-of-its-kind standard on AI and machine learning in research aims to preserve data integrity

The Digital Governance Standards Institute has introduced Canada's first standard for AI and machine learning in research, focusing on et...

AI News - General · 3 min ·
Teaching AI models what they don’t know
Machine Learning

Teaching AI models what they don’t know

MIT researchers founded Themis AI to quantify AI model uncertainty and address knowledge gaps, enhancing reliability in high-stakes appli...

AI News - General · 9 min ·
How the U.S. Public and AI Experts View Artificial Intelligence
Ai Safety

How the U.S. Public and AI Experts View Artificial Intelligence

A Pew Research Center report reveals stark contrasts between U.S. public and AI experts regarding artificial intelligence, highlighting s...

AI News - General · 16 min ·
How to Reduce Bias in Machine Learning
Machine Learning

How to Reduce Bias in Machine Learning

This article discusses the importance of identifying and reducing bias in machine learning systems to ensure fairness and accuracy in AI ...

AI Events · 22 min ·
Artificial Intelligence at DHS
Ai Safety

Artificial Intelligence at DHS

The article outlines the Department of Homeland Security's (DHS) strategy for the responsible use of Artificial Intelligence (AI), detail...

AI News - General · 3 min ·
Show HN: 3LC – Illuminate the ML Black Box
Machine Learning

Show HN: 3LC – Illuminate the ML Black Box

3LC is an open-source tool designed to enhance the interpretability of machine learning models, addressing the 'black box' issue by provi...

Hacker News - AI · 1 min ·
NIST Identifies Types of Cyberattacks That Manipulate Behavior of AI Systems
Ai Safety

NIST Identifies Types of Cyberattacks That Manipulate Behavior of AI Systems

NIST outlines various cyberattack types that exploit vulnerabilities in AI systems, emphasizing the need for improved mitigation strategi...

AI News - General · 6 min ·
Explained: Generative AI
Generative Ai

Explained: Generative AI

This article from MIT News explores generative AI, explaining its workings and significance in modern applications, highlighting its evol...

AI News - General · 12 min ·
Does this artificial intelligence think like a human?
Machine Learning

Does this artificial intelligence think like a human?

MIT researchers developed a method called Shared Interest that helps users understand machine-learning models by comparing their reasonin...

AI News - General · 10 min ·
Avoiding shortcut solutions in artificial intelligence
Machine Learning

Avoiding shortcut solutions in artificial intelligence

MIT researchers developed a method to prevent shortcut solutions in machine learning models, enhancing their reliability by encouraging f...

AI News - General · 11 min ·

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime