AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

arXiv - AI · 4 min · about 10 hours ago

Llms

[2510.18196] Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

Abstract page for arXiv paper 2510.18196: Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

arXiv - AI · 3 min · about 10 hours ago

Llms

[2509.23435] AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

Abstract page for arXiv paper 2509.23435: AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

arXiv - AI · 4 min · about 10 hours ago

All Content

Ai Safety

AI Papers to Read in 2025

This article presents a curated list of ten significant AI papers to read in 2025, emphasizing their contributions and relevance to the e...

AI News - General · 19 min · 11 months ago

Llms

LLMs Are Great, but They're Not Everything

The article critiques the overreliance on LLMs for complex tasks, highlighting their limitations in structured logic and deterministic wo...

Hacker News - AI · 4 min · 11 months ago

Machine Learning

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

This article explores the strengths and limitations of Large Reasoning Models (LRMs) in AI, revealing insights into their performance acr...

AI Events · 3 min · 11 months ago

Ai Infrastructure

How much energy will AI really consume? The good, the bad and the unknown

The article explores the energy consumption of AI technologies, urging transparency from firms regarding their electricity demands and th...

AI News - General · 4 min · 11 months ago

Machine Learning

First-of-its-kind standard on AI and machine learning in research aims to preserve data integrity

The Digital Governance Standards Institute has introduced Canada's first standard for AI and machine learning in research, focusing on et...

AI News - General · 3 min · 12 months ago

Machine Learning

Teaching AI models what they don’t know

MIT researchers founded Themis AI to quantify AI model uncertainty and address knowledge gaps, enhancing reliability in high-stakes appli...

AI News - General · 9 min · about 1 year ago

Ai Safety

How the U.S. Public and AI Experts View Artificial Intelligence

A Pew Research Center report reveals stark contrasts between U.S. public and AI experts regarding artificial intelligence, highlighting s...

AI News - General · 16 min · about 1 year ago

Machine Learning

How to Reduce Bias in Machine Learning

This article discusses the importance of identifying and reducing bias in machine learning systems to ensure fairness and accuracy in AI ...

AI Events · 22 min · about 1 year ago

Ai Safety

Artificial Intelligence at DHS

The article outlines the Department of Homeland Security's (DHS) strategy for the responsible use of Artificial Intelligence (AI), detail...

AI News - General · 3 min · about 1 year ago

Machine Learning

Show HN: 3LC – Illuminate the ML Black Box

3LC is an open-source tool designed to enhance the interpretability of machine learning models, addressing the 'black box' issue by provi...

Hacker News - AI · 1 min · almost 2 years ago

Ai Safety

NIST Identifies Types of Cyberattacks That Manipulate Behavior of AI Systems

NIST outlines various cyberattack types that exploit vulnerabilities in AI systems, emphasizing the need for improved mitigation strategi...

AI News - General · 6 min · about 2 years ago

Generative Ai

Explained: Generative AI

This article from MIT News explores generative AI, explaining its workings and significance in modern applications, highlighting its evol...

AI News - General · 12 min · over 2 years ago

Machine Learning

Does this artificial intelligence think like a human?

MIT researchers developed a method called Shared Interest that helps users understand machine-learning models by comparing their reasonin...

AI News - General · 10 min · almost 4 years ago

Machine Learning

Avoiding shortcut solutions in artificial intelligence

MIT researchers developed a method to prevent shortcut solutions in machine learning models, enhancing their reliability by encouraging f...

AI News - General · 11 min · about 5 years ago

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Safety & Ethics

Top This Week

[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

[2510.18196] Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

[2509.23435] AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

All Content

AI Papers to Read in 2025

LLMs Are Great, but They're Not Everything

The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

How much energy will AI really consume? The good, the bad and the unknown

First-of-its-kind standard on AI and machine learning in research aims to preserve data integrity

Teaching AI models what they don’t know

How the U.S. Public and AI Experts View Artificial Intelligence

How to Reduce Bias in Machine Learning

Artificial Intelligence at DHS

Show HN: 3LC – Illuminate the ML Black Box

NIST Identifies Types of Cyberattacks That Manipulate Behavior of AI Systems

Explained: Generative AI

Does this artificial intelligence think like a human?

Avoiding shortcut solutions in artificial intelligence

Related Topics

Stay updated with AI News