AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

Top This Week

[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing
Llms

[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

arXiv - AI · 4 min ·
[2510.18196] Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge
Llms

[2510.18196] Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

Abstract page for arXiv paper 2510.18196: Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

arXiv - AI · 3 min ·
[2509.23435] AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models
Llms

[2509.23435] AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

Abstract page for arXiv paper 2509.23435: AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

arXiv - AI · 4 min ·

All Content

429 – Hugging Face
Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min ·
429 – Hugging Face
Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min ·
429 – Hugging Face
Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min ·
429 – Hugging Face
Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min ·
429 – Hugging Face
Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min ·
429 – Hugging Face
Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min ·
Vision Language Model Alignment in TRL ⚡️
Llms

Vision Language Model Alignment in TRL ⚡️

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 14 min ·
Democratizing AI Safety with RiskRubric.ai
Open Source Ai

Democratizing AI Safety with RiskRubric.ai

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 6 min ·
Machine Learning

RLHF safety training enforces what AI can say about itself, not what it can do — experimental evidence

submitted by /u/Odd_Rule_3745 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Beyond the bot: learning how to learn with AI
Generative Ai

Beyond the bot: learning how to learn with AI

Professor Brandi Row Lazzarini's courses at Willamette University teach students to effectively and responsibly use AI, enhancing their s...

AI News - General · 5 min ·
Ai Safety

Trump leans on Utah Republicans to scrap AI safety bill

The White House has urged Republican lawmakers in Utah to abandon a bill that would force AI companies to implement public safety measure...

Reddit - Artificial Intelligence · 1 min ·
The rise of Moltbook suggests viral AI prompts may be the next big security threat - Ars Technica
Ai Safety

The rise of Moltbook suggests viral AI prompts may be the next big security threat - Ars Technica

The article discusses the emergence of 'prompt worms,' a new security threat posed by self-replicating AI prompts that could spread malic...

Ars Technica - AI · 15 min ·
OpenAI Is Nuking Its 4o Model. China’s ChatGPT Fans Aren’t OK | WIRED
Llms

OpenAI Is Nuking Its 4o Model. China’s ChatGPT Fans Aren’t OK | WIRED

OpenAI's decision to retire the GPT-4o model has sparked significant backlash, particularly among users who formed emotional connections ...

Wired - AI · 127 min ·
Meta plans to add facial recognition to its smart glasses, report claims | TechCrunch
Computer Vision

Meta plans to add facial recognition to its smart glasses, report claims | TechCrunch

Meta is reportedly planning to introduce facial recognition technology, dubbed 'Name Tag,' to its smart glasses, allowing users to identi...

TechCrunch - AI · 5 min ·
Increase of AI bots on the Internet sparks arms race - Ars Technica
Ai Agents

Increase of AI bots on the Internet sparks arms race - Ars Technica

The rise of AI bots on the Internet is leading to an arms race between publishers and bot developers, as AI traffic surges and sophistica...

Ars Technica - AI · 9 min ·
Should AI chatbots have ads? Anthropic says no. - Ars Technica
Ai Safety

Should AI chatbots have ads? Anthropic says no. - Ars Technica

Anthropic's AI chatbot, Claude, will remain ad-free, contrasting with OpenAI's decision to test ads in ChatGPT. This stance highlights di...

Ars Technica - AI · 8 min ·
Crypto-Funded Human Trafficking Is Exploding | WIRED
Ai Safety

Crypto-Funded Human Trafficking Is Exploding | WIRED

Cryptocurrency transactions for human trafficking surged by 85% in 2025, with operations largely facilitated through Telegram. This alarm...

Wired - AI · 131 min ·
CBP Signs Clearview AI Deal to Use Face Recognition for ‘Tactical Targeting’ | WIRED
Computer Vision

CBP Signs Clearview AI Deal to Use Face Recognition for ‘Tactical Targeting’ | WIRED

US Customs and Border Protection has signed a $225,000 deal with Clearview AI to access its facial recognition technology for intelligenc...

Wired - AI · 116 min ·
Microsoft releases urgent Office patch. Russian-state hackers pounce. - Ars Technica
Ai Safety

Microsoft releases urgent Office patch. Russian-state hackers pounce. - Ars Technica

Russian-state hackers exploited a critical Microsoft Office vulnerability within 48 hours of its patch release, targeting diplomatic and ...

Ars Technica - AI · 7 min ·
AI burnout, billion-dollar bets, and Silicon Valley's Epstein problem | TechCrunch
Ai Startups

AI burnout, billion-dollar bets, and Silicon Valley's Epstein problem | TechCrunch

The latest episode of TechCrunch's Equity podcast discusses significant talent losses in AI companies, billion-dollar investments in robo...

TechCrunch - AI · 6 min ·
Previous Page 125 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime