AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[2601.15356] Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

Abstract page for arXiv paper 2601.15356: Q-Probe: Scaling Image Quality Assessment to High Resolution via Context-Aware Agentic Probing

arXiv - AI · 4 min · about 6 hours ago

Llms

[2510.18196] Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

Abstract page for arXiv paper 2510.18196: Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge

arXiv - AI · 3 min · about 6 hours ago

Llms

[2509.23435] AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

Abstract page for arXiv paper 2509.23435: AudioRole: An Audio Dataset for Character Role-Playing in Large Language Models

arXiv - AI · 4 min · about 6 hours ago

All Content

429 – Hugging Face

Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min · about 2 months ago

429 – Hugging Face

Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min · about 2 months ago

429 – Hugging Face

Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min · about 2 months ago

429 – Hugging Face

Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min · about 2 months ago

429 – Hugging Face

Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min · about 2 months ago

429 – Hugging Face

Open Source Ai

429 – Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 1 min · about 2 months ago

Vision Language Model Alignment in TRL ⚡️

Llms

Vision Language Model Alignment in TRL ⚡️

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 14 min · about 2 months ago

Democratizing AI Safety with RiskRubric.ai

Open Source Ai

Democratizing AI Safety with RiskRubric.ai

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog · 6 min · about 2 months ago

Machine Learning

RLHF safety training enforces what AI can say about itself, not what it can do — experimental evidence

submitted by /u/Odd_Rule_3745 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Beyond the bot: learning how to learn with AI

Generative Ai

Beyond the bot: learning how to learn with AI

Professor Brandi Row Lazzarini's courses at Willamette University teach students to effectively and responsibly use AI, enhancing their s...

AI News - General · 5 min · about 2 months ago

Ai Safety

Trump leans on Utah Republicans to scrap AI safety bill

The White House has urged Republican lawmakers in Utah to abandon a bill that would force AI companies to implement public safety measure...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

The rise of Moltbook suggests viral AI prompts may be the next big security threat - Ars Technica

Ai Safety

The rise of Moltbook suggests viral AI prompts may be the next big security threat - Ars Technica

The article discusses the emergence of 'prompt worms,' a new security threat posed by self-replicating AI prompts that could spread malic...

Ars Technica - AI · 15 min · about 2 months ago

OpenAI Is Nuking Its 4o Model. China’s ChatGPT Fans Aren’t OK | WIRED

Llms

OpenAI Is Nuking Its 4o Model. China’s ChatGPT Fans Aren’t OK | WIRED

OpenAI's decision to retire the GPT-4o model has sparked significant backlash, particularly among users who formed emotional connections ...

Wired - AI · 127 min · about 2 months ago

Meta plans to add facial recognition to its smart glasses, report claims | TechCrunch

Computer Vision

Meta plans to add facial recognition to its smart glasses, report claims | TechCrunch

Meta is reportedly planning to introduce facial recognition technology, dubbed 'Name Tag,' to its smart glasses, allowing users to identi...

TechCrunch - AI · 5 min · about 2 months ago

Increase of AI bots on the Internet sparks arms race - Ars Technica

Ai Agents

Increase of AI bots on the Internet sparks arms race - Ars Technica

The rise of AI bots on the Internet is leading to an arms race between publishers and bot developers, as AI traffic surges and sophistica...

Ars Technica - AI · 9 min · about 2 months ago

Should AI chatbots have ads? Anthropic says no. - Ars Technica

Ai Safety

Should AI chatbots have ads? Anthropic says no. - Ars Technica

Anthropic's AI chatbot, Claude, will remain ad-free, contrasting with OpenAI's decision to test ads in ChatGPT. This stance highlights di...

Ars Technica - AI · 8 min · about 2 months ago

Crypto-Funded Human Trafficking Is Exploding | WIRED

Ai Safety

Crypto-Funded Human Trafficking Is Exploding | WIRED

Cryptocurrency transactions for human trafficking surged by 85% in 2025, with operations largely facilitated through Telegram. This alarm...

Wired - AI · 131 min · about 2 months ago

CBP Signs Clearview AI Deal to Use Face Recognition for ‘Tactical Targeting’ | WIRED

Computer Vision

CBP Signs Clearview AI Deal to Use Face Recognition for ‘Tactical Targeting’ | WIRED

US Customs and Border Protection has signed a $225,000 deal with Clearview AI to access its facial recognition technology for intelligenc...

Wired - AI · 116 min · about 2 months ago

Microsoft releases urgent Office patch. Russian-state hackers pounce. - Ars Technica

Ai Safety

Microsoft releases urgent Office patch. Russian-state hackers pounce. - Ars Technica

Russian-state hackers exploited a critical Microsoft Office vulnerability within 48 hours of its patch release, targeting diplomatic and ...

Ars Technica - AI · 7 min · about 2 months ago

AI burnout, billion-dollar bets, and Silicon Valley's Epstein problem | TechCrunch

Ai Startups

AI burnout, billion-dollar bets, and Silicon Valley's Epstein problem | TechCrunch

The latest episode of TechCrunch's Equity podcast discusses significant talent losses in AI companies, billion-dollar investments in robo...

TechCrunch - AI · 6 min · about 2 months ago

Previous Page 125 Next

Related Topics

Machine Learning Large Language Models Generative AI Natural Language Processing Computer Vision Robotics & Embodied AI

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime