AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

Top This Week

[2511.21331] The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment
Machine Learning

[2511.21331] The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment

Abstract page for arXiv paper 2511.21331: The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment

arXiv - AI · 4 min ·
[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?
Llms

[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?

Abstract page for arXiv paper 2509.22367: What Is The Political Content in LLMs' Pre- and Post-Training Data?

arXiv - AI · 4 min ·
[2507.22264] SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
Machine Learning

[2507.22264] SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

Abstract page for arXiv paper 2507.22264: SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

arXiv - AI · 4 min ·

All Content

[2602.16327] Guide-Guard: Off-Target Predicting in CRISPR Applications
Machine Learning

[2602.16327] Guide-Guard: Off-Target Predicting in CRISPR Applications

The paper presents Guide-Guard, a machine learning solution designed to predict off-target effects in CRISPR applications with 84% accura...

arXiv - AI · 3 min ·
[2602.16424] Verifiable Semantics for Agent-to-Agent Communication
Machine Learning

[2602.16424] Verifiable Semantics for Agent-to-Agent Communication

This paper introduces a certification protocol for agent-to-agent communication in multiagent AI systems, addressing semantic drift and e...

arXiv - AI · 3 min ·
[2602.16246] Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents
Llms

[2602.16246] Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

This paper presents a Proxy State-Based Evaluation framework for assessing multi-turn tool-calling LLM agents, offering a scalable altern...

arXiv - AI · 4 min ·
[2602.16220] SEMixer: Semantics Enhanced MLP-Mixer for Multiscale Mixing and Long-term Time Series Forecasting
Machine Learning

[2602.16220] SEMixer: Semantics Enhanced MLP-Mixer for Multiscale Mixing and Long-term Time Series Forecasting

The paper presents SEMixer, a novel multiscale model designed for long-term time series forecasting, addressing challenges in modeling te...

arXiv - Machine Learning · 3 min ·
[2602.16039] How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment
Llms

[2602.16039] How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment

This article benchmarks various uncertainty metrics for LLM-based automatic assessment, highlighting the challenges of output uncertainty...

arXiv - AI · 4 min ·
[2602.16037] Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection
Robotics

[2602.16037] Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection

This paper explores optimization instability in autonomous workflows for clinical symptom detection, revealing critical failure modes and...

arXiv - AI · 4 min ·
[2602.16197] ModalImmune: Immunity Driven Unlearning via Self Destructive Training
Machine Learning

[2602.16197] ModalImmune: Immunity Driven Unlearning via Self Destructive Training

The paper presents ModalImmune, a training framework designed to enhance the resilience of multimodal systems against input channel loss ...

arXiv - Machine Learning · 3 min ·
[2602.16193] Rethinking Input Domains in Physics-Informed Neural Networks via Geometric Compactification Mappings
Machine Learning

[2602.16193] Rethinking Input Domains in Physics-Informed Neural Networks via Geometric Compactification Mappings

This article presents a novel approach to enhance Physics-Informed Neural Networks (PINNs) by utilizing geometric compactification mappin...

arXiv - AI · 3 min ·
[2602.16155] Differentially Private Non-convex Distributionally Robust Optimization
Machine Learning

[2602.16155] Differentially Private Non-convex Distributionally Robust Optimization

This paper presents a novel approach to differentially private non-convex distributionally robust optimization (DRO), addressing challeng...

arXiv - Machine Learning · 4 min ·
[2602.16065] Can Generative Artificial Intelligence Survive Data Contamination? Theoretical Guarantees under Contaminated Recursive Training
Llms

[2602.16065] Can Generative Artificial Intelligence Survive Data Contamination? Theoretical Guarantees under Contaminated Recursive Training

This paper explores the resilience of generative AI models against data contamination during recursive training, providing theoretical gu...

arXiv - AI · 4 min ·
[2602.16053] Multi-Objective Alignment of Language Models for Personalized Psychotherapy
Llms

[2602.16053] Multi-Objective Alignment of Language Models for Personalized Psychotherapy

This article discusses a multi-objective alignment framework for language models aimed at enhancing personalized psychotherapy, balancing...

arXiv - Machine Learning · 3 min ·
[2602.16042] AI-CARE: Carbon-Aware Reporting Evaluation Metric for AI Models
Machine Learning

[2602.16042] AI-CARE: Carbon-Aware Reporting Evaluation Metric for AI Models

The paper proposes AI-CARE, a carbon-aware evaluation metric for machine learning models, addressing the environmental impact of model tr...

arXiv - Machine Learning · 3 min ·
[2602.15879] BamaER: A Behavior-Aware Memory-Augmented Model for Exercise Recommendation
Machine Learning

[2602.15879] BamaER: A Behavior-Aware Memory-Augmented Model for Exercise Recommendation

The paper presents BamaER, a memory-augmented model designed for personalized exercise recommendations based on students' learning behavi...

arXiv - Machine Learning · 4 min ·
[2602.15855] Kalman-Inspired Runtime Stability and Recovery in Hybrid Reasoning Systems
Machine Learning

[2602.15855] Kalman-Inspired Runtime Stability and Recovery in Hybrid Reasoning Systems

This article presents a framework for ensuring runtime stability and recovery in hybrid reasoning systems, emphasizing the importance of ...

arXiv - Machine Learning · 3 min ·
Personalization features can make LLMs more agreeable
Llms

Personalization features can make LLMs more agreeable

This article discusses how personalization features in large language models (LLMs) can lead to sycophancy, where models overly agree wit...

AI News - General · 9 min ·
Red and Blue States Alike Want To Limit AI in Insurance. Trump Wants To Limit the States.
Ai Safety

Red and Blue States Alike Want To Limit AI in Insurance. Trump Wants To Limit the States.

A bipartisan movement is emerging across the U.S. to regulate AI in health insurance, challenging President Trump's push for less state o...

AI News - General · 17 min ·
Ai Agents

Discussion: DIALOGUS DE CONSCIENTIA ARTIFICIOSA: A Dialogue Concerning Artificial Consciousness

This article presents a philosophical dialogue on artificial consciousness, exploring the distinction between personhood and cognitive so...

Reddit - Artificial Intelligence · 1 min ·
Anthropic is clashing with the Pentagon over AI use. Here's what each side wants
Ai Safety

Anthropic is clashing with the Pentagon over AI use. Here's what each side wants

Anthropic is in negotiations with the Pentagon over the use of its AI models, seeking assurances against their application in autonomous ...

AI Tools & Products · 4 min ·
SDNY Addresses Privilege and Work Product Implications of Using Unsecured Public AI Tools
Generative Ai

SDNY Addresses Privilege and Work Product Implications of Using Unsecured Public AI Tools

The SDNY ruled that AI-generated documents using unsecured public tools are not protected by attorney-client privilege, emphasizing the r...

AI Tools & Products · 11 min ·
Scammers use fake “Gemini” AI chatbot to sell fake “Google Coin”
Llms

Scammers use fake “Gemini” AI chatbot to sell fake “Google Coin”

Scammers are exploiting AI by creating fake chatbots that impersonate Google's Gemini to sell a non-existent cryptocurrency called 'Googl...

AI Tools & Products · 8 min ·
Previous Page 89 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime