AI Safety & Ethics

Alignment, bias, regulation, and responsible AI

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[2511.21331] The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment

Abstract page for arXiv paper 2511.21331: The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment

arXiv - AI · 4 min · about 13 hours ago

Llms

[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?

Abstract page for arXiv paper 2509.22367: What Is The Political Content in LLMs' Pre- and Post-Training Data?

arXiv - AI · 4 min · about 13 hours ago

Machine Learning

[2507.22264] SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

Abstract page for arXiv paper 2507.22264: SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

arXiv - AI · 4 min · about 13 hours ago

All Content

Machine Learning

[2602.16327] Guide-Guard: Off-Target Predicting in CRISPR Applications

The paper presents Guide-Guard, a machine learning solution designed to predict off-target effects in CRISPR applications with 84% accura...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16424] Verifiable Semantics for Agent-to-Agent Communication

This paper introduces a certification protocol for agent-to-agent communication in multiagent AI systems, addressing semantic drift and e...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.16246] Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

This paper presents a Proxy State-Based Evaluation framework for assessing multi-turn tool-calling LLM agents, offering a scalable altern...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.16220] SEMixer: Semantics Enhanced MLP-Mixer for Multiscale Mixing and Long-term Time Series Forecasting

The paper presents SEMixer, a novel multiscale model designed for long-term time series forecasting, addressing challenges in modeling te...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.16039] How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment

This article benchmarks various uncertainty metrics for LLM-based automatic assessment, highlighting the challenges of output uncertainty...

arXiv - AI · 4 min · about 2 months ago

Robotics

[2602.16037] Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection

This paper explores optimization instability in autonomous workflows for clinical symptom detection, revealing critical failure modes and...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.16197] ModalImmune: Immunity Driven Unlearning via Self Destructive Training

The paper presents ModalImmune, a training framework designed to enhance the resilience of multimodal systems against input channel loss ...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16193] Rethinking Input Domains in Physics-Informed Neural Networks via Geometric Compactification Mappings

This article presents a novel approach to enhance Physics-Informed Neural Networks (PINNs) by utilizing geometric compactification mappin...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16155] Differentially Private Non-convex Distributionally Robust Optimization

This paper presents a novel approach to differentially private non-convex distributionally robust optimization (DRO), addressing challeng...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.16065] Can Generative Artificial Intelligence Survive Data Contamination? Theoretical Guarantees under Contaminated Recursive Training

This paper explores the resilience of generative AI models against data contamination during recursive training, providing theoretical gu...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.16053] Multi-Objective Alignment of Language Models for Personalized Psychotherapy

This article discusses a multi-objective alignment framework for language models aimed at enhancing personalized psychotherapy, balancing...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16042] AI-CARE: Carbon-Aware Reporting Evaluation Metric for AI Models

The paper proposes AI-CARE, a carbon-aware evaluation metric for machine learning models, addressing the environmental impact of model tr...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15879] BamaER: A Behavior-Aware Memory-Augmented Model for Exercise Recommendation

The paper presents BamaER, a memory-augmented model designed for personalized exercise recommendations based on students' learning behavi...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15855] Kalman-Inspired Runtime Stability and Recovery in Hybrid Reasoning Systems

This article presents a framework for ensuring runtime stability and recovery in hybrid reasoning systems, emphasizing the importance of ...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

Personalization features can make LLMs more agreeable

This article discusses how personalization features in large language models (LLMs) can lead to sycophancy, where models overly agree wit...

AI News - General · 9 min · about 2 months ago

Ai Safety

Red and Blue States Alike Want To Limit AI in Insurance. Trump Wants To Limit the States.

A bipartisan movement is emerging across the U.S. to regulate AI in health insurance, challenging President Trump's push for less state o...

AI News - General · 17 min · about 2 months ago

Ai Agents

Discussion: DIALOGUS DE CONSCIENTIA ARTIFICIOSA: A Dialogue Concerning Artificial Consciousness

This article presents a philosophical dialogue on artificial consciousness, exploring the distinction between personhood and cognitive so...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Ai Safety

Anthropic is clashing with the Pentagon over AI use. Here's what each side wants

Anthropic is in negotiations with the Pentagon over the use of its AI models, seeking assurances against their application in autonomous ...

AI Tools & Products · 4 min · about 2 months ago

Generative Ai

SDNY Addresses Privilege and Work Product Implications of Using Unsecured Public AI Tools

The SDNY ruled that AI-generated documents using unsecured public tools are not protected by attorney-client privilege, emphasizing the r...

AI Tools & Products · 11 min · about 2 months ago

Llms

Scammers use fake “Gemini” AI chatbot to sell fake “Google Coin”

Scammers are exploiting AI by creating fake chatbots that impersonate Google's Gemini to sell a non-existent cryptocurrency called 'Googl...

AI Tools & Products · 8 min · about 2 months ago

Previous Page 89 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Safety & Ethics

Top This Week

[2511.21331] The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment

[2509.22367] What Is The Political Content in LLMs' Pre- and Post-Training Data?

[2507.22264] SmartCLIP: Modular Vision-language Alignment with Identification Guarantees

All Content

[2602.16327] Guide-Guard: Off-Target Predicting in CRISPR Applications

[2602.16424] Verifiable Semantics for Agent-to-Agent Communication

[2602.16246] Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

[2602.16220] SEMixer: Semantics Enhanced MLP-Mixer for Multiscale Mixing and Long-term Time Series Forecasting

[2602.16039] How Uncertain Is the Grade? A Benchmark of Uncertainty Metrics for LLM-Based Automatic Assessment

[2602.16037] Optimization Instability in Autonomous Agentic Workflows for Clinical Symptom Detection

[2602.16197] ModalImmune: Immunity Driven Unlearning via Self Destructive Training

[2602.16193] Rethinking Input Domains in Physics-Informed Neural Networks via Geometric Compactification Mappings

[2602.16155] Differentially Private Non-convex Distributionally Robust Optimization

[2602.16065] Can Generative Artificial Intelligence Survive Data Contamination? Theoretical Guarantees under Contaminated Recursive Training

[2602.16053] Multi-Objective Alignment of Language Models for Personalized Psychotherapy

[2602.16042] AI-CARE: Carbon-Aware Reporting Evaluation Metric for AI Models

[2602.15879] BamaER: A Behavior-Aware Memory-Augmented Model for Exercise Recommendation

[2602.15855] Kalman-Inspired Runtime Stability and Recovery in Hybrid Reasoning Systems

Personalization features can make LLMs more agreeable

Red and Blue States Alike Want To Limit AI in Insurance. Trump Wants To Limit the States.

Discussion: DIALOGUS DE CONSCIENTIA ARTIFICIOSA: A Dialogue Concerning Artificial Consciousness

Anthropic is clashing with the Pentagon over AI use. Here's what each side wants

SDNY Addresses Privilege and Work Product Implications of Using Unsecured Public AI Tools

Scammers use fake “Gemini” AI chatbot to sell fake “Google Coin”

Related Topics

Stay updated with AI News