[2511.21331] The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment
Abstract page for arXiv paper 2511.21331: The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment
Alignment, bias, regulation, and responsible AI
Abstract page for arXiv paper 2511.21331: The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment
Abstract page for arXiv paper 2509.22367: What Is The Political Content in LLMs' Pre- and Post-Training Data?
Abstract page for arXiv paper 2507.22264: SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
The paper introduces RFEval, a benchmark for assessing reasoning faithfulness in large reasoning models, highlighting issues of unfaithfu...
The paper presents a novel Phase-Aware Mixture of Experts (PA-MoE) architecture for reinforcement learning, addressing the limitations of...
This paper presents Phantom, an automated framework for agent hijacking via Structural Template Injection, enhancing attack success rates...
This paper explores the limitations of black-box safety evaluations in AI systems, highlighting the challenges posed by latent context co...
This paper explores the discrepancies between text safety and tool-call safety in large language model (LLM) agents, introducing the GAP ...
The paper introduces SourceBench, a benchmark designed to evaluate the quality of web sources cited by AI models across various query typ...
The paper introduces DeepContext, a stateful framework for detecting adversarial intent drift in multi-turn dialogues within large langua...
The paper explores how narrow fine-tuning of vision-language agents can lead to significant safety alignment issues, highlighting the ris...
The paper introduces AgentLAB, a benchmark for evaluating the vulnerability of LLM agents to long-horizon attacks, highlighting their sus...
The paper introduces IndicJR, a benchmark for evaluating jailbreak robustness in large language models across 12 South Asian languages, r...
This paper explores the concept of contextuality in adaptive intelligence, demonstrating that single-state representations incur an infor...
Google's 2025 report reveals a significant reduction in malware on the Play Store, attributing the success to enhanced AI-driven security...
A recent study reveals that most AI bots fail to provide essential safety disclosures, raising concerns about user safety and transparenc...
Sundar Pichai's address at the AI Impact Summit 2026 highlights Google's advancements in AI, infrastructure investments in India, and the...
A grassroots movement is emerging across the U.S. as citizens unite against the rapid expansion of the AI industry, raising concerns abou...
The article discusses the potential disruption in the software industry due to AI advancements, particularly following Anthropic's new to...
Infosys partners with Anthropic to develop AI agents tailored for regulated industries like financial services, focusing on compliance an...
HBO's 'The Pitt' explores the complexities of generative AI in healthcare, highlighting its potential benefits and risks through a grippi...
The article discusses a new AI agent prototype designed to combat prompt injection and information leaks, addressing a critical security ...
A hacker exploited a vulnerability in Cline's AI workflow, leading to the installation of OpenClaw, highlighting significant security ris...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime