[2511.21331] The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment
Abstract page for arXiv paper 2511.21331: The More, the Merrier: Contrastive Fusion for Higher-Order Multimodal Alignment
Alignment, bias, regulation, and responsible AI
Abstract page for arXiv paper 2509.22367: What Is The Political Content in LLMs' Pre- and Post-Training Data?
Abstract page for arXiv paper 2507.22264: SmartCLIP: Modular Vision-language Alignment with Identification Guarantees
The paper presents LGQ, a novel image tokenizer that learns discretization geometry to enhance scalability and stability in visual genera...
This paper explores the concept of long-tail knowledge in large language models (LLMs), analyzing its taxonomy, mechanisms of loss, and i...
This paper presents a novel framework for partial identification of population quantities under missing data, utilizing weak shadow varia...
The paper presents SIT-LMPC, a novel algorithm for safe information-theoretic learning model predictive control tailored for robots perfo...
The paper presents REMUL, a multi-party reinforcement learning approach that enhances the faithfulness of reasoning in large language mod...
The paper discusses the phenomenon of 'Retrieval Collapse,' where AI-generated content dominates search results, leading to a decline in ...
This article discusses Visual Memory Injection (VMI) attacks on large vision-language models (LVLMs) in multi-turn conversations, highlig...
The paper presents FedGraph-AGI, a federated learning framework designed to enhance cross-border insider threat detection in government f...
The paper presents ScenicRules, a benchmark for evaluating autonomous driving systems that balances multiple objectives like safety and e...
This article investigates the mental state reasoning of language models (LMs) using 41 open-weight models, revealing insights into their ...
This article presents a randomized controlled trial (RCT) examining scalable prompting interventions in a CS1 course, highlighting the im...
This paper presents Quality-constrained Entropy Maximization Policy Optimization (QEMPO), a method to enhance diversity in large language...
This article presents a novel approach to UAV search operations in post-disaster scenarios, addressing the challenges posed by Non-Line-o...
The paper presents MedProbCLIP, a probabilistic framework for enhancing the reliability of radiograph-report retrieval using vision-langu...
ReLoop introduces a structured approach to improve the reliability of LLM-generated optimization code by addressing silent failures throu...
This article presents a scoping review of dataset documentation tools, analyzing motivations behind their design and factors affecting th...
This paper presents GPEReg-Net, a novel framework for improving image registration in bidirectional photoacoustic microscopy by disentang...
This paper explores the design choices of Model Context Protocols (MCPs) and introduces Code Execution MCP (CE-MCP) as a solution to scal...
This paper discusses the importance of causality in interpretability research for large language models, highlighting pitfalls in general...
The paper presents a method for assessing privacy vulnerability in machine learning models using a generalized leverage score, enabling e...