Anyone else feel like AI security is being figured out in production right now?
I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...
Text understanding and language tasks
I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...
Hi all, I’m curious about the current review dynamics for ICML 2026, especially after the rebuttal phase. For those who are reviewers (or...
From the Macintosh to the iPhone to the iMac to the iPod, it’s hard to pick a best Apple product ever. But we tried to do so anyway.
The paper presents PaReGTA, an LLM-based framework for encoding temporal information in electronic health records (EHRs), enhancing patie...
This paper introduces Semantic Substrate Theory, an operator-theoretic framework that formalizes various signals of semantic drift, integ...
The paper presents Spectral Phase Encoding (SPE) for quantum kernel methods, analyzing their robustness against noise and comparing perfo...
This article evaluates how data anonymization affects the performance of Content-Based Image Retrieval (CBIR) systems, highlighting the b...
This article presents the Advantage-based Adversarial Transformer (AAT), a novel method for generating time-correlated adversarial exampl...
Luna-2 introduces a scalable architecture for single-token evaluation using small language models, enhancing accuracy and reducing costs ...
The paper presents VLANeXt, a framework for building effective Vision-Language-Action (VLA) models, addressing inconsistencies in trainin...
The paper critiques the reliance on softmax outputs in adaptive conformal classification, proposing a new method that utilizes pre-softma...
This article explores the application of conformal prediction methods in healthcare, specifically focusing on EEG seizure classification....
PIPE-RDF presents a novel pipeline for generating schema-specific NL-SPARQL benchmarks, enhancing RDF knowledge graph querying for enterp...
This paper defends cosine similarity in machine learning, arguing that normalization eliminates issues related to gauge freedom, thus ens...
This paper presents a method for enhancing stability in deep reinforcement learning by utilizing isotropic Gaussian representations, addr...
The article examines red teaming as a socio-technical practice in evaluating large language models (LLMs), highlighting the importance of...
AgentCAT is a large language model designed to extract and analyze catalytic reaction data from chemical engineering literature, addressi...
This paper presents FedPAC, a framework to enhance the stability and accuracy of second-order optimizers in federated learning on non-IID...
This article evaluates SAP's RPT-1 model for enterprise business process prediction, comparing its performance against traditional machin...
This article evaluates the reliability of persona-conditioned large language models (LLMs) as synthetic survey respondents, revealing tha...
This article discusses the shift from bias mitigation to bias negotiation in generative AI, emphasizing the need for ethical governance o...
The paper introduces Virtual Parameter Sharpening (VPS), a novel technique for enhancing inference-time reasoning in transformer models t...
This paper explores how transformers learn through incremental acquisition of sparse attention patterns, revealing shifts in learning dyn...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime