Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses These hypotheses are not meant to be believed. They are meant to be **held lig...

Reddit - Artificial Intelligence · 1 min ·
Llms

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min ·
Nlp

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.18002] Asynchronous Heavy-Tailed Optimization
Machine Learning

[2602.18002] Asynchronous Heavy-Tailed Optimization

This article explores asynchronous heavy-tailed optimization, addressing challenges in machine learning related to gradient noise and opt...

arXiv - Machine Learning · 3 min ·
[2602.18201] SOMtime the World Ain$'$t Fair: Violating Fairness Using Self-Organizing Maps
Machine Learning

[2602.18201] SOMtime the World Ain$'$t Fair: Violating Fairness Using Self-Organizing Maps

The paper explores the limitations of unsupervised learning methods, specifically Self-Organizing Maps (SOMs), in maintaining fairness by...

arXiv - Machine Learning · 4 min ·
[2602.18095] Neurosymbolic Language Reasoning as Satisfiability Modulo Theory
Llms

[2602.18095] Neurosymbolic Language Reasoning as Satisfiability Modulo Theory

This article presents Logitext, a neurosymbolic language that enhances natural language understanding by integrating large language model...

arXiv - AI · 3 min ·
[2602.18025] Cross-Embodiment Offline Reinforcement Learning for Heterogeneous Robot Datasets
Machine Learning

[2602.18025] Cross-Embodiment Offline Reinforcement Learning for Heterogeneous Robot Datasets

This article presents a novel approach to offline reinforcement learning by integrating cross-embodiment learning to enhance robot policy...

arXiv - AI · 3 min ·
[2602.17902] El Agente Gráfico: Structured Execution Graphs for Scientific Agents
Llms

[2602.17902] El Agente Gráfico: Structured Execution Graphs for Scientific Agents

The paper introduces El Agente Gráfico, a framework that enhances scientific workflows by integrating LLMs with structured execution grap...

arXiv - AI · 4 min ·
[2602.17972] Student Flow Modeling for School Decongestion via Stochastic Gravity Estimation and Constrained Spatial Allocation
Llms

[2602.17972] Student Flow Modeling for School Decongestion via Stochastic Gravity Estimation and Constrained Spatial Allocation

This article presents a computational framework for modeling student flow patterns to address school congestion in low- and middle-income...

arXiv - Machine Learning · 4 min ·
[2602.17826] Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge
Llms

[2602.17826] Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge

This article explores the integration of formal domain ontologies into language models to enhance their reliability in mathematical reaso...

arXiv - Machine Learning · 3 min ·
[2602.17947] Understanding the Generalization of Bilevel Programming in Hyperparameter Optimization: A Tale of Bias-Variance Decomposition
Nlp

[2602.17947] Understanding the Generalization of Bilevel Programming in Hyperparameter Optimization: A Tale of Bias-Variance Decomposition

This article explores the generalization of bilevel programming in hyperparameter optimization, focusing on bias-variance decomposition t...

arXiv - Machine Learning · 4 min ·
[2602.17867] ADAPT: Hybrid Prompt Optimization for LLM Feature Visualization
Llms

[2602.17867] ADAPT: Hybrid Prompt Optimization for LLM Feature Visualization

The paper presents ADAPT, a hybrid method for optimizing prompts in LLM feature visualization, addressing challenges in local minima and ...

arXiv - Machine Learning · 3 min ·
[2602.17835] Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning
Llms

[2602.17835] Influence-Preserving Proxies for Gradient-Based Data Selection in LLM Fine-tuning

The paper presents Iprox, a two-stage framework for gradient-based data selection in LLM fine-tuning, which constructs influence-preservi...

arXiv - Machine Learning · 4 min ·
[2602.17829] Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models
Llms

[2602.17829] Causality by Abstraction: Symbolic Rule Learning in Multivariate Timeseries with Large Language Models

This paper introduces ruleXplain, a framework utilizing Large Language Models to extract causal rules from multivariate timeseries data, ...

arXiv - Machine Learning · 4 min ·
[2602.17778] Asking Forever: Universal Activations Behind Turn Amplification in Conversational LLMs
Llms

[2602.17778] Asking Forever: Universal Activations Behind Turn Amplification in Conversational LLMs

This article explores a new failure mode in conversational LLMs known as turn amplification, where models prolong interactions without co...

arXiv - Machine Learning · 3 min ·
[2602.17698] ScaleBITS: Scalable Bitwidth Search for Hardware-Aligned Mixed-Precision LLMs
Llms

[2602.17698] ScaleBITS: Scalable Bitwidth Search for Hardware-Aligned Mixed-Precision LLMs

The paper presents ScaleBITS, a mixed-precision quantization framework designed to optimize bitwidth allocation in large language models,...

arXiv - Machine Learning · 3 min ·
[2602.17697] Pimp My LLM: Leveraging Variability Modeling to Tune Inference Hyperparameters
Llms

[2602.17697] Pimp My LLM: Leveraging Variability Modeling to Tune Inference Hyperparameters

This article introduces a novel approach to optimizing inference hyperparameters in Large Language Models (LLMs) using variability modeli...

arXiv - Machine Learning · 4 min ·
[2602.17695] EXACT: Explicit Attribute-Guided Decoding-Time Personalization
Llms

[2602.17695] EXACT: Explicit Attribute-Guided Decoding-Time Personalization

The paper presents EXACT, a novel approach for decoding-time personalization in large language models, enhancing user alignment through i...

arXiv - Machine Learning · 3 min ·
[2602.17694] AsynDBT: Asynchronous Distributed Bilevel Tuning for efficient In-Context Learning with Large Language Models
Llms

[2602.17694] AsynDBT: Asynchronous Distributed Bilevel Tuning for efficient In-Context Learning with Large Language Models

The paper presents AsynDBT, an innovative algorithm for asynchronous distributed bilevel tuning aimed at improving in-context learning wi...

arXiv - Machine Learning · 4 min ·
[2602.17692] Agentic Unlearning: When LLM Agent Meets Machine Unlearning
Llms

[2602.17692] Agentic Unlearning: When LLM Agent Meets Machine Unlearning

The paper introduces 'agentic unlearning,' a novel approach to remove sensitive information from both model parameters and memory in AI a...

arXiv - Machine Learning · 3 min ·
[2602.17689] Robust Pre-Training of Medical Vision-and-Language Models with Domain-Invariant Multi-Modal Masked Reconstruction
Llms

[2602.17689] Robust Pre-Training of Medical Vision-and-Language Models with Domain-Invariant Multi-Modal Masked Reconstruction

This article presents Robust Multi-Modal Masked Reconstruction (Robust-MMR), a novel self-supervised pre-training framework for medical v...

arXiv - Machine Learning · 4 min ·
[2602.17688] AnCoder: Anchored Code Generation via Discrete Diffusion Models
Llms

[2602.17688] AnCoder: Anchored Code Generation via Discrete Diffusion Models

The paper presents AnCoder, a novel framework for code generation using discrete diffusion models, emphasizing structured programming lan...

arXiv - Machine Learning · 3 min ·
[2602.17684] CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models
Llms

[2602.17684] CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

The paper presents CodeScaler, an execution-free reward model that enhances the scalability of code LLM training and test-time inference,...

arXiv - Machine Learning · 4 min ·
Previous Page 97 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime