Natural Language Processing

Text understanding and language tasks

Top This Week

Llms

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min ·
Nlp

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[D] ICML 2026 Average Score

Hi all, I’m curious about the current review dynamics for ICML 2026, especially after the rebuttal phase. For those who are reviewers (or...

Reddit - Machine Learning · 1 min ·

All Content

[2512.01865] Cross-Lingual Interleaving for Speech Language Models
Llms

[2512.01865] Cross-Lingual Interleaving for Speech Language Models

The paper presents a novel cross-lingual interleaving method for Speech Language Models (SLMs), enhancing multilingual understanding and ...

arXiv - AI · 3 min ·
[2512.07805] Group Representational Position Encoding
Ai Safety

[2512.07805] Group Representational Position Encoding

The paper introduces GRAPE (Group Representational Position Encoding), a framework for positional encoding that integrates multiplicative...

arXiv - Machine Learning · 4 min ·
[2512.04388] Learning to Orchestrate Agents in Natural Language with the Conductor
Llms

[2512.04388] Learning to Orchestrate Agents in Natural Language with the Conductor

The paper introduces the Conductor model, which utilizes reinforcement learning to optimize coordination strategies among large language ...

arXiv - Machine Learning · 4 min ·
[2511.19269] CDLM: Consistency Diffusion Language Models For Faster Sampling
Llms

[2511.19269] CDLM: Consistency Diffusion Language Models For Faster Sampling

The paper introduces Consistency Diffusion Language Models (CDLM), a method that accelerates inference in diffusion language models by re...

arXiv - Machine Learning · 3 min ·
[2510.19675] Study of Training Dynamics for Memory-Constrained Fine-Tuning
Machine Learning

[2510.19675] Study of Training Dynamics for Memory-Constrained Fine-Tuning

This study presents TraDy, a novel transfer learning scheme for memory-constrained fine-tuning of deep neural networks, achieving state-o...

arXiv - Machine Learning · 3 min ·
[2510.08570] Who Said Neural Networks Aren't Linear?
Machine Learning

[2510.08570] Who Said Neural Networks Aren't Linear?

This paper explores the linearity of neural networks by introducing a framework that identifies non-standard vector spaces where neural n...

arXiv - Machine Learning · 4 min ·
[2507.10587] Anthropomimetic Uncertainty: What Verbalized Uncertainty in Language Models is Missing
Llms

[2507.10587] Anthropomimetic Uncertainty: What Verbalized Uncertainty in Language Models is Missing

The paper discusses the concept of anthropomimetic uncertainty in language models, emphasizing the need for these models to express confi...

arXiv - AI · 4 min ·
[2505.20674] PonderLM: Pretraining Language Models to Ponder in Continuous Space
Llms

[2505.20674] PonderLM: Pretraining Language Models to Ponder in Continuous Space

PonderLM introduces a novel approach to language model training by incorporating a 'pondering' phase, enhancing cognitive processing duri...

arXiv - AI · 4 min ·
[2504.21022] ConformalNL2LTL: Translating Natural Language Instructions into Temporal Logic Formulas with Conformal Correctness Guarantees
Nlp

[2504.21022] ConformalNL2LTL: Translating Natural Language Instructions into Temporal Logic Formulas with Conformal Correctness Guarantees

The paper presents ConformalNL2LTL, a novel method for translating natural language instructions into Linear Temporal Logic (LTL) formula...

arXiv - Machine Learning · 4 min ·
[2508.15637] Classification errors distort findings in automated speech processing: examples and solutions from child-development research
Machine Learning

[2508.15637] Classification errors distort findings in automated speech processing: examples and solutions from child-development research

This paper discusses how classification errors in automated speech processing can distort findings in child-development research, proposi...

arXiv - Machine Learning · 4 min ·
[2504.17311] FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation
Llms

[2504.17311] FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation

FLUKE introduces a novel framework for evaluating the robustness of NLP models through controlled linguistic variations, revealing task-d...

arXiv - AI · 4 min ·
[2503.18980] CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning
Nlp

[2503.18980] CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning

The paper introduces CAE, a novel approach in deep reinforcement learning that repurposes value networks to enhance exploration efficienc...

arXiv - Machine Learning · 3 min ·
[2408.07238] Beyond Mimicry to Contextual Guidance: Knowledge Distillation for Interactive AI
Llms

[2408.07238] Beyond Mimicry to Contextual Guidance: Knowledge Distillation for Interactive AI

This article presents a novel approach to knowledge distillation for interactive AI, emphasizing contextual guidance over simple output i...

arXiv - Machine Learning · 4 min ·
[2505.18150] Generative Distribution Embeddings: Lifting autoencoders to the space of distributions for multiscale representation learning
Machine Learning

[2505.18150] Generative Distribution Embeddings: Lifting autoencoders to the space of distributions for multiscale representation learning

The paper introduces Generative Distribution Embeddings (GDE), a novel framework that enhances autoencoders for multiscale representation...

arXiv - Machine Learning · 4 min ·
[2505.14825] Assimilative Causal Inference
Machine Learning

[2505.14825] Assimilative Causal Inference

The paper presents Assimilative Causal Inference (ACI), a novel framework that utilizes Bayesian data assimilation to identify dynamic ca...

arXiv - Machine Learning · 4 min ·
[2504.02922] Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning
Machine Learning

[2504.02922] Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning

This article discusses advancements in model diffing using crosscoders to better interpret changes in AI models during chat-tuning, addre...

arXiv - Machine Learning · 4 min ·
[2601.09282] Cluster Workload Allocation: Semantic Soft Affinity Using Natural Language Processing
Llms

[2601.09282] Cluster Workload Allocation: Semantic Soft Affinity Using Natural Language Processing

This paper presents a novel semantic scheduling paradigm for cluster workload allocation using Natural Language Processing, enhancing usa...

arXiv - Machine Learning · 4 min ·
[2512.24008] SPARK: Search Personalization via Agent-Driven Retrieval and Knowledge-sharing
Llms

[2512.24008] SPARK: Search Personalization via Agent-Driven Retrieval and Knowledge-sharing

The paper presents SPARK, a framework for personalized search using agent-driven retrieval and knowledge-sharing, enhancing user experien...

arXiv - AI · 4 min ·
[2511.10164] Two Constraint Compilation Methods for Lifted Planning
Nlp

[2511.10164] Two Constraint Compilation Methods for Lifted Planning

This paper presents two innovative constraint compilation methods for lifted planning in AI, addressing scalability issues in existing co...

arXiv - AI · 3 min ·
[2602.18364] Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings
Llms

[2602.18364] Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings

This paper presents a novel perspective on in-context learning in large language models (LLMs) through the lens of quantum mechanics, pro...

arXiv - Machine Learning · 3 min ·
Previous Page 94 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime