Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Nlp

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses These hypotheses are not meant to be believed. They are meant to be **held lig...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

Llms

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min · about 13 hours ago

Nlp

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min · about 17 hours ago

All Content

Machine Learning

[2505.11235] Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation

The paper presents Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation (PSOFT), a method that enhances parameter-efficien...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2502.14762] Unlocking [CLS] Features for Continual Post-Training

The paper presents a novel approach to continual learning in machine learning models, introducing a parameter-efficient fine-tuning modul...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2409.12709] SeqRisk: Transformer-augmented latent variable model for robust survival prediction with longitudinal data

SeqRisk introduces a transformer-augmented latent variable model for enhanced survival prediction using longitudinal healthcare data, add...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17654] Mine and Refine: Optimizing Graded Relevance in E-commerce Search Retrieval

The paper presents a two-stage framework called 'Mine and Refine' for optimizing graded relevance in e-commerce search retrieval, enhanci...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17546] Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning

This article presents a novel training framework for instruction-following language models that maintains safety during fine-tuning by ad...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.17445] ABCD: All Biases Come Disguised

The paper 'ABCD: All Biases Come Disguised' explores biases in LLMs during multiple-choice question evaluations, proposing a new protocol...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17287] Representation Collapse in Machine Translation Through the Lens of Angular Dispersion

This paper explores representation collapse in neural machine translation models, particularly focusing on the Transformer architecture a...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17187] Anti-causal domain generalization: Leveraging unlabeled data

The paper explores anti-causal domain generalization, proposing methods to leverage unlabeled data for robust predictive modeling in vary...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17104] Simplify to Amplify: Achieving Information-Theoretic Bounds with Fewer Steps in Spectral Community Detection

This paper presents a streamlined spectral algorithm for community detection in the stochastic block model, achieving improved error boun...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.16961] Greedy Multi-Path Block Verification for Faster Decoding in Speculative Sampling

This paper presents Greedy Multi-Path Block Verification (GBV), a method that enhances the efficiency of speculative decoding in machine ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.16835] NeST: Neuron Selective Tuning for LLM Safety

The paper introduces NeST, a novel framework for enhancing safety in large language models (LLMs) by selectively tuning a small subset of...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.16830] The Impact of Formations on Football Matches Using Double Machine Learning. Is it worth parking the bus?

This study explores the impact of football formations on match outcomes using Double Machine Learning, questioning the effectiveness of d...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.09725] Efficient Remote Prefix Fetching with GPU-native Media ASICs

The paper presents KVFetcher, a novel solution for efficient remote key-value (KV) cache reuse using GPU-native video codecs, significant...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17525] Variational inference via radial transport

The paper introduces a novel approach to variational inference (VI) by optimizing radial profiles, enhancing the approximation of high-di...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17363] 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

The paper presents 2Mamba, a linear attention transformer variant that achieves competitive accuracy compared to softmax attention while ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17350] Shortcut learning in geometric knot classification

This paper explores the application of machine learning to classify geometric knots, addressing the challenge of identifying equivalent e...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17089] Synergizing Transport-Based Generative Models and Latent Geometry for Stochastic Closure Modeling

This article presents a novel approach to stochastic closure modeling by integrating transport-based generative models with latent geomet...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17050] Multi-Probe Zero Collision Hash (MPZCH): Mitigating Embedding Collisions and Enhancing Model Freshness in Large-Scale Recommenders

The paper presents the Multi-Probe Zero Collision Hash (MPZCH), a novel indexing method that mitigates embedding collisions in large-scal...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.16994] Dynamic Delayed Tree Expansion For Improved Multi-Path Speculative Decoding

This article presents a novel approach to multi-path speculative decoding in machine learning, introducing dynamic delayed tree expansion...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.16980] Discovering Universal Activation Directions for PII Leakage in Language Models

The paper introduces UniLeak, a framework that identifies universal activation directions in language models, enhancing the understanding...

arXiv - Machine Learning · 3 min · about 1 month ago

Previous Page 99 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

[P] Remote sensing foundation models made easy to use.

Anyone else feel like AI security is being figured out in production right now?

All Content

[2505.11235] Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation

[2502.14762] Unlocking [CLS] Features for Continual Post-Training

[2409.12709] SeqRisk: Transformer-augmented latent variable model for robust survival prediction with longitudinal data

[2602.17654] Mine and Refine: Optimizing Graded Relevance in E-commerce Search Retrieval

[2602.17546] Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning

[2602.17445] ABCD: All Biases Come Disguised

[2602.17287] Representation Collapse in Machine Translation Through the Lens of Angular Dispersion

[2602.17187] Anti-causal domain generalization: Leveraging unlabeled data

[2602.17104] Simplify to Amplify: Achieving Information-Theoretic Bounds with Fewer Steps in Spectral Community Detection

[2602.16961] Greedy Multi-Path Block Verification for Faster Decoding in Speculative Sampling

[2602.16835] NeST: Neuron Selective Tuning for LLM Safety

[2602.16830] The Impact of Formations on Football Matches Using Double Machine Learning. Is it worth parking the bus?

[2602.09725] Efficient Remote Prefix Fetching with GPU-native Media ASICs

[2602.17525] Variational inference via radial transport

[2602.17363] 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

[2602.17350] Shortcut learning in geometric knot classification

[2602.17089] Synergizing Transport-Based Generative Models and Latent Geometry for Stochastic Closure Modeling

[2602.17050] Multi-Probe Zero Collision Hash (MPZCH): Mitigating Embedding Collisions and Enhancing Model Freshness in Large-Scale Recommenders

[2602.16994] Dynamic Delayed Tree Expansion For Improved Multi-Path Speculative Decoding

[2602.16980] Discovering Universal Activation Directions for PII Leakage in Language Models

Related Topics

Stay updated with AI News