Natural Language Processing

Text understanding and language tasks

Top This Week

NLP

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

These hypotheses are not meant to be believed. They are meant to be **held lig...

Reddit - Artificial Intelligence · 1 min ·
LLMs

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min ·
NLP

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2505.11235] Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation
Machine Learning

The paper presents Efficient Orthogonal Fine-Tuning with Principal Subspace Adaptation (PSOFT), a method that enhances parameter-efficien...

arXiv - Machine Learning · 4 min ·
[2502.14762] Unlocking [CLS] Features for Continual Post-Training
LLMs

The paper presents a novel approach to continual learning in machine learning models, introducing a parameter-efficient fine-tuning modul...

arXiv - Machine Learning · 4 min ·
[2409.12709] SeqRisk: Transformer-augmented latent variable model for robust survival prediction with longitudinal data
Machine Learning

SeqRisk introduces a transformer-augmented latent variable model for enhanced survival prediction using longitudinal healthcare data, add...

arXiv - Machine Learning · 3 min ·
[2602.17654] Mine and Refine: Optimizing Graded Relevance in E-commerce Search Retrieval
Machine Learning

The paper presents a two-stage framework called 'Mine and Refine' for optimizing graded relevance in e-commerce search retrieval, enhanci...

arXiv - Machine Learning · 4 min ·
[2602.17546] Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning
LLMs

This article presents a novel training framework for instruction-following language models that maintains safety during fine-tuning by ad...

arXiv - Machine Learning · 4 min ·
[2602.17445] ABCD: All Biases Come Disguised
LLMs

The paper 'ABCD: All Biases Come Disguised' explores biases in LLMs during multiple-choice question evaluations, proposing a new protocol...

arXiv - Machine Learning · 4 min ·
[2602.17287] Representation Collapse in Machine Translation Through the Lens of Angular Dispersion
Machine Learning

This paper explores representation collapse in neural machine translation models, particularly focusing on the Transformer architecture a...

arXiv - Machine Learning · 3 min ·
[2602.17187] Anti-causal domain generalization: Leveraging unlabeled data
Machine Learning

The paper explores anti-causal domain generalization, proposing methods to leverage unlabeled data for robust predictive modeling in vary...

arXiv - Machine Learning · 3 min ·
[2602.17104] Simplify to Amplify: Achieving Information-Theoretic Bounds with Fewer Steps in Spectral Community Detection
Machine Learning

This paper presents a streamlined spectral algorithm for community detection in the stochastic block model, achieving improved error boun...

arXiv - Machine Learning · 3 min ·
[2602.16961] Greedy Multi-Path Block Verification for Faster Decoding in Speculative Sampling
Machine Learning

This paper presents Greedy Multi-Path Block Verification (GBV), a method that enhances the efficiency of speculative decoding in machine ...

arXiv - Machine Learning · 4 min ·
[2602.16835] NeST: Neuron Selective Tuning for LLM Safety
LLMs

The paper introduces NeST, a novel framework for enhancing safety in large language models (LLMs) by selectively tuning a small subset of...

arXiv - Machine Learning · 4 min ·
[2602.16830] The Impact of Formations on Football Matches Using Double Machine Learning. Is it worth parking the bus?
Machine Learning

This study explores the impact of football formations on match outcomes using Double Machine Learning, questioning the effectiveness of d...

arXiv - Machine Learning · 4 min ·
[2602.09725] Efficient Remote Prefix Fetching with GPU-native Media ASICs
LLMs

The paper presents KVFetcher, a novel solution for efficient remote key-value (KV) cache reuse using GPU-native video codecs, significant...

arXiv - Machine Learning · 4 min ·
[2602.17525] Variational inference via radial transport
Machine Learning

The paper introduces a novel approach to variational inference (VI) by optimizing radial profiles, enhancing the approximation of high-di...

arXiv - Machine Learning · 3 min ·
[2602.17363] 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy
Machine Learning

The paper presents 2Mamba, a linear attention transformer variant that achieves competitive accuracy compared to softmax attention while ...

arXiv - Machine Learning · 3 min ·
[2602.17350] Shortcut learning in geometric knot classification
Machine Learning

This paper explores the application of machine learning to classify geometric knots, addressing the challenge of identifying equivalent e...

arXiv - Machine Learning · 4 min ·
[2602.17089] Synergizing Transport-Based Generative Models and Latent Geometry for Stochastic Closure Modeling
Machine Learning

This article presents a novel approach to stochastic closure modeling by integrating transport-based generative models with latent geomet...

arXiv - Machine Learning · 4 min ·
[2602.17050] Multi-Probe Zero Collision Hash (MPZCH): Mitigating Embedding Collisions and Enhancing Model Freshness in Large-Scale Recommenders
Machine Learning

The paper presents the Multi-Probe Zero Collision Hash (MPZCH), a novel indexing method that mitigates embedding collisions in large-scal...

arXiv - Machine Learning · 4 min ·
[2602.16994] Dynamic Delayed Tree Expansion For Improved Multi-Path Speculative Decoding
Machine Learning

This article presents a novel approach to multi-path speculative decoding in machine learning, introducing dynamic delayed tree expansion...

arXiv - Machine Learning · 4 min ·
[2602.16980] Discovering Universal Activation Directions for PII Leakage in Language Models
LLMs

The paper introduces UniLeak, a framework that identifies universal activation directions in language models, enhancing the understanding...

arXiv - Machine Learning · 3 min ·