Natural Language Processing

Text understanding and language tasks

Top This Week

NLP

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

These hypotheses are not meant to be believed. They are meant to be **held lig...

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

LLMs

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min · about 17 hours ago

NLP

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min · about 20 hours ago

All Content

LLMs

[2503.23339] A Scalable Framework for Evaluating Health Language Models

This paper presents a scalable framework for evaluating health language models, introducing Adaptive Precise Boolean rubrics to enhance e...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2503.17338] Capturing Individual Human Preferences with Reward Features

The paper discusses a new approach to modeling individual human preferences in reinforcement learning, emphasizing the need for adaptive ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2410.13957] Goal Inference from Open-Ended Dialog

The paper discusses a method for embodied AI agents to infer user goals from open-ended dialogues using Large Language Models (LLMs), emp...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17641] FAMOSE: A ReAct Approach to Automated Feature Discovery

The paper presents FAMOSE, a novel framework that utilizes the ReAct paradigm for automated feature discovery in machine learning, enhanc...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.17598] The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR→LLM Pipelines?

The paper explores the Cascade Equivalence Hypothesis, examining when speech LLMs function similarly to ASR→LLM pipelines. It highlights ...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.17526] The Anxiety of Influence: Bloom Filters in Transformer Attention Heads

This article explores how certain transformer attention heads act as membership testers, identifying token repetition across various lang...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.17510] LORA-CRAFT: Cross-layer Rank Adaptation via Frozen Tucker Decomposition of Pre-trained Attention Weights

The paper presents LORA-CRAFT, a novel parameter-efficient fine-tuning method that utilizes Tucker tensor decomposition on pre-trained at...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.17450] Beyond Pipelines: A Fundamental Study on the Rise of Generative-Retrieval Architectures in Web Research

This paper explores the evolution of web research through generative-retrieval architectures, highlighting the transformative impact of l...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.17431] Fine-Grained Uncertainty Quantification for Long-Form Language Model Outputs: A Comparative Study

This study presents a taxonomy for fine-grained uncertainty quantification in long-form language model outputs, highlighting effective me...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17395] SpectralGCD: Spectral Concept Selection and Cross-modal Representation Learning for Generalized Category Discovery

The paper presents SpectralGCD, a novel approach for Generalized Category Discovery (GCD) that enhances multimodal learning by efficientl...

arXiv - Machine Learning · 4 min · about 1 month ago

AI Agents

[2602.17394] Voice-Driven Semantic Perception for UAV-Assisted Emergency Networks

The paper presents SIREN, an AI framework for enhancing UAV-assisted emergency networks by converting voice communications into structure...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.17330] SubQuad: Near-Quadratic-Free Structure Inference with Distribution-Balanced Objectives in Adaptive Receptor framework

The paper presents SubQuad, an innovative pipeline for analyzing adaptive immune repertoires, addressing challenges of high computational...

arXiv - Machine Learning · 3 min · about 1 month ago

NLP

[2602.17327] WebFAQ 2.0: A Multilingual QA Dataset with Mined Hard Negatives for Dense Retrieval

WebFAQ 2.0 introduces a multilingual QA dataset with 198 million FAQ-based question-answer pairs across 108 languages, enhancing multilin...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.17316] Same Meaning, Different Scores: Lexical and Syntactic Sensitivity in LLM Evaluation

This paper investigates how lexical and syntactic variations affect the evaluation of Large Language Models (LLMs), revealing significant...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.17283] Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective

This article presents X-Value, a new benchmark for assessing cross-lingual values in large language models (LLMs), highlighting their lim...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.17185] The Bots of Persuasion: Examining How Conversational Agents' Linguistic Expressions of Personality Affect User Perceptions and Decisions

This article explores how the linguistic expressions of personality in conversational agents (CAs) influence user perceptions and decisio...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.17171] In-Context Learning in Linear vs. Quadratic Attention Models: An Empirical Study on Regression Tasks

This study compares in-context learning (ICL) performance between linear and quadratic attention models on regression tasks, highlighting...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.17063] Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression

The paper discusses 'Sign Lock-In,' a phenomenon in machine learning where randomly initialized weight signs persist during model trainin...

arXiv - AI · 3 min · about 1 month ago

NLP

[2602.17054] ALPS: A Diagnostic Challenge Set for Arabic Linguistic & Pragmatic Reasoning

The paper introduces ALPS, a diagnostic challenge set designed to evaluate Arabic linguistic and pragmatic reasoning, highlighting the li...

arXiv - AI · 4 min · about 1 month ago

NLP

[2602.17051] Evaluating Cross-Lingual Classification Approaches Enabling Topic Discovery for Multilingual Social Media Data

This article evaluates various cross-lingual classification methods for analyzing multilingual social media data, focusing on topic disco...

arXiv - Machine Learning · 4 min · about 1 month ago
