Natural Language Processing

Text understanding and language tasks

Top This Week

NLP

Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

These hypotheses are not meant to be believed. They are meant to be **held lig...

Reddit - Artificial Intelligence · 1 min ·
LLMs

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min ·
NLP

Anyone else feel like AI security is being figured out in production right now?

I've been digging into AI security incident data from 2025 into this year, and it feels like something isn't being talked about enough ou...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2503.23339] A Scalable Framework for Evaluating Health Language Models
LLMs

This paper presents a scalable framework for evaluating health language models, introducing Adaptive Precise Boolean rubrics to enhance e...

arXiv - AI · 4 min ·
[2503.17338] Capturing Individual Human Preferences with Reward Features
LLMs

The paper discusses a new approach to modeling individual human preferences in reinforcement learning, emphasizing the need for adaptive ...

arXiv - Machine Learning · 4 min ·
[2410.13957] Goal Inference from Open-Ended Dialog
LLMs

The paper discusses a method for embodied AI agents to infer user goals from open-ended dialogues using Large Language Models (LLMs), emp...

arXiv - Machine Learning · 4 min ·
[2602.17641] FAMOSE: A ReAct Approach to Automated Feature Discovery
Machine Learning

The paper presents FAMOSE, a novel framework that utilizes the ReAct paradigm for automated feature discovery in machine learning, enhanc...

arXiv - AI · 4 min ·
[2602.17598] The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASR$\rightarrow$LLM Pipelines?
LLMs

The paper explores the Cascade Equivalence Hypothesis, examining when speech LLMs function similarly to ASR→LLM pipelines. It highlights ...

arXiv - AI · 3 min ·
[2602.17526] The Anxiety of Influence: Bloom Filters in Transformer Attention Heads
LLMs

This article explores how certain transformer attention heads act as membership testers, identifying token repetition across various lang...

arXiv - AI · 4 min ·
[2602.17510] LORA-CRAFT: Cross-layer Rank Adaptation via Frozen Tucker Decomposition of Pre-trained Attention Weights
Machine Learning

The paper presents LORA-CRAFT, a novel parameter-efficient fine-tuning method that utilizes Tucker tensor decomposition on pre-trained at...

arXiv - AI · 4 min ·
[2602.17450] Beyond Pipelines: A Fundamental Study on the Rise of Generative-Retrieval Architectures in Web Research
LLMs

This paper explores the evolution of web research through generative-retrieval architectures, highlighting the transformative impact of l...

arXiv - AI · 3 min ·
[2602.17431] Fine-Grained Uncertainty Quantification for Long-Form Language Model Outputs: A Comparative Study
LLMs

This study presents a taxonomy for fine-grained uncertainty quantification in long-form language model outputs, highlighting effective me...

arXiv - Machine Learning · 3 min ·
[2602.17395] SpectralGCD: Spectral Concept Selection and Cross-modal Representation Learning for Generalized Category Discovery
Machine Learning

The paper presents SpectralGCD, a novel approach for Generalized Category Discovery (GCD) that enhances multimodal learning by efficientl...

arXiv - Machine Learning · 4 min ·
[2602.17394] Voice-Driven Semantic Perception for UAV-Assisted Emergency Networks
AI Agents

The paper presents SIREN, an AI framework for enhancing UAV-assisted emergency networks by converting voice communications into structure...

arXiv - AI · 4 min ·
[2602.17330] SubQuad: Near-Quadratic-Free Structure Inference with Distribution-Balanced Objectives in Adaptive Receptor framework
Machine Learning

The paper presents SubQuad, an innovative pipeline for analyzing adaptive immune repertoires, addressing challenges of high computational...

arXiv - Machine Learning · 3 min ·
[2602.17327] WebFAQ 2.0: A Multilingual QA Dataset with Mined Hard Negatives for Dense Retrieval
NLP

WebFAQ 2.0 introduces a multilingual QA dataset with 198 million FAQ-based question-answer pairs across 108 languages, enhancing multilin...

arXiv - AI · 4 min ·
[2602.17316] Same Meaning, Different Scores: Lexical and Syntactic Sensitivity in LLM Evaluation
LLMs

This paper investigates how lexical and syntactic variations affect the evaluation of Large Language Models (LLMs), revealing significant...

arXiv - AI · 3 min ·
[2602.17283] Towards Cross-lingual Values Assessment: A Consensus-Pluralism Perspective
LLMs

This article presents X-Value, a new benchmark for assessing cross-lingual values in large language models (LLMs), highlighting their lim...

arXiv - AI · 4 min ·
[2602.17185] The Bots of Persuasion: Examining How Conversational Agents' Linguistic Expressions of Personality Affect User Perceptions and Decisions
LLMs

This article explores how the linguistic expressions of personality in conversational agents (CAs) influence user perceptions and decisio...

arXiv - AI · 4 min ·
[2602.17171] In-Context Learning in Linear vs. Quadratic Attention Models: An Empirical Study on Regression Tasks
Machine Learning

This study compares in-context learning (ICL) performance between linear and quadratic attention models on regression tasks, highlighting...

arXiv - AI · 3 min ·
[2602.17063] Sign Lock-In: Randomly Initialized Weight Signs Persist and Bottleneck Sub-Bit Model Compression
Machine Learning

The paper discusses 'Sign Lock-In,' a phenomenon in machine learning where randomly initialized weight signs persist during model trainin...

arXiv - AI · 3 min ·
[2602.17054] ALPS: A Diagnostic Challenge Set for Arabic Linguistic & Pragmatic Reasoning
NLP

The paper introduces ALPS, a diagnostic challenge set designed to evaluate Arabic linguistic and pragmatic reasoning, highlighting the li...

arXiv - AI · 4 min ·
[2602.17051] Evaluating Cross-Lingual Classification Approaches Enabling Topic Discovery for Multilingual Social Media Data
NLP

This article evaluates various cross-lingual classification methods for analyzing multilingual social media data, focusing on topic disco...

arXiv - Machine Learning · 4 min ·