Natural Language Processing

Text understanding and language tasks

Top This Week

LLMs

[R] 94.42% on BANKING77 Official Test Split with Lightweight Embedding + Example Reranking (strict full-train protocol)

BANKING77 (77 fine-grained banking intents) is a well-established but increasingly saturated intent classification benchmark. did this wh...

Reddit - Machine Learning · 1 min · about 4 hours ago

LLMs

94.42% on BANKING77 Official Test Split — New Strong 2nd Place with Lightweight Embedding + Rerank (no 7B LLM)

94.42% Accuracy on Banking77 Official Test Split BANKING77-77 is deceptively hard: 77 fine-grained banking intents, noisy real-world quer...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

NLP

Built a Hybrid NAS tool for RNN architectures (HyNAS-R) – Looking for feedback for my final year evaluation [R]

Hi everyone, I'm currently in the evaluation phase of my Final Year Project and am looking for feedback on the system I've built. It's ca...

Reddit - Machine Learning · 1 min · about 6 hours ago

All Content

LLMs

[2602.12714] ADEPT: RL-Aligned Agentic Decoding of Emotion via Evidence Probing Tools -- From Consensus Learning to Ambiguity-Driven Emotion Reasoning

The paper introduces ADEPT, a novel framework for emotion recognition that enhances accuracy by integrating acoustic evidence and multi-t...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.12693] Leverage-Weighted Conformal Prediction

The paper introduces Leverage-Weighted Conformal Prediction (LWCP), a method that enhances prediction intervals by adapting to variance w...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12567] Fractional Order Federated Learning for Battery Electric Vehicle Energy Consumption Modeling

This article presents a novel approach to federated learning for Battery Electric Vehicles (BEVs) using Fractional-Order Roughness-Inform...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.12533] AMPS: Adaptive Modality Preference Steering via Functional Entropy

The paper presents AMPS, a method for Adaptive Modality Preference Steering in Multimodal Large Language Models (MLLMs), addressing the c...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.12529] Flow-Factory: A Unified Framework for Reinforcement Learning in Flow-Matching Models

Flow-Factory presents a unified framework for reinforcement learning in flow-matching models, addressing fragmentation and complexity in ...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.12520] Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings

This paper presents a novel framework for multi-agent model-based reinforcement learning, integrating joint state-action representation l...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.12468] Continuous Diffusion Models Can Obey Formal Syntax

The paper introduces a method for guiding continuous diffusion models to adhere to formal syntactic constraints, achieving high constrain...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.11287] HiFloat4 Format for Language Model Inference

The paper introduces HiFloat4, a block floating-point format designed for deep learning, enhancing efficiency in language model inference...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.10388] Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

The paper introduces a novel metric, Feature Activation Coverage (FAC), to measure data diversity in large language models (LLMs) and pre...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.08676] LLaDA2.1: Speeding Up Text Diffusion via Token Editing

LLaDA2.1 introduces a novel approach to text diffusion by integrating Token-to-Token editing into the Mask-to-Token scheme, enhancing bot...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2602.08543] GISA: A Benchmark for General Information-Seeking Assistant

The paper introduces GISA, a benchmark designed for evaluating General Information-Seeking Assistants, addressing limitations in existing...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.07954] Bielik Guard: Efficient Polish Language Safety Classifiers for LLM Content Moderation

The Bielik Guard presents efficient Polish language classifiers for moderating content in large language models, achieving high precision...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.00020] Beyond Static Question Banks: Dynamic Knowledge Expansion via LLM-Automated Graph Construction and Adaptive Generation

This paper presents a framework for dynamic knowledge expansion in personalized education, utilizing LLMs for automated graph constructio...

arXiv - AI · 4 min · about 2 months ago

NLP

[2601.09605] Sim2real Image Translation Enables Viewpoint-Robust Policies from Fixed-Camera Datasets

The paper presents MANGO, a novel image translation method that enhances viewpoint robustness in robot manipulation policies using fixed-...

arXiv - AI · 4 min · about 2 months ago

NLP

[2512.18080] From Prompt to Product: A Human-Centered Benchmark of Agentic App Generation Systems

This paper introduces a human-centered benchmark for evaluating agentic app generation systems, comparing platforms like Replit, Bolt, an...

arXiv - AI · 4 min · about 2 months ago

NLP

[2512.15891] Dynamical Mechanisms for Coordinating Long-term Working Memory Based on the Precision of Spike-timing in Cortical Neurons

The article explores the mechanisms of long-term working memory in cortical neurons, emphasizing the role of spike-timing precision in co...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2511.10453] Reasoning about Intent for Ambiguous Requests

This paper explores how large language models can better handle ambiguous requests by generating multiple interpretation-answer pairs, en...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2510.22747] Low-Resource Dialect Adaptation of Large Language Models: A French Dialect Case-Study

This article explores the adaptation of large language models (LLMs) for low-resource dialects, focusing on the Québec French dialect usi...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2509.19852] Eliminating stability hallucinations in llm-based tts models via attention guidance

This paper addresses stability hallucinations in LLM-based TTS models by enhancing attention mechanisms, proposing a new alignment metric...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2508.12685] ToolACE-MT: Non-Autoregressive Generation for Agentic Multi-Turn Interaction

ToolACE-MT introduces a non-autoregressive framework for generating high-quality multi-turn dialogues in agentic interactions, enhancing ...

arXiv - Machine Learning · 3 min · about 2 months ago
