Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[R] 94.42% on BANKING77 Official Test Split with Lightweight Embedding + Example Reranking (strict full-train protocol)

BANKING77 (77 fine-grained banking intents) is a well-established but increasingly saturated intent classification benchmark. did this wh...

Reddit - Machine Learning · 1 min · 24 minutes ago

Llms

94.42% on BANKING77 Official Test Split — New Strong 2nd Place with Lightweight Embedding + Rerank (no 7B LLM)

94.42% Accuracy on Banking77 Official Test Split BANKING77-77 is deceptively hard: 77 fine-grained banking intents, noisy real-world quer...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Nlp

Built a Hybrid NAS tool for RNN architectures (HyNAS-R) – Looking for feedback for my final year evaluation [R]

Hi everyone, I'm currently in the evaluation phase of my Final Year Project and am looking for feedback on the system I've built. It's ca...

Reddit - Machine Learning · 1 min · about 3 hours ago

All Content

Llms

[2602.13235] Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains

The paper introduces Lang2Act, a novel framework for enhancing visual reasoning in Vision-Language Models (VLMs) through self-emergent li...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13237] NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models

NL2LOGIC presents a novel framework for translating natural language into first-order logic using large language models, enhancing accura...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.13226] Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection

This paper presents VaryBalance, a novel framework for detecting text generated by large language models (LLMs), outperforming existing m...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.13224] A Geometric Taxonomy of Hallucinations in LLMs

This article presents a geometric taxonomy of hallucinations in large language models (LLMs), categorizing them into three types: unfaith...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.13215] When to Think Fast and Slow? AMOR: Entropy-Based Metacognitive Gate for Dynamic SSM-Attention Switching

The paper presents AMOR, an entropy-based metacognitive gate that enhances attention switching in state space models, improving efficienc...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

How AI is Transforming Document Processing and PDF Workflows

The article discusses how AI is revolutionizing document processing and PDF workflows, highlighting advancements in automation, accuracy,...

AI News - General · 10 min · about 2 months ago

Machine Learning

[R] Learning State-Tracking from Code Using Linear RNNs

This article discusses the use of linear RNNs for state-tracking tasks, particularly focusing on permutation composition and its implicat...

Reddit - Machine Learning · 1 min · about 2 months ago

Machine Learning

[D] Self-Reference Circuits in Transformers: Do Induction Heads Create De Se Beliefs?

This article explores how transformers process indexical language, focusing on self-reference circuits and their implications for underst...

Reddit - Machine Learning · 1 min · about 2 months ago

Machine Learning

Short Paper Reviews [R]

The article discusses the submission and review process for short papers in machine learning, focusing on the unique challenges and expec...

Reddit - Machine Learning · 1 min · about 2 months ago

Machine Learning

Izwi Update: Local Speaker Diarization, Forced Alignment, and better model support

Izwi has released significant updates, including local speaker diarization, forced alignment for accurate timestamps, and real-time strea...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Machine Learning

Are AI note taking apps overhyped right now?

The article discusses the current hype surrounding AI note-taking apps, questioning their effectiveness in real-world scenarios compared ...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Llms

[2602.12262] T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization

The paper presents T3D, a framework for enhancing few-step diffusion language models through trajectory self-distillation and direct disc...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.10538] Why Agentic Theorem Prover Works: A Statistical Provability Theory of Mathematical Reasoning Models

This article explores the effectiveness of agentic theorem provers through a statistical provability theory, analyzing their performance ...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.04884] Reinforced Attention Learning

The paper introduces Reinforced Attention Learning (RAL), a novel framework that optimizes internal attention distributions in multimodal...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2509.22876] HEART: Emotionally-Driven Test-Time Scaling of Language Models

The paper presents HEART, a framework that leverages emotional cues to enhance the reasoning capabilities of language models during test-...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2508.02872] Highlight & Summarize: RAG without the jailbreaks

The paper presents Highlight & Summarize (H&S), a novel design pattern for retrieval-augmented generation (RAG) systems that prevents jai...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2412.06014] Post-hoc Probabilistic Vision-Language Models

This article presents a novel approach to uncertainty estimation in vision-language models (VLMs) by proposing a post-hoc method that enh...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2410.03041] Minmax Trend Filtering: Generalizations of Total Variation Denoising via a Local Minmax/Maxmin Formula

The paper introduces Minmax Trend Filtering (MTF), a novel approach to Total Variation Denoising (TVD) that utilizes a local minmax/maxmi...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2312.17111] Online Tensor Inference

The paper presents a novel framework for online tensor inference, addressing the challenges of real-time data processing in applications ...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.11151] Diffusion-Pretrained Dense and Contextual Embeddings

The paper introduces pplx-embed, a family of multilingual embedding models utilizing diffusion-pretrained language models for enhanced re...

arXiv - Machine Learning · 3 min · about 2 months ago

Previous Page 127 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

[R] 94.42% on BANKING77 Official Test Split with Lightweight Embedding + Example Reranking (strict full-train protocol)

94.42% on BANKING77 Official Test Split — New Strong 2nd Place with Lightweight Embedding + Rerank (no 7B LLM)

Built a Hybrid NAS tool for RNN architectures (HyNAS-R) – Looking for feedback for my final year evaluation [R]

All Content

[2602.13235] Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains

[2602.13237] NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models

[2602.13226] Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection

[2602.13224] A Geometric Taxonomy of Hallucinations in LLMs

[2602.13215] When to Think Fast and Slow? AMOR: Entropy-Based Metacognitive Gate for Dynamic SSM-Attention Switching

How AI is Transforming Document Processing and PDF Workflows

[R] Learning State-Tracking from Code Using Linear RNNs

[D] Self-Reference Circuits in Transformers: Do Induction Heads Create De Se Beliefs?

Short Paper Reviews [R]

Izwi Update: Local Speaker Diarization, Forced Alignment, and better model support

Are AI note taking apps overhyped right now?

[2602.12262] T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization

[2602.10538] Why Agentic Theorem Prover Works: A Statistical Provability Theory of Mathematical Reasoning Models

[2602.04884] Reinforced Attention Learning

[2509.22876] HEART: Emotionally-Driven Test-Time Scaling of Language Models

[2508.02872] Highlight & Summarize: RAG without the jailbreaks

[2412.06014] Post-hoc Probabilistic Vision-Language Models

[2410.03041] Minmax Trend Filtering: Generalizations of Total Variation Denoising via a Local Minmax/Maxmin Formula

[2312.17111] Online Tensor Inference

[2602.11151] Diffusion-Pretrained Dense and Contextual Embeddings

Related Topics

Stay updated with AI News