Natural Language Processing

Text understanding and language tasks

Top This Week

LLMs

[R] 94.42% on BANKING77 Official Test Split with Lightweight Embedding + Example Reranking (strict full-train protocol)

BANKING77 (77 fine-grained banking intents) is a well-established but increasingly saturated intent classification benchmark. did this wh...

Reddit - Machine Learning · 1 min ·
LLMs

94.42% on BANKING77 Official Test Split — New Strong 2nd Place with Lightweight Embedding + Rerank (no 7B LLM)

94.42% Accuracy on Banking77 Official Test Split BANKING77-77 is deceptively hard: 77 fine-grained banking intents, noisy real-world quer...

Reddit - Artificial Intelligence · 1 min ·
NLP

Built a Hybrid NAS tool for RNN architectures (HyNAS-R) – Looking for feedback for my final year evaluation [R]

Hi everyone, I'm currently in the evaluation phase of my Final Year Project and am looking for feedback on the system I've built. It's ca...

Reddit - Machine Learning · 1 min ·

All Content

LLMs

[2602.13235] Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains

The paper introduces Lang2Act, a novel framework for enhancing visual reasoning in Vision-Language Models (VLMs) through self-emergent li...

arXiv - AI · 4 min ·
LLMs

[2602.13237] NL2LOGIC: AST-Guided Translation of Natural Language into First-Order Logic with Large Language Models

NL2LOGIC presents a novel framework for translating natural language into first-order logic using large language models, enhancing accura...

arXiv - AI · 4 min ·
LLMs

[2602.13226] Variation is the Key: A Variation-Based Framework for LLM-Generated Text Detection

This paper presents VaryBalance, a novel framework for detecting text generated by large language models (LLMs), outperforming existing m...

arXiv - AI · 3 min ·
LLMs

[2602.13224] A Geometric Taxonomy of Hallucinations in LLMs

This article presents a geometric taxonomy of hallucinations in large language models (LLMs), categorizing them into three types: unfaith...

arXiv - AI · 3 min ·
Machine Learning

[2602.13215] When to Think Fast and Slow? AMOR: Entropy-Based Metacognitive Gate for Dynamic SSM-Attention Switching

The paper presents AMOR, an entropy-based metacognitive gate that enhances attention switching in state space models, improving efficienc...

arXiv - AI · 3 min ·
Machine Learning

How AI is Transforming Document Processing and PDF Workflows

The article discusses how AI is revolutionizing document processing and PDF workflows, highlighting advancements in automation, accuracy,...

AI News - General · 10 min ·
Machine Learning

[R] Learning State-Tracking from Code Using Linear RNNs

This article discusses the use of linear RNNs for state-tracking tasks, particularly focusing on permutation composition and its implicat...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] Self-Reference Circuits in Transformers: Do Induction Heads Create De Se Beliefs?

This article explores how transformers process indexical language, focusing on self-reference circuits and their implications for underst...

Reddit - Machine Learning · 1 min ·
Machine Learning

Short Paper Reviews [R]

The article discusses the submission and review process for short papers in machine learning, focusing on the unique challenges and expec...

Reddit - Machine Learning · 1 min ·
Machine Learning

Izwi Update: Local Speaker Diarization, Forced Alignment, and better model support

Izwi has released significant updates, including local speaker diarization, forced alignment for accurate timestamps, and real-time strea...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Are AI note taking apps overhyped right now?

The article discusses the current hype surrounding AI note-taking apps, questioning their effectiveness in real-world scenarios compared ...

Reddit - Artificial Intelligence · 1 min ·
LLMs

[2602.12262] T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization

The paper presents T3D, a framework for enhancing few-step diffusion language models through trajectory self-distillation and direct disc...

arXiv - Machine Learning · 3 min ·
Machine Learning

[2602.10538] Why Agentic Theorem Prover Works: A Statistical Provability Theory of Mathematical Reasoning Models

This article explores the effectiveness of agentic theorem provers through a statistical provability theory, analyzing their performance ...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.04884] Reinforced Attention Learning

The paper introduces Reinforced Attention Learning (RAL), a novel framework that optimizes internal attention distributions in multimodal...

arXiv - Machine Learning · 3 min ·
LLMs

[2509.22876] HEART: Emotionally-Driven Test-Time Scaling of Language Models

The paper presents HEART, a framework that leverages emotional cues to enhance the reasoning capabilities of language models during test-...

arXiv - Machine Learning · 3 min ·
LLMs

[2508.02872] Highlight & Summarize: RAG without the jailbreaks

The paper presents Highlight & Summarize (H&S), a novel design pattern for retrieval-augmented generation (RAG) systems that prevents jai...

arXiv - Machine Learning · 4 min ·
LLMs

[2412.06014] Post-hoc Probabilistic Vision-Language Models

This article presents a novel approach to uncertainty estimation in vision-language models (VLMs) by proposing a post-hoc method that enh...

arXiv - Machine Learning · 3 min ·
NLP

[2410.03041] Minmax Trend Filtering: Generalizations of Total Variation Denoising via a Local Minmax/Maxmin Formula

The paper introduces Minmax Trend Filtering (MTF), a novel approach to Total Variation Denoising (TVD) that utilizes a local minmax/maxmi...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2312.17111] Online Tensor Inference

The paper presents a novel framework for online tensor inference, addressing the challenges of real-time data processing in applications ...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.11151] Diffusion-Pretrained Dense and Contextual Embeddings

The paper introduces pplx-embed, a family of multilingual embedding models utilizing diffusion-pretrained language models for enhanced re...

arXiv - Machine Learning · 3 min ·