Natural Language Processing

Text understanding and language tasks

Top This Week

Machine Learning

[D] ICML 26 - What to do with the zero follow-up questions

Hello everyone. I submitted my work to ICML 26 this year, and it got somewhat above average reviews. Now, in the rebuttal acknowledgment,...

Reddit - Machine Learning · 1 min ·
Startup Battlefield 200 applications open until May 27 | TechCrunch
Nlp

Startup Battlefield 200 applications open until May 27 | TechCrunch

Nominate your startup, or one you know, and apply for a chance at VC access, TechCrunch coverage, and $100K for Startup Battlefield 200.

TechCrunch - AI · 4 min ·
[2603.24326] Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing
Llms

[2603.24326] Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

Abstract page for arXiv paper 2603.24326: Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing

arXiv - AI · 4 min ·

All Content

[2602.13647] PT-RAG: Structure-Fidelity Retrieval-Augmented Generation for Academic Papers
Nlp

[2602.13647] PT-RAG: Structure-Fidelity Retrieval-Augmented Generation for Academic Papers

PT-RAG introduces a novel framework for retrieval-augmented generation that maintains the hierarchical structure of academic papers, impr...

arXiv - AI · 4 min ·
[2602.15029] Symmetry in language statistics shapes the geometry of model representations
Llms

[2602.15029] Symmetry in language statistics shapes the geometry of model representations

This article explores how symmetry in language statistics influences the geometric representation of models in machine learning, particul...

arXiv - Machine Learning · 4 min ·
[2602.13571] LLM-Confidence Reranker: A Training-Free Approach for Enhancing Retrieval-Augmented Generation Systems
Llms

[2602.13571] LLM-Confidence Reranker: A Training-Free Approach for Enhancing Retrieval-Augmented Generation Systems

The paper presents the LLM-Confidence Reranker, a training-free algorithm designed to enhance retrieval-augmented generation systems by l...

arXiv - AI · 4 min ·
[2602.15014] Scaling Beyond Masked Diffusion Language Models
Llms

[2602.15014] Scaling Beyond Masked Diffusion Language Models

This paper explores scaling laws in masked diffusion language models, revealing that they can be made more efficient and competitive agai...

arXiv - Machine Learning · 4 min ·
[2602.13521] Arming Data Agents with Tribal Knowledge
Llms

[2602.13521] Arming Data Agents with Tribal Knowledge

The paper introduces Tk-Boost, a framework enhancing NL2SQL agents by integrating tribal knowledge to correct misconceptions during datab...

arXiv - AI · 4 min ·
[2602.13504] From Perceptions To Evidence: Detecting AI-Generated Content In Turkish News Media With A Fine-Tuned Bert Classifier
Llms

[2602.13504] From Perceptions To Evidence: Detecting AI-Generated Content In Turkish News Media With A Fine-Tuned Bert Classifier

This study presents a fine-tuned BERT classifier for detecting AI-generated content in Turkish news media, achieving a high F1 score and ...

arXiv - AI · 4 min ·
[2602.14914] Additive Control Variates Dominate Self-Normalisation in Off-Policy Evaluation
Nlp

[2602.14914] Additive Control Variates Dominate Self-Normalisation in Off-Policy Evaluation

This paper presents a theoretical analysis demonstrating that additive control variates outperform self-normalisation techniques in off-p...

arXiv - Machine Learning · 3 min ·
[2602.14913] Coverage Guarantees for Pseudo-Calibrated Conformal Prediction under Distribution Shift
Machine Learning

[2602.14913] Coverage Guarantees for Pseudo-Calibrated Conformal Prediction under Distribution Shift

This paper explores coverage guarantees for pseudo-calibrated conformal prediction methods under distribution shifts, proposing a new alg...

arXiv - Machine Learning · 3 min ·
[2602.14889] Web-Scale Multimodal Summarization using CLIP-Based Semantic Alignment
Machine Learning

[2602.14889] Web-Scale Multimodal Summarization using CLIP-Based Semantic Alignment

The paper presents a framework for web-scale multimodal summarization that integrates text and image data using CLIP-based semantic align...

arXiv - Machine Learning · 3 min ·
[2602.13466] Language Model Memory and Memory Models for Language
Llms

[2602.13466] Language Model Memory and Memory Models for Language

The paper explores the limitations of memory in language models, proposing a new architecture that enhances memory formation through comb...

arXiv - Machine Learning · 3 min ·
[2602.14855] A Pragmatic Method for Comparing Clusterings with Overlaps and Outliers
Nlp

[2602.14855] A Pragmatic Method for Comparing Clusterings with Overlaps and Outliers

This paper presents a new method for comparing clustering results that accommodates overlaps and outliers, addressing a gap in existing e...

arXiv - Machine Learning · 3 min ·
[2602.13455] Using Machine Learning to Enhance the Detection of Obfuscated Abusive Words in Swahili: A Focus on Child Safety
Machine Learning

[2602.13455] Using Machine Learning to Enhance the Detection of Obfuscated Abusive Words in Swahili: A Focus on Child Safety

This article explores the use of machine learning to detect obfuscated abusive language in Swahili, focusing on child safety and the chal...

arXiv - AI · 4 min ·
[2602.13452] LLM-Powered Automatic Translation and Urgency in Crisis Scenarios
Llms

[2602.13452] LLM-Powered Automatic Translation and Urgency in Crisis Scenarios

This article examines the effectiveness of large language models (LLMs) in crisis communication, particularly focusing on multilingual tr...

arXiv - AI · 3 min ·
[2602.13370] G2CP: A Graph-Grounded Communication Protocol for Verifiable and Efficient Multi-Agent Reasoning
Llms

[2602.13370] G2CP: A Graph-Grounded Communication Protocol for Verifiable and Efficient Multi-Agent Reasoning

The paper presents G2CP, a novel communication protocol for multi-agent systems that enhances efficiency and verifiability by using graph...

arXiv - AI · 3 min ·
[2602.14759] Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training
Machine Learning

[2602.14759] Inner Loop Inference for Pretrained Transformers: Unlocking Latent Capabilities Without Training

The paper presents 'Inner Loop Inference,' a method for enhancing pretrained Transformers by iteratively refining outputs during inferenc...

arXiv - Machine Learning · 4 min ·
[2602.13362] Nonparametric Distribution Regression Re-calibration
Machine Learning

[2602.13362] Nonparametric Distribution Regression Re-calibration

The paper presents a novel nonparametric algorithm for re-calibrating predictive distributions in regression, addressing the challenge of...

arXiv - Machine Learning · 3 min ·
[2602.14728] D2-LoRA: A Synergistic Approach to Differential and Directional Low-Rank Adaptation
Machine Learning

[2602.14728] D2-LoRA: A Synergistic Approach to Differential and Directional Low-Rank Adaptation

D2-LoRA introduces a novel method for efficient fine-tuning in machine learning, achieving significant accuracy improvements while minimi...

arXiv - Machine Learning · 4 min ·
[2602.13352] Using Deep Learning to Generate Semantically Correct Hindi Captions
Machine Learning

[2602.13352] Using Deep Learning to Generate Semantically Correct Hindi Captions

This article explores the use of deep learning techniques to generate semantically accurate image captions in Hindi, utilizing advanced m...

arXiv - AI · 4 min ·
[2602.14696] A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)
Llms

[2602.14696] A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)

This paper critically examines targeted instruction selection for fine-tuning large language models, analyzing data representation and se...

arXiv - Machine Learning · 4 min ·
[2602.13346] CellMaster: Collaborative Cell Type Annotation in Single-Cell Analysis
Llms

[2602.13346] CellMaster: Collaborative Cell Type Annotation in Single-Cell Analysis

CellMaster introduces an AI-driven approach for zero-shot cell-type annotation in single-cell RNA sequencing, improving accuracy signific...

arXiv - AI · 3 min ·
Previous Page 123 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime