Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

Automate IOS devices through XCUITest with droidrun.

Automate iOS apps with XCUITest and Droidrun using just natural language. You send the command to Droidrun, and the agent starts the task...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[P] Trained a small BERT on 276K Kubernetes YAMLs using tree positional encoding instead of sequential

I trained a BERT-style transformer on 276K Kubernetes YAML files, replacing standard positional encoding with learned tree coordinates (d...

Reddit - Machine Learning · 1 min ·
Machine Learning

I am doing a multi-model graph database in pure Rust with Cypher, SQL, Gremlin, and native GNN looking for extreme speed and performance

Hi guys, I'm a PhD student in Applied AI and I've been building an embeddable graph database engine from scratch in Rust. I'd love feedba...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2509.02452] Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
Llms

[2509.02452] Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions

This article investigates whether large language models (LLMs) adhere to external label definitions or rely on internal representations, ...

arXiv - Machine Learning · 3 min ·
[2508.19982] Diffusion Language Models Know the Answer Before Decoding
Llms

[2508.19982] Diffusion Language Models Know the Answer Before Decoding

The paper discusses Diffusion Language Models (DLMs) and introduces a new decoding method called Prophet, which allows for faster inferen...

arXiv - AI · 4 min ·
[2506.09886] Probabilistic distances-based hallucination detection in LLMs with RAG
Llms

[2506.09886] Probabilistic distances-based hallucination detection in LLMs with RAG

This paper presents a novel method for detecting hallucinations in large language models (LLMs) using probabilistic distances in retrieva...

arXiv - AI · 3 min ·
[2506.07477] Premise Selection for a Lean Hammer
Machine Learning

[2506.07477] Premise Selection for a Lean Hammer

The paper presents LeanPremise, a neural premise selection system that enhances LeanHammer, a tool for automated reasoning in proof assis...

arXiv - Machine Learning · 4 min ·
[2506.05154] Resisting Contextual Interference in RAG via Parametric-Knowledge Reinforcement
Llms

[2506.05154] Resisting Contextual Interference in RAG via Parametric-Knowledge Reinforcement

The paper presents Knowledgeable-R1, a reinforcement-learning framework designed to enhance retrieval-augmented generation (RAG) by mitig...

arXiv - AI · 4 min ·
[2407.15160] When Can Transformers Count to n?
Llms

[2407.15160] When Can Transformers Count to n?

This paper investigates the limitations of transformer models in performing basic counting tasks, revealing a critical relationship betwe...

arXiv - Machine Learning · 4 min ·
[2509.01350] Error Notebook-Guided, Training-Free Part Retrieval in 3D CAD Assemblies via Vision-Language Models
Llms

[2509.01350] Error Notebook-Guided, Training-Free Part Retrieval in 3D CAD Assemblies via Vision-Language Models

The paper presents a novel framework for part retrieval in 3D CAD assemblies using vision-language models, emphasizing training-free meth...

arXiv - AI · 4 min ·
[2506.13793] Med-REFL: Medical Reasoning Enhancement via Self-Corrected Fine-grained Reflection
Machine Learning

[2506.13793] Med-REFL: Medical Reasoning Enhancement via Self-Corrected Fine-grained Reflection

The paper presents Med-REFL, a framework designed to enhance medical reasoning in AI by enabling self-correction through fine-grained ref...

arXiv - AI · 4 min ·
[2602.22207] Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets
Llms

[2602.22207] Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

The paper presents an automated framework for translating benchmarks and datasets for multilingual Large Language Model evaluation, addre...

arXiv - Machine Learning · 3 min ·
[2602.22145] When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models
Llms

[2602.22145] When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models

This article explores the phenomenon of 'Cultural Ghosting' in large language models (LLMs), highlighting the systematic erasure of cultu...

arXiv - AI · 4 min ·
[2602.22039] TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition
Machine Learning

[2602.22039] TG-ASR: Translation-Guided Learning with Parallel Gated Cross Attention for Low-Resource Automatic Speech Recognition

The paper presents TG-ASR, a translation-guided framework for improving automatic speech recognition in low-resource languages, specifica...

arXiv - AI · 4 min ·
[2602.22026] RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models
Llms

[2602.22026] RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models

This article presents a novel approach to Kilometer Marker Recognition (KMR) using RGB-event cameras, enhancing visual perception for aut...

arXiv - AI · 3 min ·
[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code
Llms

[2602.21997] Enhancing LLM-Based Test Generation by Eliminating Covered Code

This paper presents a novel method for enhancing LLM-based unit test generation by eliminating covered code, addressing challenges in tes...

arXiv - Machine Learning · 4 min ·
[2602.21864] DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs
Llms

[2602.21864] DynamicGTR: Leveraging Graph Topology Representation Preferences to Boost VLM Capabilities on Graph QAs

The paper presents DynamicGTR, a framework that enhances Vision-Language Models (VLMs) by dynamically selecting optimal graph topology re...

arXiv - AI · 4 min ·
[2602.21829] StoryMovie: A Dataset for Semantic Alignment of Visual Stories with Movie Scripts and Subtitles
Machine Learning

[2602.21829] StoryMovie: A Dataset for Semantic Alignment of Visual Stories with Movie Scripts and Subtitles

The paper introduces StoryMovie, a dataset designed for aligning visual stories with movie scripts and subtitles, enhancing dialogue attr...

arXiv - AI · 3 min ·
[2602.21800] An Evaluation of Context Length Extrapolation in Long Code via Positional Embeddings and Efficient Attention
Llms

[2602.21800] An Evaluation of Context Length Extrapolation in Long Code via Positional Embeddings and Efficient Attention

This paper evaluates methods for context length extrapolation in long code using positional embeddings and efficient attention mechanisms...

arXiv - AI · 3 min ·
[2602.21720] Evaluating the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning
Ai Safety

[2602.21720] Evaluating the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning

This article explores the relationship between regularity and learnability in recursive numeral systems using Reinforcement Learning, dem...

arXiv - AI · 3 min ·
[2602.21647] Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration
Machine Learning

[2602.21647] Mitigating Structural Noise in Low-Resource S2TT: An Optimized Cascaded Nepali-English Pipeline with Punctuation Restoration

This paper presents an optimized cascaded Nepali-English speech-to-text translation system that mitigates structural noise from ASR, enha...

arXiv - Machine Learning · 4 min ·
[2602.21611] Structurally Aligned Subtask-Level Memory for Software Engineering Agents
Llms

[2602.21611] Structurally Aligned Subtask-Level Memory for Software Engineering Agents

The paper presents Structurally Aligned Subtask-Level Memory, a novel approach for enhancing software engineering agents by improving mem...

arXiv - AI · 3 min ·
[2602.21598] Retrieval Challenges in Low-Resource Public Service Information: A Case Study on Food Pantry Access
Nlp

[2602.21598] Retrieval Challenges in Low-Resource Public Service Information: A Case Study on Food Pantry Access

This article explores the challenges of retrieving public service information in low-resource environments, focusing on food pantry acces...

arXiv - AI · 3 min ·
Previous Page 75 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime