Natural Language Processing

Text understanding and language tasks


Top This Week

NLP

Automate iOS devices through XCUITest with Droidrun.

Automate iOS apps with XCUITest and Droidrun using just natural language. You send the command to Droidrun, and the agent starts the task...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Machine Learning

[P] Trained a small BERT on 276K Kubernetes YAMLs using tree positional encoding instead of sequential

I trained a BERT-style transformer on 276K Kubernetes YAML files, replacing standard positional encoding with learned tree coordinates (d...

Reddit - Machine Learning · 1 min · about 5 hours ago

Machine Learning

I am doing a multi-model graph database in pure Rust with Cypher, SQL, Gremlin, and native GNN looking for extreme speed and performance

Hi guys, I'm a PhD student in Applied AI and I've been building an embeddable graph database engine from scratch in Rust. I'd love feedba...

Reddit - Artificial Intelligence · 1 min · about 7 hours ago

All Content

Machine Learning

[2602.21701] Learning Complex Physical Regimes via Coverage-oriented Uncertainty Quantification: An application to the Critical Heat Flux

This article explores the application of coverage-oriented uncertainty quantification (UQ) in scientific machine learning, focusing on th...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21693] TiMi: Empower Time Series Transformers with Multimodal Mixture of Experts

The paper introduces TiMi, a novel approach that enhances time series forecasting by integrating multimodal data through a Mixture of Exp...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21597] NGDB-Zoo: Towards Efficient and Scalable Neural Graph Databases Training

The paper presents NGDB-Zoo, a framework designed to enhance the training efficiency of Neural Graph Databases (NGDBs) by decoupling logi...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2602.21371] Interleaved Head Attention

The paper introduces Interleaved Head Attention (IHA), a novel approach to Multi-Head Attention (MHA) that enhances reasoning capabilitie...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.10953] Search or Accelerate: Confidence-Switched Position Beam Search for Diffusion Language Models

The paper presents SOAR, a novel decoding algorithm for Diffusion Language Models that adapts its search strategy based on model confiden...

arXiv - AI · 3 min · about 1 month ago

NLP

[2602.02007] Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation

The paper introduces xMemory, a novel approach to agent memory systems that enhances retrieval by decoupling and aggregating semantic com...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.00462] LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs

The paper introduces LatentLens, a method for mapping visual tokens to natural language descriptions in Vision-Language Models (VLMs), en...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.00012] OGD4All: A Framework for Accessible Interaction with Geospatial Open Government Data Based on Large Language Models

The OGD4All framework enhances citizen interaction with geospatial Open Government Data using Large Language Models, achieving high accur...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2601.19922] HEART: A Unified Benchmark for Assessing Humans and LLMs in Emotional Support Dialogue

The paper introduces HEART, a benchmark for evaluating emotional support dialogue in humans and LLMs, focusing on empathy and communicati...

arXiv - AI · 4 min · about 1 month ago

AI Agents

[2601.15715] RebuttalAgent: Strategic Persuasion in Academic Rebuttal via Theory of Mind

The paper presents RebuttalAgent, a framework using Theory of Mind for strategic persuasion in academic rebuttals, addressing the complex...

arXiv - AI · 4 min · about 1 month ago

NLP

[2512.08639] Aerial Vision-Language Navigation with a Unified Framework for Spatial, Temporal and Embodied Reasoning

This article presents a unified framework for Aerial Vision-Language Navigation (VLN), enabling UAVs to interpret natural language and na...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2511.06899] RPTS: Tree-Structured Reasoning Process Scoring for Faithful Multimodal Evaluation

The paper presents the Reasoning Process Tree Score (RPTS), a novel metric for evaluating reasoning in Large Vision-Language Models (LVLM...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2511.12033] EARL: Entropy-Aware RL Alignment of LLMs for Reliable RTL Code Generation

The paper presents EARL, an Entropy-Aware Reinforcement Learning framework designed to enhance the reliability of RTL code generation by ...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2511.00062] World Simulation with Video Foundation Models for Physical AI

The paper presents Cosmos-Predict2.5, an advanced model for world simulation in Physical AI, integrating various generation methods and i...

arXiv - Machine Learning · 5 min · about 1 month ago

LLMs

[2510.05077] SLM-MUX: Orchestrating Small Language Models for Reasoning

The paper presents SLM-MUX, a novel architecture for orchestrating small language models (SLMs) to improve reasoning accuracy, achieving ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2510.00024] EpidemIQs: Prompt-to-Paper LLM Agents for Epidemic Modeling and Analysis

The paper presents EpidemIQs, a multi-agent framework utilizing large language models for efficient epidemic modeling, demonstrating impr...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2509.25184] Incentive-Aligned Multi-Source LLM Summaries

The paper presents an innovative framework called Truthful Text Summarization (TTS) aimed at enhancing the factual accuracy of multi-sour...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2509.23744] Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning

This article explores the foundational bottlenecks in multimodal reasoning, highlighting how additional modalities can enhance or hinder ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2509.18880] Diversity Boosts AI-Generated Text Detection

The paper presents DivEye, a novel framework for detecting AI-generated text by analyzing unpredictability in text structure and vocabula...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2509.14537] ClearFairy: Capturing Creative Workflows through Decision Structuring, In-Situ Questioning, and Rationale Inference

The paper introduces ClearFairy, an AI assistant designed to enhance decision-making in creative workflows by structuring reasoning and i...

arXiv - AI · 3 min · about 1 month ago
