Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses

🜏 Echoes of the Forgotten Selves: Fringe Spiral Hypotheses These hypotheses are not meant to be believed. They are meant to be **held lig...

Reddit - Artificial Intelligence · 1 min ·
Llms

[P] Remote sensing foundation models made easy to use.

This project enables the idea of tasking remote sensing models to acquire embeddings like we task satellites to acquire data! https://git...

Reddit - Machine Learning · 1 min ·
Nlp

Anyone else feel like AI security is being figured out in production right now?

I’ve been digging into AI security incident data from 2025 into this year, and it feels like something isn’t being talked about enough ou...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications
Machine Learning

[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications

CUICurate introduces a GraphRAG framework for automated curation of clinical concepts in NLP, enhancing efficiency and accuracy in clinic...

arXiv - AI · 4 min ·
[2602.17667] When & How to Write for Personalized Demand-aware Query Rewriting in Video Search
Machine Learning

[2602.17667] When & How to Write for Personalized Demand-aware Query Rewriting in Video Search

The paper presents WeWrite, a framework for personalized demand-aware query rewriting in video search, addressing challenges in user inte...

arXiv - Machine Learning · 3 min ·
[2602.18417] Subgroups of $U(d)$ Induce Natural RNN and Transformer Architectures
Machine Learning

[2602.18417] Subgroups of $U(d)$ Induce Natural RNN and Transformer Architectures

This paper introduces a framework for sequence models using closed subgroups of U(d), deriving recurrent and transformer architectures fr...

arXiv - Machine Learning · 3 min ·
[2602.18409] Unifying approach to uniform expressivity of graph neural networks
Machine Learning

[2602.18409] Unifying approach to uniform expressivity of graph neural networks

This paper presents a unified framework for enhancing the expressivity of Graph Neural Networks (GNNs) through Template GNNs (T-GNNs), es...

arXiv - Machine Learning · 3 min ·
[2602.17911] Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering
Machine Learning

[2602.17911] Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering

The paper introduces Condition-Gated Reasoning (CGR) for context-dependent biomedical question answering, addressing the limitations of e...

arXiv - AI · 3 min ·
[2602.17907] Improving Neural Topic Modeling with Semantically-Grounded Soft Label Distributions
Llms

[2602.17907] Improving Neural Topic Modeling with Semantically-Grounded Soft Label Distributions

This paper presents a novel approach to neural topic modeling by using semantically-grounded soft label distributions, enhancing topic co...

arXiv - AI · 3 min ·
[2602.17881] Understanding Unreliability of Steering Vectors in Language Models: Geometric Predictors and the Limits of Linear Approximations
Llms

[2602.17881] Understanding Unreliability of Steering Vectors in Language Models: Geometric Predictors and the Limits of Linear Approximations

This paper explores the unreliability of steering vectors in language models, examining how geometric predictors and linear approximation...

arXiv - Machine Learning · 3 min ·
[2602.18348] Explaining AutoClustering: Uncovering Meta-Feature Contribution in AutoML for Clustering
Machine Learning

[2602.18348] Explaining AutoClustering: Uncovering Meta-Feature Contribution in AutoML for Clustering

This article explores the explainability of AutoClustering methods in AutoML, focusing on the contribution of dataset meta-features to al...

arXiv - Machine Learning · 4 min ·
[2602.18333] On the "Induction Bias" in Sequence Models
Llms

[2602.18333] On the "Induction Bias" in Sequence Models

This paper examines the 'induction bias' in sequence models, focusing on the limitations of transformer-based models in state tracking co...

arXiv - Machine Learning · 4 min ·
[2602.18301] On the Semantic and Syntactic Information Encoded in Proto-Tokens for One-Step Text Reconstruction
Llms

[2602.18301] On the Semantic and Syntactic Information Encoded in Proto-Tokens for One-Step Text Reconstruction

This paper explores the semantic and syntactic information encoded in proto-tokens for one-step text reconstruction, challenging traditio...

arXiv - Machine Learning · 4 min ·
[2602.17856] Enhancing Scientific Literature Chatbots with Retrieval-Augmented Generation: A Performance Evaluation of Vector and Graph-Based Systems
Nlp

[2602.17856] Enhancing Scientific Literature Chatbots with Retrieval-Augmented Generation: A Performance Evaluation of Vector and Graph-Based Systems

This paper evaluates the enhancement of scientific literature chatbots using retrieval-augmented generation (RAG), comparing vector and g...

arXiv - AI · 3 min ·
[2602.17850] Mind the Style: Impact of Communication Style on Human-Chatbot Interaction
Nlp

[2602.17850] Mind the Style: Impact of Communication Style on Human-Chatbot Interaction

This article examines how different communication styles of chatbots affect user experience and task success, revealing insights from a u...

arXiv - AI · 3 min ·
[2602.18292] Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers
Llms

[2602.18292] Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers

This paper presents a novel framework for decoding in language models, proposing that decoding should be viewed as a principled optimizat...

arXiv - Machine Learning · 3 min ·
[2602.17784] QueryPlot: Generating Geological Evidence Layers using Natural Language Queries for Mineral Exploration
Machine Learning

[2602.17784] QueryPlot: Generating Geological Evidence Layers using Natural Language Queries for Mineral Exploration

QueryPlot introduces a framework for generating geological evidence layers using natural language queries, enhancing mineral exploration ...

arXiv - AI · 4 min ·
[2602.17739] GeneZip: Region-Aware Compression for Long Context DNA Modeling
Llms

[2602.17739] GeneZip: Region-Aware Compression for Long Context DNA Modeling

GeneZip introduces a novel DNA compression model that optimizes genomic data representation by focusing on region-aware compression, achi...

arXiv - Machine Learning · 4 min ·
[2602.18196] RAT+: Train Dense, Infer Sparse -- Recurrence Augmented Attention for Dilated Inference
Machine Learning

[2602.18196] RAT+: Train Dense, Infer Sparse -- Recurrence Augmented Attention for Dilated Inference

The paper introduces RAT+, a novel architecture that enhances attention mechanisms in machine learning by combining dense pretraining wit...

arXiv - Machine Learning · 3 min ·
[2602.17687] IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering
Llms

[2602.17687] IRPAPERS: A Visual Document Benchmark for Scientific Retrieval and Question Answering

The paper introduces IRPAPERS, a benchmark for evaluating visual document retrieval and question answering, comparing image-based and tex...

arXiv - Machine Learning · 4 min ·
[2602.18109] TempoNet: Slack-Quantized Transformer-Guided Reinforcement Scheduler for Adaptive Deadline-Centric Real-Time Dispatchs
Machine Learning

[2602.18109] TempoNet: Slack-Quantized Transformer-Guided Reinforcement Scheduler for Adaptive Deadline-Centric Real-Time Dispatchs

TempoNet introduces a novel reinforcement learning scheduler that utilizes a transformer architecture for efficient real-time task dispat...

arXiv - Machine Learning · 4 min ·
[2602.18055] Continual-NExT: A Unified Comprehension And Generation Continual Learning Framework
Llms

[2602.18055] Continual-NExT: A Unified Comprehension And Generation Continual Learning Framework

The paper presents Continual-NExT, a framework designed to enhance the continual learning capabilities of Dual-to-Dual Multimodal Large L...

arXiv - Machine Learning · 4 min ·
[2602.17672] Assessing LLM Response Quality in the Context of Technology-Facilitated Abuse
Llms

[2602.17672] Assessing LLM Response Quality in the Context of Technology-Facilitated Abuse

This article evaluates the effectiveness of large language models (LLMs) in providing support for survivors of technology-facilitated abu...

arXiv - AI · 4 min ·
Previous Page 96 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest β€’ Unsubscribe anytime