Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Nlp

McKinsey's AI Lie Explains What's Happening to Work

Everyone thinks McKinsey just built 25,000 AI experts. They didn't. They took a 35-year-old internal database, put a natural language int...

Reddit - Artificial Intelligence · 1 min · about 11 hours ago

Generative Ai

Midjourney has a new offer on the cancel page there is 20 off for 2 months

submitted by /u/RainDragonfly826 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 20 hours ago

Nlp

Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money

AI Tools & Products · 4 min · 1 day ago

All Content

Llms

[2509.17196] Evolution of Concepts in Language Model Pre-Training

This article examines the evolution of concepts in language model pre-training, revealing how feature development influences performance ...

arXiv - AI · 3 min · about 2 months ago

Generative Ai

[2508.18210] Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation

This article presents a diagnostic framework for evaluating synthetic dialogue generation in contact centers, highlighting the limitation...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2410.17587] Predicting Company Growth using Scaling Theory informed Machine Learning

The paper presents a novel Scaling-Theory-Informed Machine Learning (STIML) framework for predicting company growth by integrating struct...

arXiv - Machine Learning · 4 min · about 2 months ago

Nlp

[2508.03882] Simulating Cyberattacks through a Breach Attack Simulation (BAS) Platform empowered by Security Chaos Engineering (SCE)

This article presents a novel approach to simulating cyberattacks by integrating Security Chaos Engineering (SCE) into Breach Attack Simu...

arXiv - AI · 3 min · about 2 months ago

Llms

[2507.16713] A Pragmatist Robot: Learning to Plan Tasks by Experiencing the Real World

The paper presents PragmaBot, a framework for robotic task planning that utilizes real-world experiences and self-reflection to enhance l...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2507.14186] A Disentangled Representation Learning Framework for Low-altitude Network Coverage Prediction

This paper presents a novel framework for predicting low-altitude network coverage using disentangled representation learning, addressing...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.12247] ExtractBench: A Benchmark and Evaluation Methodology for Complex Structured Extraction

ExtractBench introduces a benchmark and evaluation framework for extracting structured data from unstructured documents like PDFs, addres...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.10603] dnaHNet: A Scalable and Hierarchical Foundation Model for Genomic Sequence Learning

The paper presents dnaHNet, a novel tokenizer-free autoregressive model designed for genomic sequence learning, achieving significant eff...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.04942] Privileged Information Distillation for Language Models

This paper presents methods for distilling privileged information in language models, focusing on improving performance in multi-turn env...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.02201] Cardinality-Preserving Attention Channels for Graph Transformers in Molecular Property Prediction

This article presents a novel graph transformer model, incorporating cardinality-preserving attention channels, to enhance molecular prop...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.00628] From Associations to Activations: Comparing Behavioral and Hidden-State Semantic Geometry in LLMs

This paper examines the relationship between behavioral and hidden-state semantic geometry in large language models (LLMs) through psycho...

arXiv - AI · 3 min · about 2 months ago

Llms

[2505.07671] Benchmarking Retrieval-Augmented Generation for Chemistry

This article presents ChemRAG-Bench, a benchmark for evaluating retrieval-augmented generation (RAG) in chemistry, demonstrating signific...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2601.09495] Parallelizable memory recurrent units

The paper introduces memory recurrent units (MRUs), a new family of RNNs that combine persistent memory with parallelizable computations,...

arXiv - Machine Learning · 4 min · about 2 months ago

Nlp

[2512.13228] ModSSC: A Modular Framework for Semi-Supervised Classification on Heterogeneous Data

ModSSC is an open-source Python framework designed for semi-supervised classification, enhancing reproducibility and experimentation acro...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2502.16730] RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents

RapidPen is a novel automated penetration testing framework that utilizes large language models to autonomously exploit vulnerabilities, ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2511.02077] Beyond Static Cutoffs: One-Shot Dynamic Thresholding for Diffusion Language Models

This article presents One-Shot Dynamic Thresholding (OSDT) for diffusion language models, enhancing decoding efficiency and accuracy by c...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2510.15987] Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models

The paper explores how algorithmic primitives and compositional geometry can enhance reasoning capabilities in large language models (LLM...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2510.04008] RACE Attention: A Strictly Linear-Time Attention for Long-Sequence Training

The paper presents RACE Attention, a novel linear-time attention mechanism designed for long-sequence training, significantly improving e...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2510.03272] Where to Add PDE Diffusion in Transformers

This paper investigates the optimal placement of PDE diffusion layers in transformer architectures, revealing that their insertion order ...

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.02410] OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data

OpenTSLM introduces a new family of Time Series Language Models designed to enhance reasoning over multivariate medical data, outperformi...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 116 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

McKinsey's AI Lie Explains What's Happening to Work

Midjourney has a new offer on the cancel page there is 20 off for 2 months

Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money

All Content

[2509.17196] Evolution of Concepts in Language Model Pre-Training

[2508.18210] Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation

[2410.17587] Predicting Company Growth using Scaling Theory informed Machine Learning

[2508.03882] Simulating Cyberattacks through a Breach Attack Simulation (BAS) Platform empowered by Security Chaos Engineering (SCE)

[2507.16713] A Pragmatist Robot: Learning to Plan Tasks by Experiencing the Real World

[2507.14186] A Disentangled Representation Learning Framework for Low-altitude Network Coverage Prediction

[2602.12247] ExtractBench: A Benchmark and Evaluation Methodology for Complex Structured Extraction

[2602.10603] dnaHNet: A Scalable and Hierarchical Foundation Model for Genomic Sequence Learning

[2602.04942] Privileged Information Distillation for Language Models

[2602.02201] Cardinality-Preserving Attention Channels for Graph Transformers in Molecular Property Prediction

[2602.00628] From Associations to Activations: Comparing Behavioral and Hidden-State Semantic Geometry in LLMs

[2505.07671] Benchmarking Retrieval-Augmented Generation for Chemistry

[2601.09495] Parallelizable memory recurrent units

[2512.13228] ModSSC: A Modular Framework for Semi-Supervised Classification on Heterogeneous Data

[2502.16730] RapidPen: Fully Automated IP-to-Shell Penetration Testing with LLM-based Agents

[2511.02077] Beyond Static Cutoffs: One-Shot Dynamic Thresholding for Diffusion Language Models

[2510.15987] Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models

[2510.04008] RACE Attention: A Strictly Linear-Time Attention for Long-Sequence Training

[2510.03272] Where to Add PDE Diffusion in Transformers

[2510.02410] OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data

Related Topics

Stay updated with AI News