Natural Language Processing

Text understanding and language tasks

Top This Week

NLP

McKinsey's AI Lie Explains What's Happening to Work

Everyone thinks McKinsey just built 25,000 AI experts. They didn't. They took a 35-year-old internal database, put a natural language int...

Reddit - Artificial Intelligence · 1 min · about 7 hours ago

Generative AI

Midjourney has a new offer on the cancel page there is 20 off for 2 months

submitted by /u/RainDragonfly826

Reddit - Artificial Intelligence · 1 min · about 16 hours ago

NLP

Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money

AI Tools & Products · 4 min · about 21 hours ago

All Content

LLMs

[2602.15238] Closing the Distribution Gap in Adversarial Training for LLMs

This article discusses a novel approach to adversarial training for large language models (LLMs), proposing Distributional Adversarial Tr...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15236] BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening

BindCLIP introduces a novel framework for virtual screening, enhancing ligand identification through a unified contrastive-generative lea...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[2602.15222] Automatically Finding Reward Model Biases

This article presents a novel approach to identifying biases in reward models used in large language models (LLMs), highlighting the pote...

arXiv - AI · 3 min · about 2 months ago

LLMs

[2602.15210] ÜberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset

The paper discusses multilingual data curation strategies for training foundation models, revealing that targeted improvements in data qu...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15200] COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression

The paper presents COMPOT, a novel framework for compressing Transformer models using Calibration-Optimized Matrix Procrustes Orthogonali...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.15183] Seeing to Generalize: How Visual Data Corrects Binding Shortcuts

This article explores how Vision Language Models (VLMs) enhance performance on text-only tasks by correcting binding shortcuts through vi...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15155] Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields

The paper presents a Decoupled Representation Refinement (DRR) paradigm for Implicit Neural Representations (INRs), enhancing speed and f...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15089] Hybrid Feature Learning with Time Series Embeddings for Equipment Anomaly Prediction

This study presents a hybrid approach for equipment anomaly prediction by combining time series embeddings with statistical features, ach...

arXiv - Machine Learning · 4 min · about 2 months ago

LLMs

[D] Semantic Compression Vectors in LLMs: A Field Study on Topic Persistence in 5.1 vs 4o Models

This article explores the effectiveness of Semantic Compression Vectors (SCVs) in large language models (LLMs), comparing the 5.1 and 4o ...

Reddit - Machine Learning · 1 min · about 2 months ago

Machine Learning

[D] We tested the same INT8 model on 5 Snapdragon chipsets. Accuracy ranged from 93% to 71%. Same weights, same ONNX file.

This article presents findings from testing an INT8 model across five Snapdragon chipsets, revealing significant variations in accuracy, ...

Reddit - Machine Learning · 1 min · about 2 months ago

LLMs

Live demo: This is what AI shopping actually looks like when stores serve structured data via UCP

This article discusses a live demo showcasing AI shopping experiences using structured data via the Universal Commerce Protocol (UCP), hi...

Reddit - Artificial Intelligence · 1 min · about 2 months ago

Open Source AI

NVIDIA Nemotron 2 Nano 9B Japanese: A Cutting-Edge Small Language Model Supporting Japan's Sovereign AI

NVIDIA has launched the Nemotron-Nano-9B-v2-Japanese, a lightweight language model designed to enhance Japanese language understanding an...

Hugging Face Blog · 2 min · about 2 months ago

Machine Learning

Cohere launches a family of open multilingual models | TechCrunch

Cohere has launched Tiny Aya, a family of open multilingual models that support over 70 languages and can run on everyday devices, enhanc...

TechCrunch - AI · 5 min · about 2 months ago

Machine Learning

[2602.10058] Evaluating Disentangled Representations for Controllable Music Generation

This article evaluates disentangled representations in music generation, focusing on their effectiveness for controllable synthesis and i...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

The paper presents C^2ROPE, an advanced positional encoding method for 3D Large Multimodal Models, addressing limitations of existing Rot...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible

The paper presents a novel framework for enhancing privacy protection in mobile GUI agents by anonymizing sensitive data while maintainin...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.08274] Language Modeling and Understanding Through Paraphrase Generation and Detection

The paper explores the role of paraphrase generation and detection in language modeling, emphasizing the need for fine-grained semantic u...

arXiv - AI · 4 min · about 2 months ago

LLMs

[2602.07672] Debugging code world models

The paper explores Code World Models (CWMs), which simulate program execution and identify error sources, focusing on local semantic exec...

arXiv - AI · 4 min · about 2 months ago

NLP

[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis

This study examines the relationship between cryptocurrency whitepaper claims and actual market behavior, revealing weak predictive power...

arXiv - Machine Learning · 3 min · about 2 months ago

LLMs

[2601.15518] DS@GT at TREC TOT 2025: Bridging Vague Recollection with Fusion Retrieval and Learned Reranking

This paper presents a two-stage retrieval system designed for the TREC Tip-of-the-Tongue task, integrating multiple retrieval methods wit...

arXiv - Machine Learning · 3 min · about 2 months ago
