Natural Language Processing

Text understanding and language tasks

Top This Week

Nlp

McKinsey's AI Lie Explains What's Happening to Work

Everyone thinks McKinsey just built 25,000 AI experts. They didn't. They took a 35-year-old internal database, put a natural language int...

Reddit - Artificial Intelligence · 1 min ·
Generative Ai

Midjourney has a new offer on the cancel page there is 20 off for 2 months

submitted by /u/RainDragonfly826 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money
Nlp

Walmart CEO reportedly brags that company's in-app AI agent is making people spend 35% more money

AI Tools & Products · 4 min ·

All Content

[2602.15238] Closing the Distribution Gap in Adversarial Training for LLMs
Llms

[2602.15238] Closing the Distribution Gap in Adversarial Training for LLMs

This article discusses a novel approach to adversarial training for large language models (LLMs), proposing Distributional Adversarial Tr...

arXiv - Machine Learning · 3 min ·
[2602.15236] BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening
Machine Learning

[2602.15236] BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening

BindCLIP introduces a novel framework for virtual screening, enhancing ligand identification through a unified contrastive-generative lea...

arXiv - Machine Learning · 4 min ·
[2602.15222] Automatically Finding Reward Model Biases
Llms

[2602.15222] Automatically Finding Reward Model Biases

This article presents a novel approach to identifying biases in reward models used in large language models (LLMs), highlighting the pote...

arXiv - AI · 3 min ·
[2602.15210] ÜberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset
Llms

[2602.15210] ÜberWeb: Insights from Multilingual Curation for a 20-Trillion-Token Dataset

The paper discusses multilingual data curation strategies for training foundation models, revealing that targeted improvements in data qu...

arXiv - Machine Learning · 4 min ·
[2602.15200] COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression
Machine Learning

[2602.15200] COMPOT: Calibration-Optimized Matrix Procrustes Orthogonalization for Transformers Compression

The paper presents COMPOT, a novel framework for compressing Transformer models using Calibration-Optimized Matrix Procrustes Orthogonali...

arXiv - Machine Learning · 3 min ·
[2602.15183] Seeing to Generalize: How Visual Data Corrects Binding Shortcuts
Llms

[2602.15183] Seeing to Generalize: How Visual Data Corrects Binding Shortcuts

This article explores how Vision Language Models (VLMs) enhance performance on text-only tasks by correcting binding shortcuts through vi...

arXiv - Machine Learning · 4 min ·
[2602.15155] Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields
Machine Learning

[2602.15155] Refine Now, Query Fast: A Decoupled Refinement Paradigm for Implicit Neural Fields

The paper presents a Decoupled Representation Refinement (DRR) paradigm for Implicit Neural Representations (INRs), enhancing speed and f...

arXiv - Machine Learning · 4 min ·
[2602.15089] Hybrid Feature Learning with Time Series Embeddings for Equipment Anomaly Prediction
Machine Learning

[2602.15089] Hybrid Feature Learning with Time Series Embeddings for Equipment Anomaly Prediction

This study presents a hybrid approach for equipment anomaly prediction by combining time series embeddings with statistical features, ach...

arXiv - Machine Learning · 4 min ·
Llms

[D] Semantic Compression Vectors in LLMs: A Field Study on Topic Persistence in 5.1 vs 4o Models

This article explores the effectiveness of Semantic Compression Vectors (SCVs) in large language models (LLMs), comparing the 5.1 and 4o ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] We tested the same INT8 model on 5 Snapdragon chipsets. Accuracy ranged from 93% to 71%. Same weights, same ONNX file.

This article presents findings from testing an INT8 model across five Snapdragon chipsets, revealing significant variations in accuracy, ...

Reddit - Machine Learning · 1 min ·
Llms

Live demo: This is what AI shopping actually looks like when stores serve structured data via UCP

This article discusses a live demo showcasing AI shopping experiences using structured data via the Universal Commerce Protocol (UCP), hi...

Reddit - Artificial Intelligence · 1 min ·
NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル
Open Source Ai

NVIDIA Nemotron 2 Nano 9B Japanese: 日本のソブリンAIを支える最先端小規模言語モデル

NVIDIA has launched the Nemotron-Nano-9B-v2-Japanese, a lightweight language model designed to enhance Japanese language understanding an...

Hugging Face Blog · 2 min ·
Cohere launches a family of open multilingual models | TechCrunch
Machine Learning

Cohere launches a family of open multilingual models | TechCrunch

Cohere has launched Tiny Aya, a family of open multilingual models that support over 70 languages and can run on everyday devices, enhanc...

TechCrunch - AI · 5 min ·
[2602.10058] Evaluating Disentangled Representations for Controllable Music Generation
Machine Learning

[2602.10058] Evaluating Disentangled Representations for Controllable Music Generation

This article evaluates disentangled representations in music generation, focusing on their effectiveness for controllable synthesis and i...

arXiv - Machine Learning · 3 min ·
[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning
Llms

[2602.10551] C^2ROPE: Causal Continuous Rotary Positional Encoding for 3D Large Multimodal-Models Reasoning

The paper presents C^2ROPE, an advanced positional encoding method for 3D Large Multimodal Models, addressing limitations of existing Rot...

arXiv - AI · 4 min ·
[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible
Llms

[2602.10139] Anonymization-Enhanced Privacy Protection for Mobile GUI Agents: Available but Invisible

The paper presents a novel framework for enhancing privacy protection in mobile GUI agents by anonymizing sensitive data while maintainin...

arXiv - AI · 4 min ·
[2602.08274] Language Modeling and Understanding Through Paraphrase Generation and Detection
Llms

[2602.08274] Language Modeling and Understanding Through Paraphrase Generation and Detection

The paper explores the role of paraphrase generation and detection in language modeling, emphasizing the need for fine-grained semantic u...

arXiv - AI · 4 min ·
[2602.07672] Debugging code world models
Llms

[2602.07672] Debugging code world models

The paper explores Code World Models (CWMs), which simulate program execution and identify error sources, focusing on local semantic exec...

arXiv - AI · 4 min ·
[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis
Nlp

[2601.20336] Do Whitepaper Claims Predict Market Behavior? Evidence from Cryptocurrency Factor Analysis

This study examines the relationship between cryptocurrency whitepaper claims and actual market behavior, revealing weak predictive power...

arXiv - Machine Learning · 3 min ·
[2601.15518] DS@GT at TREC TOT 2025: Bridging Vague Recollection with Fusion Retrieval and Learned Reranking
Llms

[2601.15518] DS@GT at TREC TOT 2025: Bridging Vague Recollection with Fusion Retrieval and Learned Reranking

This paper presents a two-stage retrieval system designed for the TREC Tip-of-the-Tongue task, integrating multiple retrieval methods wit...

arXiv - Machine Learning · 3 min ·
Previous Page 114 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime