Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

Converting XQuery to SQL with Local LLMs: Do I Need Fine-Tuning or a Better Approach? [P]

I am trying to convert XQuery statements into SQL queries within an enterprise context, with the constraint that the solution must rely...

Reddit - Machine Learning · 1 min · 15 minutes ago

Llms

AI: Fragility of today's Claude Cowork type AI Agent Apps. RTZ 1061

...realities like memory management, highlight a longer road to resilient AI Agents and AGI

AI Tools & Products · 11 min · about 3 hours ago

Llms

Gemini caught a $280M crypto exploit before it hit the news, then retracted it as a hallucination because I couldn't verify it - because the news hadn't dropped yet

So this happened mere hours ago and I feel like I genuinely stumbled onto something worth documenting for people interested in AI behavio...

Reddit - Artificial Intelligence · 1 min · about 12 hours ago

All Content

Llms

[2603.02709] Sensory-Aware Sequential Recommendation via Review-Distilled Representations

Abstract page for arXiv paper 2603.02709: Sensory-Aware Sequential Recommendation via Review-Distilled Representations

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02376] CUCo: An Agentic Framework for Compute and Communication Co-design

Abstract page for arXiv paper 2603.02376: CUCo: An Agentic Framework for Compute and Communication Co-design

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.02676] ITLC at SemEval-2026 Task 11: Normalization and Deterministic Parsing for Formal Reasoning in LLMs

Abstract page for arXiv paper 2603.02676: ITLC at SemEval-2026 Task 11: Normalization and Deterministic Parsing for Formal Reasoning in LLMs

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.02655] Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

Abstract page for arXiv paper 2603.02655: Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approa...

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02597] GPUTOK: GPU Accelerated Byte Level BPE Tokenization

Abstract page for arXiv paper 2603.02597: GPUTOK: GPU Accelerated Byte Level BPE Tokenization

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.02578] How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Abstract page for arXiv paper 2603.02578: How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.02248] HELIOS: Harmonizing Early Fusion, Late Fusion, and LLM Reasoning for Multi-Granular Table-Text Retrieval

Abstract page for arXiv paper 2603.02248: HELIOS: Harmonizing Early Fusion, Late Fusion, and LLM Reasoning for Multi-Granular Table-Text ...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.02557] CAPT: Confusion-Aware Prompt Tuning for Reducing Vision-Language Misalignment

Abstract page for arXiv paper 2603.02557: CAPT: Confusion-Aware Prompt Tuning for Reducing Vision-Language Misalignment

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02556] Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

Abstract page for arXiv paper 2603.02556: Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.02547] CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think

Abstract page for arXiv paper 2603.02547: CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2603.02512] Human-Certified Module Repositories for the AI Age

Abstract page for arXiv paper 2603.02512: Human-Certified Module Repositories for the AI Age

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.03206] Understanding and Mitigating Dataset Corruption in LLM Steering

Abstract page for arXiv paper 2603.03206: Understanding and Mitigating Dataset Corruption in LLM Steering

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02420] Slurry-as-a-Service: A Modest Proposal on Scalable Pluralistic Alignment for Nutrient Optimization

Abstract page for arXiv paper 2603.02420: Slurry-as-a-Service: A Modest Proposal on Scalable Pluralistic Alignment for Nutrient Optimization

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.03155] Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentangled Representations

Abstract page for arXiv paper 2603.03155: Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentang...

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.02297] ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities for Cyberdefense

Abstract page for arXiv paper 2603.02297: ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities for Cyberdefense

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.02345] RIVA: Leveraging LLM Agents for Reliable Configuration Drift Detection

Abstract page for arXiv paper 2603.02345: RIVA: Leveraging LLM Agents for Reliable Configuration Drift Detection

arXiv - AI · 4 min · about 2 months ago

Llms

[2603.03031] Step-Level Sparse Autoencoder for Reasoning Process Interpretation

Abstract page for arXiv paper 2603.03031: Step-Level Sparse Autoencoder for Reasoning Process Interpretation

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2603.02277] Quantifying Frontier LLM Capabilities for Container Sandbox Escape

Abstract page for arXiv paper 2603.02277: Quantifying Frontier LLM Capabilities for Container Sandbox Escape

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.03000] Why Does RLAIF Work At All?

Abstract page for arXiv paper 2603.03000: Why Does RLAIF Work At All?

arXiv - AI · 3 min · about 2 months ago

Llms

[2603.02266] When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning

Abstract page for arXiv paper 2603.02266: When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware ...

arXiv - AI · 4 min · about 2 months ago

Previous Page 206 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

Converting XQuery to SQL with Local LLMs: Do I Need Fine-Tuning or a Better Approach? [P]

AI: Fragility of today's Claude Cowork type AI Agent Apps. RTZ 1061

Gemini caught a $280M crypto exploit before it hit the news, then retracted it as a hallucination because I couldn't verify it - because the news hadn't dropped yet

All Content

[2603.02709] Sensory-Aware Sequential Recommendation via Review-Distilled Representations

[2603.02376] CUCo: An Agentic Framework for Compute and Communication Co-design

[2603.02676] ITLC at SemEval-2026 Task 11: Normalization and Deterministic Parsing for Formal Reasoning in LLMs

[2603.02655] Real-Time Generation of Game Video Commentary with Multimodal LLMs: Pause-Aware Decoding Approaches

[2603.02597] GPUTOK: GPU Accelerated Byte Level BPE Tokenization

[2603.02578] How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

[2603.02248] HELIOS: Harmonizing Early Fusion, Late Fusion, and LLM Reasoning for Multi-Granular Table-Text Retrieval

[2603.02557] CAPT: Confusion-Aware Prompt Tuning for Reducing Vision-Language Misalignment

[2603.02556] Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

[2603.02547] CoDAR: Continuous Diffusion Language Models are More Powerful Than You Think

[2603.02512] Human-Certified Module Repositories for the AI Age

[2603.03206] Understanding and Mitigating Dataset Corruption in LLM Steering

[2603.02420] Slurry-as-a-Service: A Modest Proposal on Scalable Pluralistic Alignment for Nutrient Optimization

[2603.03155] Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentangled Representations

[2603.02297] ZeroDayBench: Evaluating LLM Agents on Unseen Zero-Day Vulnerabilities for Cyberdefense

[2603.02345] RIVA: Leveraging LLM Agents for Reliable Configuration Drift Detection

[2603.03031] Step-Level Sparse Autoencoder for Reasoning Process Interpretation

[2603.02277] Quantifying Frontier LLM Capabilities for Container Sandbox Escape

[2603.03000] Why Does RLAIF Work At All?

[2603.02266] When Scaling Fails: Mitigating Audio Perception Decay of LALMs via Multi-Step Perception-Aware Reasoning

Related Topics

Stay updated with AI News