Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Associative memory system for LLMs that learns during inference [P]

I've been working on MDA (Modular Dynamic Architecture), an online associative memory system for LLMs. Here's what I learned building it....

Reddit - Machine Learning · 1 min ·
Llms

Things I got wrong building a confidence evaluator for local LLMs [D]

I've been building **Autodidact**, a local-first AI agent framework. The central piece is a **confidence evaluator** - something that dec...

Reddit - Machine Learning · 1 min ·
Llms

I’m convinced 90% of you building "AI Agents" are just burning money on proxy providers. [D]

Seriously, I just audited my stack and realized I’m spending more on rotating residential proxies than I am on the actual Claude and Open...

Reddit - Machine Learning · 1 min ·

All Content

[2603.05121] Measuring the Redundancy of Decoder Layers in SpeechLLMs
Llms

[2603.05121] Measuring the Redundancy of Decoder Layers in SpeechLLMs

Abstract page for arXiv paper 2603.05121: Measuring the Redundancy of Decoder Layers in SpeechLLMs

arXiv - AI · 3 min ·
[2603.04982] Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis
Llms

[2603.04982] Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

Abstract page for arXiv paper 2603.04982: Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

arXiv - AI · 4 min ·
[2603.04976] 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
Llms

[2603.04976] 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding

Abstract page for arXiv paper 2603.04976: 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding

arXiv - AI · 4 min ·
[2603.04968] When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger
Llms

[2603.04968] When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger

Abstract page for arXiv paper 2603.04968: When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger

arXiv - AI · 3 min ·
[2603.04918] BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
Llms

[2603.04918] BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Abstract page for arXiv paper 2603.04918: BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforc...

arXiv - Machine Learning · 3 min ·
[2603.04893] Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models
Llms

[2603.04893] Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

Abstract page for arXiv paper 2603.04893: Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

arXiv - AI · 4 min ·
[2603.04819] On the Strengths and Weaknesses of Data for Open-set Embodied Assistance
Llms

[2603.04819] On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

Abstract page for arXiv paper 2603.04819: On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

arXiv - Machine Learning · 4 min ·
[2603.04805] Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation
Llms

[2603.04805] Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation

Abstract page for arXiv paper 2603.04805: Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation

arXiv - AI · 3 min ·
[2603.04799] Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm
Llms

[2603.04799] Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm

Abstract page for arXiv paper 2603.04799: Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm

arXiv - AI · 4 min ·
[2603.04772] TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings
Llms

[2603.04772] TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

Abstract page for arXiv paper 2603.04772: TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

arXiv - AI · 3 min ·
[2603.04763] Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary
Llms

[2603.04763] Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary

Abstract page for arXiv paper 2603.04763: Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary

arXiv - Machine Learning · 4 min ·
[2603.04743] DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval
Llms

[2603.04743] DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

Abstract page for arXiv paper 2603.04743: DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

arXiv - AI · 4 min ·
[2603.04759] Stacked from One: Multi-Scale Self-Injection for Context Window Extension
Llms

[2603.04759] Stacked from One: Multi-Scale Self-Injection for Context Window Extension

Abstract page for arXiv paper 2603.04759: Stacked from One: Multi-Scale Self-Injection for Context Window Extension

arXiv - AI · 4 min ·
[2603.04727] Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild
Llms

[2603.04727] Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild

Abstract page for arXiv paper 2603.04727: Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in t...

arXiv - AI · 4 min ·
[2603.04707] Detection of Illicit Content on Online Marketplaces using Large Language Models
Llms

[2603.04707] Detection of Illicit Content on Online Marketplaces using Large Language Models

Abstract page for arXiv paper 2603.04707: Detection of Illicit Content on Online Marketplaces using Large Language Models

arXiv - AI · 4 min ·
[2603.04698] Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement
Llms

[2603.04698] Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

Abstract page for arXiv paper 2603.04698: Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

arXiv - AI · 3 min ·
[2603.04678] Optimizing Language Models for Crosslingual Knowledge Consistency
Llms

[2603.04678] Optimizing Language Models for Crosslingual Knowledge Consistency

Abstract page for arXiv paper 2603.04678: Optimizing Language Models for Crosslingual Knowledge Consistency

arXiv - AI · 3 min ·
[2603.04676] Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks
Llms

[2603.04676] Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks

Abstract page for arXiv paper 2603.04676: Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks

arXiv - AI · 3 min ·
[2603.04663] Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector
Llms

[2603.04663] Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector

Abstract page for arXiv paper 2603.04663: Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency H...

arXiv - Machine Learning · 4 min ·
[2603.04597] Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning
Llms

[2603.04597] Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Abstract page for arXiv paper 2603.04597: Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

arXiv - AI · 4 min ·
Previous Page 240 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime