Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Anthropic’s new cybersecurity model could get it back in the government’s good graces | The Verge
Llms

Anthropic’s new cybersecurity model could get it back in the government’s good graces | The Verge

After Anthropic announced Claude Mythos Preview, the Trump administration reportedly took notice. It may inspire change in the Anthropic-...

The Verge - AI · 6 min ·
Llms

What is the current landscape on AI agents knowledge

Recently used "free" rates codex to give me a quick fastapi project sample. It gave me deprecated (a)app.on_event("startup). What are you...

Reddit - Artificial Intelligence · 1 min ·
OpenAI Executive Kevin Weil Is Leaving the Company | WIRED
Llms

OpenAI Executive Kevin Weil Is Leaving the Company | WIRED

The former Instagram VP is departing the ChatGPT-maker, which is folding the AI science application he led into Codex.

Wired - AI · 5 min ·

All Content

[2603.04976] 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
Llms

[2603.04976] 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding

Abstract page for arXiv paper 2603.04976: 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding

arXiv - AI · 4 min ·
[2603.04968] When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger
Llms

[2603.04968] When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger

Abstract page for arXiv paper 2603.04968: When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger

arXiv - AI · 3 min ·
[2603.04918] BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
Llms

[2603.04918] BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Abstract page for arXiv paper 2603.04918: BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforc...

arXiv - Machine Learning · 3 min ·
[2603.04893] Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models
Llms

[2603.04893] Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

Abstract page for arXiv paper 2603.04893: Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

arXiv - AI · 4 min ·
[2603.04819] On the Strengths and Weaknesses of Data for Open-set Embodied Assistance
Llms

[2603.04819] On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

Abstract page for arXiv paper 2603.04819: On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

arXiv - Machine Learning · 4 min ·
[2603.04805] Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation
Llms

[2603.04805] Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation

Abstract page for arXiv paper 2603.04805: Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation

arXiv - AI · 3 min ·
[2603.04799] Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm
Llms

[2603.04799] Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm

Abstract page for arXiv paper 2603.04799: Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm

arXiv - AI · 4 min ·
[2603.04772] TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings
Llms

[2603.04772] TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

Abstract page for arXiv paper 2603.04772: TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

arXiv - AI · 3 min ·
[2603.04763] Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary
Llms

[2603.04763] Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary

Abstract page for arXiv paper 2603.04763: Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary

arXiv - Machine Learning · 4 min ·
[2603.04743] DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval
Llms

[2603.04743] DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

Abstract page for arXiv paper 2603.04743: DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

arXiv - AI · 4 min ·
[2603.04759] Stacked from One: Multi-Scale Self-Injection for Context Window Extension
Llms

[2603.04759] Stacked from One: Multi-Scale Self-Injection for Context Window Extension

Abstract page for arXiv paper 2603.04759: Stacked from One: Multi-Scale Self-Injection for Context Window Extension

arXiv - AI · 4 min ·
[2603.04727] Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild
Llms

[2603.04727] Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild

Abstract page for arXiv paper 2603.04727: Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in t...

arXiv - AI · 4 min ·
[2603.04707] Detection of Illicit Content on Online Marketplaces using Large Language Models
Llms

[2603.04707] Detection of Illicit Content on Online Marketplaces using Large Language Models

Abstract page for arXiv paper 2603.04707: Detection of Illicit Content on Online Marketplaces using Large Language Models

arXiv - AI · 4 min ·
[2603.04698] Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement
Llms

[2603.04698] Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

Abstract page for arXiv paper 2603.04698: Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

arXiv - AI · 3 min ·
[2603.04678] Optimizing Language Models for Crosslingual Knowledge Consistency
Llms

[2603.04678] Optimizing Language Models for Crosslingual Knowledge Consistency

Abstract page for arXiv paper 2603.04678: Optimizing Language Models for Crosslingual Knowledge Consistency

arXiv - AI · 3 min ·
[2603.04676] Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks
Llms

[2603.04676] Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks

Abstract page for arXiv paper 2603.04676: Decoding the Pulse of Reasoning VLMs in Multi-Image Understanding Tasks

arXiv - AI · 3 min ·
[2603.04663] Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector
Llms

[2603.04663] Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency Hallucination Detector

Abstract page for arXiv paper 2603.04663: Neuro-Symbolic Financial Reasoning via Deterministic Fact Ledgers and Adversarial Low-Latency H...

arXiv - Machine Learning · 4 min ·
[2603.04597] Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning
Llms

[2603.04597] Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Abstract page for arXiv paper 2603.04597: Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

arXiv - AI · 4 min ·
[2603.04474] From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration
Llms

[2603.04474] From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration

Abstract page for arXiv paper 2603.04474: From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration

arXiv - AI · 4 min ·
[2603.04464] Understanding the Dynamics of Demonstration Conflict in In-Context Learning
Llms

[2603.04464] Understanding the Dynamics of Demonstration Conflict in In-Context Learning

Abstract page for arXiv paper 2603.04464: Understanding the Dynamics of Demonstration Conflict in In-Context Learning

arXiv - Machine Learning · 4 min ·
Previous Page 187 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime