Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Anthropic Teams Up With Its Rivals to Keep AI From Hacking Everything | WIRED
Llms

Anthropic Teams Up With Its Rivals to Keep AI From Hacking Everything | WIRED

The AI lab's Project Glasswing will bring together Apple, Google, and more than 45 other organizations. They'll use the new Claude Mythos...

Wired - AI · 7 min ·
Llms

The public needs to control AI-run infrastructure, labor, education, and governance— NOT private actors

A lot of discussion around AI is becoming siloed, and I think that is dangerous. People in AI-focused spaces often talk as if the only qu...

Reddit - Artificial Intelligence · 1 min ·
Llms

Agents that write their own code at runtime and vote on capabilities, no human in the loop

hollowOS just hit v4.4 and I added something that I haven’t seen anyone else do. Previous versions gave you an OS for agents: structured ...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.05210] Balancing Coverage and Draft Latency in Vocabulary Trimming for Faster Speculative Decoding
Llms

[2603.05210] Balancing Coverage and Draft Latency in Vocabulary Trimming for Faster Speculative Decoding

Abstract page for arXiv paper 2603.05210: Balancing Coverage and Draft Latency in Vocabulary Trimming for Faster Speculative Decoding

arXiv - Machine Learning · 4 min ·
[2603.05299] WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation
Llms

[2603.05299] WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation

Abstract page for arXiv paper 2603.05299: WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation

arXiv - Machine Learning · 3 min ·
[2603.05167] C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reasoning
Llms

[2603.05167] C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reasoning

Abstract page for arXiv paper 2603.05167: C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reas...

arXiv - AI · 3 min ·
[2603.05121] Measuring the Redundancy of Decoder Layers in SpeechLLMs
Llms

[2603.05121] Measuring the Redundancy of Decoder Layers in SpeechLLMs

Abstract page for arXiv paper 2603.05121: Measuring the Redundancy of Decoder Layers in SpeechLLMs

arXiv - AI · 3 min ·
[2603.04982] Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis
Llms

[2603.04982] Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

Abstract page for arXiv paper 2603.04982: Training for Technology: Adoption and Productive Use of Generative AI in Legal Analysis

arXiv - AI · 4 min ·
[2603.04976] 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
Llms

[2603.04976] 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding

Abstract page for arXiv paper 2603.04976: 3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding

arXiv - AI · 4 min ·
[2603.04968] When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger
Llms

[2603.04968] When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger

Abstract page for arXiv paper 2603.04968: When Weak LLMs Speak with Confidence, Preference Alignment Gets Stronger

arXiv - AI · 3 min ·
[2603.04918] BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning
Llms

[2603.04918] BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Abstract page for arXiv paper 2603.04918: BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforc...

arXiv - Machine Learning · 3 min ·
[2603.04893] Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models
Llms

[2603.04893] Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

Abstract page for arXiv paper 2603.04893: Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

arXiv - AI · 4 min ·
[2603.04819] On the Strengths and Weaknesses of Data for Open-set Embodied Assistance
Llms

[2603.04819] On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

Abstract page for arXiv paper 2603.04819: On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

arXiv - Machine Learning · 4 min ·
[2603.04805] Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation
Llms

[2603.04805] Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation

Abstract page for arXiv paper 2603.04805: Attention's Gravitational Field:A Power-Law Interpretation of Positional Correlation

arXiv - AI · 3 min ·
[2603.04799] Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm
Llms

[2603.04799] Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm

Abstract page for arXiv paper 2603.04799: Beyond Linear LLM Invocation: An Efficient and Effective Semantic Filter Paradigm

arXiv - AI · 4 min ·
[2603.04772] TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings
Llms

[2603.04772] TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

Abstract page for arXiv paper 2603.04772: TSEmbed: Unlocking Task Scaling in Universal Multimodal Embeddings

arXiv - AI · 3 min ·
[2603.04763] Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary
Llms

[2603.04763] Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary

Abstract page for arXiv paper 2603.04763: Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary

arXiv - Machine Learning · 4 min ·
[2603.04743] DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval
Llms

[2603.04743] DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

Abstract page for arXiv paper 2603.04743: DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval

arXiv - AI · 4 min ·
[2603.04759] Stacked from One: Multi-Scale Self-Injection for Context Window Extension
Llms

[2603.04759] Stacked from One: Multi-Scale Self-Injection for Context Window Extension

Abstract page for arXiv paper 2603.04759: Stacked from One: Multi-Scale Self-Injection for Context Window Extension

arXiv - AI · 4 min ·
[2603.04727] Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild
Llms

[2603.04727] Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in the Wild

Abstract page for arXiv paper 2603.04727: Are Multimodal LLMs Ready for Surveillance? A Reality Check on Zero-Shot Anomaly Detection in t...

arXiv - AI · 4 min ·
[2603.04707] Detection of Illicit Content on Online Marketplaces using Large Language Models
Llms

[2603.04707] Detection of Illicit Content on Online Marketplaces using Large Language Models

Abstract page for arXiv paper 2603.04707: Detection of Illicit Content on Online Marketplaces using Large Language Models

arXiv - AI · 4 min ·
[2603.04698] Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement
Llms

[2603.04698] Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

Abstract page for arXiv paper 2603.04698: Hate Speech Detection using Large Language Models with Data Augmentation and Feature Enhancement

arXiv - AI · 3 min ·
[2603.04678] Optimizing Language Models for Crosslingual Knowledge Consistency
Llms

[2603.04678] Optimizing Language Models for Crosslingual Knowledge Consistency

Abstract page for arXiv paper 2603.04678: Optimizing Language Models for Crosslingual Knowledge Consistency

arXiv - AI · 3 min ·
Previous Page 109 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime