Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Claude Mythos and misguided open-weight fearmongering

AI Tools & Products · 9 min ·

Anthropic Agrees to Rent CoreWeave AI Capacity to Power Claude

AI Tools & Products · 1 min ·

CoreWeave strikes a deal to power Anthropic's Claude AI models — and the stock surges 12%

AI Tools & Products · 3 min ·

All Content

[2510.20095] BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models

arXiv - Machine Learning · 4 min ·

[2510.13849] Language steering in latent space to mitigate unintended code-switching

arXiv - Machine Learning · 3 min ·

[2510.08919] PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Vision-Language Representation Learning

arXiv - Machine Learning · 4 min ·

[2506.16411] When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework

arXiv - Machine Learning · 4 min ·

[2506.05639] FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

arXiv - Machine Learning · 3 min ·

[2505.09655] DRA-GRPO: Your GRPO Needs to Know Diverse Reasoning Paths for Mathematical Reasoning

arXiv - Machine Learning · 4 min ·

[2501.06762] Improving the adaptive and continuous learning capabilities of artificial neural networks: Lessons from multi-neuromodulatory dynamics

arXiv - Machine Learning · 4 min ·

[2602.11761] MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling

arXiv - Machine Learning · 4 min ·

[2602.10609] Online Causal Kalman Filtering for Stable and Effective Policy Optimization

arXiv - AI · 4 min ·

[2602.02185] Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

arXiv - Machine Learning · 4 min ·

[2602.01701] Beyond Single-Modal Analytics: A Framework for Integrating Heterogeneous LLM-Based Query Systems for Multi-Modal Data

arXiv - AI · 4 min ·

[2602.01649] Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning

arXiv - AI · 4 min ·

[2602.00428] When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems

arXiv - AI · 4 min ·

[2601.22060] Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

arXiv - AI · 4 min ·

[2601.21895] Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text

arXiv - AI · 4 min ·

[2602.08324] Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

arXiv - Machine Learning · 4 min ·

[2602.05735] CSRv2: Unlocking Ultra-Sparse Embeddings

arXiv - Machine Learning · 4 min ·

[2602.04369] Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis

arXiv - Machine Learning · 4 min ·

[2602.02742] Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding

arXiv - Machine Learning · 3 min ·

[2602.02555] Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards

arXiv - Machine Learning · 4 min ·