Large Language Models
GPT, Claude, Gemini, and other LLMs
Top This Week
All Content
[2510.20095] BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models
Abstract page for arXiv paper 2510.20095: BioCAP: Exploiting Synthetic Captions Beyond Labels in Biological Foundation Models
[2510.13849] Language steering in latent space to mitigate unintended code-switching
Abstract page for arXiv paper 2510.13849: Language steering in latent space to mitigate unintended code-switching
[2510.08919] PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Vision-Language Representation Learning
Abstract page for arXiv paper 2510.08919: PHyCLIP: $\ell_1$-Product of Hyperbolic Factors Unifies Hierarchy and Compositionality in Visio...
[2506.16411] When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework
Abstract page for arXiv paper 2506.16411: When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework
[2506.05639] FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition
Abstract page for arXiv paper 2506.05639: FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition
[2505.09655] DRA-GRPO: Your GRPO Needs to Know Diverse Reasoning Paths for Mathematical Reasoning
Abstract page for arXiv paper 2505.09655: DRA-GRPO: Your GRPO Needs to Know Diverse Reasoning Paths for Mathematical Reasoning
[2501.06762] Improving the adaptive and continuous learning capabilities of artificial neural networks: Lessons from multi-neuromodulatory dynamics
Abstract page for arXiv paper 2501.06762: Improving the adaptive and continuous learning capabilities of artificial neural networks: Less...
[2602.11761] MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling
Abstract page for arXiv paper 2602.11761: MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling
[2602.10609] Online Causal Kalman Filtering for Stable and Effective Policy Optimization
Abstract page for arXiv paper 2602.10609: Online Causal Kalman Filtering for Stable and Effective Policy Optimization
[2602.02185] Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models
Abstract page for arXiv paper 2602.02185: Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Langua...
[2602.01701] Beyond Single-Modal Analytics: A Framework for Integrating Heterogeneous LLM-Based Query Systems for Multi-Modal Data
Abstract page for arXiv paper 2602.01701: Beyond Single-Modal Analytics: A Framework for Integrating Heterogeneous LLM-Based Query System...
[2602.01649] Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning
Abstract page for arXiv paper 2602.01649: Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning
[2602.00428] When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems
Abstract page for arXiv paper 2602.00428: When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent S...
[2601.22060] Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
Abstract page for arXiv paper 2601.22060: Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models
[2601.21895] Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text
Abstract page for arXiv paper 2601.21895: Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text
[2602.08324] Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
Abstract page for arXiv paper 2602.08324: Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
[2602.05735] CSRv2: Unlocking Ultra-Sparse Embeddings
Abstract page for arXiv paper 2602.05735: CSRv2: Unlocking Ultra-Sparse Embeddings
[2602.04369] Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis
Abstract page for arXiv paper 2602.04369: Multi-scale hypergraph meets LLMs: Aligning large language models for time series analysis
[2602.02742] Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding
Abstract page for arXiv paper 2602.02742: Entropy-Guided Dynamic Tokens for Graph-LLM Alignment in Molecular Understanding
[2602.02555] Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards
Abstract page for arXiv paper 2602.02555: Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Rein...
Related Topics
Stay updated with AI News
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime