Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

[2603.18532] Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds
Llms

[2603.18532] Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds

Abstract page for arXiv paper 2603.18532: Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds

arXiv - Machine Learning · 4 min ·
[2603.12702] FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning
Llms

[2603.12702] FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning

Abstract page for arXiv paper 2603.12702: FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning

arXiv - Machine Learning · 4 min ·
[2603.12681] Colluding LoRA: A Compositional Vulnerability in LLM Safety Alignment
Llms

[2603.12681] Colluding LoRA: A Compositional Vulnerability in LLM Safety Alignment

Abstract page for arXiv paper 2603.12681: Colluding LoRA: A Compositional Vulnerability in LLM Safety Alignment

arXiv - Machine Learning · 3 min ·

All Content

[2603.21389] Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models
Llms

[2603.21389] Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

Abstract page for arXiv paper 2603.21389: Task-Specific Efficiency Analysis: When Small Language Models Outperform Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.21335] TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Protocols
Llms

[2603.21335] TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Protocols

Abstract page for arXiv paper 2603.21335: TimeTox: An LLM-Based Pipeline for Automated Extraction of Time Toxicity from Clinical Trial Pr...

arXiv - Machine Learning · 4 min ·
[2603.21033] TabPFN Extensions for Interpretable Geotechnical Modelling
Llms

[2603.21033] TabPFN Extensions for Interpretable Geotechnical Modelling

Abstract page for arXiv paper 2603.21033: TabPFN Extensions for Interpretable Geotechnical Modelling

arXiv - Machine Learning · 4 min ·
[2603.20975] DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles
Llms

[2603.20975] DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

Abstract page for arXiv paper 2603.20975: DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles

arXiv - Machine Learning · 3 min ·
[2603.20895] LLM Router: Prefill is All You Need
Llms

[2603.20895] LLM Router: Prefill is All You Need

Abstract page for arXiv paper 2603.20895: LLM Router: Prefill is All You Need

arXiv - Machine Learning · 3 min ·
[2603.20808] Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models
Llms

[2603.20808] Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models

Abstract page for arXiv paper 2603.20808: Predictive Regularization Against Visual Representation Degradation in Multimodal Large Languag...

arXiv - Machine Learning · 4 min ·
[2603.20799] RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution
Llms

[2603.20799] RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution

Abstract page for arXiv paper 2603.20799: RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a...

arXiv - Machine Learning · 4 min ·
[2603.20389] A chemical language model for reticular materials design
Llms

[2603.20389] A chemical language model for reticular materials design

Abstract page for arXiv paper 2603.20389: A chemical language model for reticular materials design

arXiv - Machine Learning · 4 min ·
[2603.20314] VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs
Llms

[2603.20314] VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs

Abstract page for arXiv paper 2603.20314: VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs

arXiv - Machine Learning · 3 min ·
[2603.20219] Thinking into the Future: Latent Lookahead Training for Transformers
Llms

[2603.20219] Thinking into the Future: Latent Lookahead Training for Transformers

Abstract page for arXiv paper 2603.20219: Thinking into the Future: Latent Lookahead Training for Transformers

arXiv - Machine Learning · 4 min ·
[2603.20218] An experimental study of KV cache reuse strategies in chunk-level caching systems
Llms

[2603.20218] An experimental study of KV cache reuse strategies in chunk-level caching systems

Abstract page for arXiv paper 2603.20218: An experimental study of KV cache reuse strategies in chunk-level caching systems

arXiv - Machine Learning · 3 min ·
[2603.20215] Multi-Agent Debate with Memory Masking
Llms

[2603.20215] Multi-Agent Debate with Memory Masking

Abstract page for arXiv paper 2603.20215: Multi-Agent Debate with Memory Masking

arXiv - Machine Learning · 4 min ·
[2603.20212] Fast-Slow Thinking RM: Efficient Integration of Scalar and Generative Reward Models
Llms

[2603.20212] Fast-Slow Thinking RM: Efficient Integration of Scalar and Generative Reward Models

Abstract page for arXiv paper 2603.20212: Fast-Slow Thinking RM: Efficient Integration of Scalar and Generative Reward Models

arXiv - Machine Learning · 3 min ·
[2603.20217] Expected Reward Prediction, with Applications to Model Routing
Llms

[2603.20217] Expected Reward Prediction, with Applications to Model Routing

Abstract page for arXiv paper 2603.20217: Expected Reward Prediction, with Applications to Model Routing

arXiv - Machine Learning · 4 min ·
[2603.22206] Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs
Llms

[2603.22206] Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs

Abstract page for arXiv paper 2603.22206: Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs

arXiv - Machine Learning · 4 min ·
[2603.22184] Revisiting Quantum Code Generation: Where Should Domain Knowledge Live?
Llms

[2603.22184] Revisiting Quantum Code Generation: Where Should Domain Knowledge Live?

Abstract page for arXiv paper 2603.22184: Revisiting Quantum Code Generation: Where Should Domain Knowledge Live?

arXiv - Machine Learning · 4 min ·
[2603.22161] Causal Evidence that Language Models use Confidence to Drive Behavior
Llms

[2603.22161] Causal Evidence that Language Models use Confidence to Drive Behavior

Abstract page for arXiv paper 2603.22161: Causal Evidence that Language Models use Confidence to Drive Behavior

arXiv - Machine Learning · 4 min ·
[2603.22154] dynActivation: A Trainable Activation Family for Adaptive Nonlinearity
Llms

[2603.22154] dynActivation: A Trainable Activation Family for Adaptive Nonlinearity

Abstract page for arXiv paper 2603.22154: dynActivation: A Trainable Activation Family for Adaptive Nonlinearity

arXiv - Machine Learning · 3 min ·
[2603.22017] AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing
Llms

[2603.22017] AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing

Abstract page for arXiv paper 2603.22017: AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing

arXiv - Machine Learning · 3 min ·
[2603.21972] Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe
Llms

[2603.21972] Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

Abstract page for arXiv paper 2603.21972: Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe

arXiv - Machine Learning · 4 min ·
Previous Page 35 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime