[2603.27343] Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance
Abstract page for arXiv paper 2603.27343: Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance
Abstract page for arXiv paper 2603.27343: Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance
Abstract page for arXiv paper 2603.27338: CounterMoral: Editing Morals in Language Models
Abstract page for arXiv paper 2603.27314: TokenDance: Token-to-Token Music-to-Dance Generation with Bidirectional Mamba
Abstract page for arXiv paper 2603.27304: EpochX: Building the Infrastructure for an Emergent Agent Civilization
Abstract page for arXiv paper 2603.27150: MediHive: A Decentralized Agent Collective for Medical Reasoning
Abstract page for arXiv paper 2603.27270: Quantification of Credal Uncertainty: A Distance-Based Approach
Abstract page for arXiv paper 2603.27303: Self-evolving AI agents for protein discovery and directed evolution
Abstract page for arXiv paper 2603.27195: AutoMS: Multi-Agent Evolutionary Search for Cross-Physics Inverse Microstructure Design
Abstract page for arXiv paper 2603.27169: Aligning LLMs with Graph Neural Solvers for Combinatorial Optimization
Abstract page for arXiv paper 2603.27116: The Price of Meaning: Why Every Semantic Memory System Forgets
Abstract page for arXiv paper 2603.27164: daVinci-LLM:Towards the Science of Pretraining
Abstract page for arXiv paper 2603.27076: When Verification Hurts: Asymmetric Effects of Multi-Agent Feedback in Logic Proof Tutoring
Abstract page for arXiv paper 2603.26948: Compliance-Aware Predictive Process Monitoring: A Neuro-Symbolic Approach
Abstract page for arXiv paper 2603.26765: Bitboard version of Tetris AI
Abstract page for arXiv paper 2603.26944: Neuro-Symbolic Learning for Predictive Process Monitoring via Two-Stage Logic Tensor Networks w...
Abstract page for arXiv paper 2603.18532: Scaling Sim-to-Real Reinforcement Learning for Robot VLAs with Generative 3D Worlds
Abstract page for arXiv paper 2603.12702: FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning
Abstract page for arXiv paper 2603.12681: Colluding LoRA: A Compositional Vulnerability in LLM Safety Alignment
Abstract page for arXiv paper 2603.09645: Noise in Photonic Quantum Machine Learning: Models, Impacts, and Mitigation Strategies
Abstract page for arXiv paper 2602.08961: MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE