[2511.15090] SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning
Abstract page for arXiv paper 2511.15090: SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning
Abstract page for arXiv paper 2511.15090: SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning
Abstract page for arXiv paper 2511.11483: ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation
Abstract page for arXiv paper 2511.10696: $π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling
Abstract page for arXiv paper 2511.10465: Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Abstract page for arXiv paper 2511.07014: Diffolio: A Diffusion Model for Multivariate Probabilistic Financial Time-Series Forecasting an...
Abstract page for arXiv paper 2510.20351: Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models
Abstract page for arXiv paper 2510.15681: ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings
Abstract page for arXiv paper 2510.16518: DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation
Abstract page for arXiv paper 2510.13905: Schema for In-Context Learning
Abstract page for arXiv paper 2510.13044: SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion
Abstract page for arXiv paper 2510.10063: CLMN: Concept based Language Models via Neural Symbolic Reasoning
Abstract page for arXiv paper 2504.19467: BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text
Abstract page for arXiv paper 2510.09328: Randomized HyperSteiner: A Stochastic Delaunay Triangulation Heuristic for the Hyperbolic Stein...
Abstract page for arXiv paper 2510.08553: Dream to Recall: Imagination-Guided Experience Retrieval for Memory-Persistent Vision-and-Langu...
Abstract page for arXiv paper 2510.06961: Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Re...
Abstract page for arXiv paper 2510.05145: Efficient Tree-Structured Deep Research with Adaptive Resource Allocation
Abstract page for arXiv paper 2509.25848: More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models
Abstract page for arXiv paper 2509.23362: Dual-Space Smoothness for Robust and Balanced LLM Unlearning
Abstract page for arXiv paper 2509.16952: AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation
Abstract page for arXiv paper 2509.17292: Multi-View Attention Multiple-Instance Learning Enhanced by LLM Reasoning for Cognitive Distort...