[2512.22065] StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
Abstract page for arXiv paper 2512.22065: StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
Abstract page for arXiv paper 2512.22065: StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
Abstract page for arXiv paper 2512.17396: RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering
Abstract page for arXiv paper 2511.23342: Overcoming the Curvature Bottleneck in MeanFlow
Abstract page for arXiv paper 2512.12812: Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, ...
Abstract page for arXiv paper 2512.10932: BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation M...
Abstract page for arXiv paper 2512.08503: Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal ...
Abstract page for arXiv paper 2512.05658: Multilingual Medical Reasoning for Question Answering with Large Language Models
Abstract page for arXiv paper 2511.21428: From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in ...
Abstract page for arXiv paper 2511.16719: SAM 3: Segment Anything with Concepts
Abstract page for arXiv paper 2511.16681: Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search
Abstract page for arXiv paper 2511.15090: SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning
Abstract page for arXiv paper 2511.11483: ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation
Abstract page for arXiv paper 2511.10696: $π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling
Abstract page for arXiv paper 2511.10465: Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Abstract page for arXiv paper 2511.07014: Diffolio: A Diffusion Model for Multivariate Probabilistic Financial Time-Series Forecasting an...
Abstract page for arXiv paper 2510.20351: Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models
Abstract page for arXiv paper 2510.15681: ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings
Abstract page for arXiv paper 2510.16518: DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation
Abstract page for arXiv paper 2510.13905: Schema for In-Context Learning
Abstract page for arXiv paper 2510.13044: SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion