Content Feed

The latest content from across the network

[2512.22065] StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
Machine Learning

[2512.22065] StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars

Abstract page for arXiv paper 2512.22065: StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars

arXiv - AI · 4 min ·
[2512.17396] RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering
Data Science

[2512.17396] RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

Abstract page for arXiv paper 2512.17396: RadImageNet-VQA: A Large-Scale CT and MRI Dataset for Radiologic Visual Question Answering

arXiv - AI · 3 min ·
[2511.23342] Overcoming the Curvature Bottleneck in MeanFlow
Machine Learning

[2511.23342] Overcoming the Curvature Bottleneck in MeanFlow

Abstract page for arXiv paper 2511.23342: Overcoming the Curvature Bottleneck in MeanFlow

arXiv - AI · 4 min ·
[2512.12812] Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, and LLaMA
Llms

[2512.12812] Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, and LLaMA

Abstract page for arXiv paper 2512.12812: Does Tone Change the Answer? Evaluating Prompt Politeness Effects on Modern LLMs: GPT, Gemini, ...

arXiv - AI · 4 min ·
[2512.10932] BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models
Llms

[2512.10932] BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models

Abstract page for arXiv paper 2512.10932: BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation M...

arXiv - AI · 4 min ·
[2512.08503] Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models
Machine Learning

[2512.08503] Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal Reasoning Models

Abstract page for arXiv paper 2512.08503: Disrupting Hierarchical Reasoning: Adversarial Protection for Geographic Privacy in Multimodal ...

arXiv - AI · 4 min ·
[2512.05658] Multilingual Medical Reasoning for Question Answering with Large Language Models
Llms

[2512.05658] Multilingual Medical Reasoning for Question Answering with Large Language Models

Abstract page for arXiv paper 2512.05658: Multilingual Medical Reasoning for Question Answering with Large Language Models

arXiv - AI · 4 min ·
[2511.21428] From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings
Machine Learning

[2511.21428] From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in Industrial Settings

Abstract page for arXiv paper 2511.21428: From Observation to Action: Latent Action-based Primitive Segmentation for VLA Pre-training in ...

arXiv - AI · 4 min ·
[2511.16719] SAM 3: Segment Anything with Concepts
Machine Learning

[2511.16719] SAM 3: Segment Anything with Concepts

Abstract page for arXiv paper 2511.16719: SAM 3: Segment Anything with Concepts

arXiv - AI · 4 min ·
[2511.16681] Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search
Llms

[2511.16681] Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search

Abstract page for arXiv paper 2511.16681: Towards Hyper-Efficient RAG Systems in VecDBs: Distributed Parallel Multi-Resolution Vector Search

arXiv - AI · 4 min ·
[2511.15090] SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning
Machine Learning

[2511.15090] SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning

Abstract page for arXiv paper 2511.15090: SciEGQA: A Dataset for Scientific Evidence-Grounded Question Answering and Reasoning

arXiv - AI · 4 min ·
[2511.11483] ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation
Machine Learning

[2511.11483] ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

Abstract page for arXiv paper 2511.11483: ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation

arXiv - AI · 4 min ·
[2511.10696] $π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling
Machine Learning

[2511.10696] $π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling

Abstract page for arXiv paper 2511.10696: $π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling

arXiv - AI · 4 min ·
[2511.10465] Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks
Llms

[2511.10465] Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks

Abstract page for arXiv paper 2511.10465: Beyond Elicitation: Provision-based Prompt Optimization for Knowledge-Intensive Tasks

arXiv - AI · 3 min ·
[2511.07014] Diffolio: A Diffusion Model for Multivariate Probabilistic Financial Time-Series Forecasting and Portfolio Construction
Machine Learning

[2511.07014] Diffolio: A Diffusion Model for Multivariate Probabilistic Financial Time-Series Forecasting and Portfolio Construction

Abstract page for arXiv paper 2511.07014: Diffolio: A Diffusion Model for Multivariate Probabilistic Financial Time-Series Forecasting an...

arXiv - AI · 4 min ·
[2510.20351] Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models
Llms

[2510.20351] Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models

Abstract page for arXiv paper 2510.20351: Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models

arXiv - AI · 4 min ·
[2510.15681] ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings
Nlp

[2510.15681] ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings

Abstract page for arXiv paper 2510.15681: ProofBridge: Auto-Formalization of Natural Language Proofs in Lean via Joint Embeddings

arXiv - AI · 4 min ·
[2510.16518] DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation
Nlp

[2510.16518] DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation

Abstract page for arXiv paper 2510.16518: DIV-Nav: Open-Vocabulary Spatial Relationships for Multi-Object Navigation

arXiv - AI · 4 min ·
[2510.13905] Schema for In-Context Learning
Llms

[2510.13905] Schema for In-Context Learning

Abstract page for arXiv paper 2510.13905: Schema for In-Context Learning

arXiv - AI · 4 min ·
[2510.13044] SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion
Generative Ai

[2510.13044] SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion

Abstract page for arXiv paper 2510.13044: SceneAdapt: Scene-aware Adaptation of Human Motion Diffusion

arXiv - AI · 4 min ·