Content Feed

The latest content from across the network

[2603.28625] Dynamic Lookahead Distance via Reinforcement Learning-Based Pure Pursuit for Autonomous Racing
Robotics

[2603.28625] Dynamic Lookahead Distance via Reinforcement Learning-Based Pure Pursuit for Autonomous Racing

Abstract page for arXiv paper 2603.28625: Dynamic Lookahead Distance via Reinforcement Learning-Based Pure Pursuit for Autonomous Racing

arXiv - AI · 4 min ·
[2603.28696] AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
Llms

[2603.28696] AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding

Abstract page for arXiv paper 2603.28696: AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding

arXiv - AI · 4 min ·
[2603.28610] ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning
Llms

[2603.28610] ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning

Abstract page for arXiv paper 2603.28610: ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning

arXiv - AI · 4 min ·
[2603.28622] Trust-Aware Routing for Distributed Generative AI Inference at the Edge
Machine Learning

[2603.28622] Trust-Aware Routing for Distributed Generative AI Inference at the Edge

Abstract page for arXiv paper 2603.28622: Trust-Aware Routing for Distributed Generative AI Inference at the Edge

arXiv - AI · 4 min ·
[2603.28613] TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark
Generative Ai

[2603.28613] TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark

Abstract page for arXiv paper 2603.28613: TGIF2: Extended Text-Guided Inpainting Forgery Dataset & Benchmark

arXiv - AI · 4 min ·
[2603.28596] Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection
Llms

[2603.28596] Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection

Abstract page for arXiv paper 2603.28596: Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection

arXiv - AI · 4 min ·
[2603.28554] Hydra: Unifying Document Retrieval and Generation in a Single Vision-Language Model
Llms

[2603.28554] Hydra: Unifying Document Retrieval and Generation in a Single Vision-Language Model

Abstract page for arXiv paper 2603.28554: Hydra: Unifying Document Retrieval and Generation in a Single Vision-Language Model

arXiv - AI · 4 min ·
[2603.28583] Navigating the Mirage: A Dual-Path Agentic Framework for Robust Misleading Chart Question Answering
Llms

[2603.28583] Navigating the Mirage: A Dual-Path Agentic Framework for Robust Misleading Chart Question Answering

Abstract page for arXiv paper 2603.28583: Navigating the Mirage: A Dual-Path Agentic Framework for Robust Misleading Chart Question Answe...

arXiv - AI · 4 min ·
[2603.28594] Detection of Adversarial Attacks in Robotic Perception
Machine Learning

[2603.28594] Detection of Adversarial Attacks in Robotic Perception

Abstract page for arXiv paper 2603.28594: Detection of Adversarial Attacks in Robotic Perception

arXiv - AI · 3 min ·
[2603.28561] Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems
Llms

[2603.28561] Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aerial Systems

Abstract page for arXiv paper 2603.28561: Fine-Tuning Large Language Models for Cooperative Tactical Deconfliction of Small Unmanned Aeri...

arXiv - AI · 4 min ·
[2603.28555] Domain-Invariant Prompt Learning for Vision-Language Models
Llms

[2603.28555] Domain-Invariant Prompt Learning for Vision-Language Models

Abstract page for arXiv paper 2603.28555: Domain-Invariant Prompt Learning for Vision-Language Models

arXiv - AI · 3 min ·
[2603.28498] MRI-to-CT synthesis using drifting models
Machine Learning

[2603.28498] MRI-to-CT synthesis using drifting models

Abstract page for arXiv paper 2603.28498: MRI-to-CT synthesis using drifting models

arXiv - AI · 4 min ·
[2603.28488] Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification
Llms

[2603.28488] Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification

Abstract page for arXiv paper 2603.28488: Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Cl...

arXiv - AI · 3 min ·
[2603.28371] Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure
Llms

[2603.28371] Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure

Abstract page for arXiv paper 2603.28371: Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure

arXiv - AI · 4 min ·
[2603.28474] CiQi-Agent: Aligning Vision, Tools and Aesthetics in Multimodal Agent for Cultural Reasoning on Chinese Porcelains
Computer Vision

[2603.28474] CiQi-Agent: Aligning Vision, Tools and Aesthetics in Multimodal Agent for Cultural Reasoning on Chinese Porcelains

Abstract page for arXiv paper 2603.28474: CiQi-Agent: Aligning Vision, Tools and Aesthetics in Multimodal Agent for Cultural Reasoning on...

arXiv - AI · 4 min ·
[2603.28431] GeoHCC: Local Geometry-Aware Hierarchical Context Compression for 3D Gaussian Splatting
Machine Learning

[2603.28431] GeoHCC: Local Geometry-Aware Hierarchical Context Compression for 3D Gaussian Splatting

Abstract page for arXiv paper 2603.28431: GeoHCC: Local Geometry-Aware Hierarchical Context Compression for 3D Gaussian Splatting

arXiv - AI · 3 min ·
[2603.28429] AceleradorSNN: A Neuromorphic Cognitive System Integrating Spiking Neural Networks and DynamicImage Signal Processing on FPGA
Machine Learning

[2603.28429] AceleradorSNN: A Neuromorphic Cognitive System Integrating Spiking Neural Networks and DynamicImage Signal Processing on FPGA

Abstract page for arXiv paper 2603.28429: AceleradorSNN: A Neuromorphic Cognitive System Integrating Spiking Neural Networks and DynamicI...

arXiv - AI · 3 min ·
[2603.28251] DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reasoning
Llms

[2603.28251] DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reasoning

Abstract page for arXiv paper 2603.28251: DiffAttn: Diffusion-Based Drivers' Visual Attention Prediction with LLM-Enhanced Semantic Reaso...

arXiv - AI · 4 min ·
[2603.28421] Learning unified control of internal spin squeezing in atomic qudits for magnetometry
Ai Infrastructure

[2603.28421] Learning unified control of internal spin squeezing in atomic qudits for magnetometry

Abstract page for arXiv paper 2603.28421: Learning unified control of internal spin squeezing in atomic qudits for magnetometry

arXiv - AI · 4 min ·
[2603.28405] EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation
Machine Learning

[2603.28405] EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation

Abstract page for arXiv paper 2603.28405: EdgeDiT: Hardware-Aware Diffusion Transformers for Efficient On-Device Image Generation

arXiv - AI · 4 min ·