[2603.28233] TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation
Abstract page for arXiv paper 2603.28233: TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation
Abstract page for arXiv paper 2603.28233: TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation
Abstract page for arXiv paper 2603.28217: An Optimal Battery-Free Approach for Emission Reduction by Storing Solar Surplus in Building Th...
Abstract page for arXiv paper 2603.28166: Evaluating Privilege Usage of Agents on Real-World Tools
Abstract page for arXiv paper 2603.28196: Designing AI for Real Users -- Accessibility Gaps in Retail AI Front-End
Abstract page for arXiv paper 2603.28142: RecycleLoRA: Rank-Revealing QR-Based Dual-LoRA Subspace Adaptation for Domain Generalized Seman...
Abstract page for arXiv paper 2603.28123: Does Claude's Constitution Have a Culture?
Abstract page for arXiv paper 2603.28130: MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios
Abstract page for arXiv paper 2603.28122: Q-DIVER: Integrated Quantum Transfer Learning and Differentiable Quantum Architecture Search wi...
Abstract page for arXiv paper 2603.28108: Quid est VERITAS? A Modular Framework for Archival Document Analysis
Abstract page for arXiv paper 2603.28103: Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models
Abstract page for arXiv paper 2603.28086: MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions
Abstract page for arXiv paper 2603.28069: MolmoPoint: Better Pointing for VLMs with Grounding Tokens
Abstract page for arXiv paper 2603.28066: Synonymix: Unified Group Personas for Generative Simulations
Abstract page for arXiv paper 2603.28032: CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied ...
Abstract page for arXiv paper 2603.27987: Beyond Dataset Distillation: Lossless Dataset Concentration via Diffusion-Assisted Distribution...
Abstract page for arXiv paper 2603.27991: ViviDoc: Generating Interactive Documents through Human-Agent Collaboration
Abstract page for arXiv paper 2603.27918: Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey
Abstract page for arXiv paper 2603.27982: CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Visio...
Abstract page for arXiv paper 2603.27942: JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding
Abstract page for arXiv paper 2603.27817: Towards Context-Aware Image Anonymization with Multi-Agent Reasoning