Content Feed

The latest content from across the network

[2603.27918] Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey
Llms

[2603.27918] Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey

Abstract page for arXiv paper 2603.27918: Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey

arXiv - AI · 3 min ·
[2603.27982] CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Vision-Language Models
Llms

[2603.27982] CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Vision-Language Models

Abstract page for arXiv paper 2603.27982: CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Visio...

arXiv - AI · 4 min ·
[2603.27942] JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding
Llms

[2603.27942] JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding

Abstract page for arXiv paper 2603.27942: JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding

arXiv - AI · 4 min ·
[2603.27817] Towards Context-Aware Image Anonymization with Multi-Agent Reasoning
Ai Agents

[2603.27817] Towards Context-Aware Image Anonymization with Multi-Agent Reasoning

Abstract page for arXiv paper 2603.27817: Towards Context-Aware Image Anonymization with Multi-Agent Reasoning

arXiv - AI · 4 min ·
[2603.27886] AI-ready design of realistic 2D materials and interfaces with Mat3ra-2D
Machine Learning

[2603.27886] AI-ready design of realistic 2D materials and interfaces with Mat3ra-2D

Abstract page for arXiv paper 2603.27886: AI-ready design of realistic 2D materials and interfaces with Mat3ra-2D

arXiv - AI · 3 min ·
[2603.27868] A Revealed Preference Framework for AI Alignment
Machine Learning

[2603.27868] A Revealed Preference Framework for AI Alignment

Abstract page for arXiv paper 2603.27868: A Revealed Preference Framework for AI Alignment

arXiv - AI · 3 min ·
[2603.27798] Towards Emotion Recognition with 3D Pointclouds Obtained from Facial Expression Images
Machine Learning

[2603.27798] Towards Emotion Recognition with 3D Pointclouds Obtained from Facial Expression Images

Abstract page for arXiv paper 2603.27798: Towards Emotion Recognition with 3D Pointclouds Obtained from Facial Expression Images

arXiv - AI · 4 min ·
[2603.27756] Heracles: Bridging Precise Tracking and Generative Synthesis for General Humanoid Control
Robotics

[2603.27756] Heracles: Bridging Precise Tracking and Generative Synthesis for General Humanoid Control

Abstract page for arXiv paper 2603.27756: Heracles: Bridging Precise Tracking and Generative Synthesis for General Humanoid Control

arXiv - AI · 4 min ·
[2603.27747] AI-Powered Facial Mask Removal Is Not Suitable For Biometric Identification

[2603.27747] AI-Powered Facial Mask Removal Is Not Suitable For Biometric Identification

Abstract page for arXiv paper 2603.27747: AI-Powered Facial Mask Removal Is Not Suitable For Biometric Identification

arXiv - AI · 3 min ·
[2603.27745] Needle in the Repo: A Benchmark for Maintainability in AI-Generated Repository Edits
Ai Startups

[2603.27745] Needle in the Repo: A Benchmark for Maintainability in AI-Generated Repository Edits

Abstract page for arXiv paper 2603.27745: Needle in the Repo: A Benchmark for Maintainability in AI-Generated Repository Edits

arXiv - AI · 4 min ·
[2603.27727] Suppression of $^{14}\mathrm{C}$ photon hits in large liquid scintillator detectors via spatiotemporal deep learning
Machine Learning

[2603.27727] Suppression of $^{14}\mathrm{C}$ photon hits in large liquid scintillator detectors via spatiotemporal deep learning

Abstract page for arXiv paper 2603.27727: Suppression of $^{14}\mathrm{C}$ photon hits in large liquid scintillator detectors via spatiot...

arXiv - AI · 4 min ·
[2603.27716] The role of neuromorphic principles in the future of biomedicine and healthcare

[2603.27716] The role of neuromorphic principles in the future of biomedicine and healthcare

Abstract page for arXiv paper 2603.27716: The role of neuromorphic principles in the future of biomedicine and healthcare

arXiv - AI · 3 min ·
[2603.27705] RAP: Retrieve, Adapt, and Prompt-Fit for Training-Free Few-Shot Medical Image Segmentation
Machine Learning

[2603.27705] RAP: Retrieve, Adapt, and Prompt-Fit for Training-Free Few-Shot Medical Image Segmentation

Abstract page for arXiv paper 2603.27705: RAP: Retrieve, Adapt, and Prompt-Fit for Training-Free Few-Shot Medical Image Segmentation

arXiv - AI · 4 min ·
[2603.27670] ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation
Machine Learning

[2603.27670] ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation

Abstract page for arXiv paper 2603.27670: ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation

arXiv - AI · 3 min ·
[2603.27667] EvA: An Evidence-First Audio Understanding Paradigm for LALMs
Llms

[2603.27667] EvA: An Evidence-First Audio Understanding Paradigm for LALMs

Abstract page for arXiv paper 2603.27667: EvA: An Evidence-First Audio Understanding Paradigm for LALMs

arXiv - AI · 3 min ·
[2603.27632] ContraMap: Contrastive Uncertainty Mapping for Robot Environment Representation
Robotics

[2603.27632] ContraMap: Contrastive Uncertainty Mapping for Robot Environment Representation

Abstract page for arXiv paper 2603.27632: ContraMap: Contrastive Uncertainty Mapping for Robot Environment Representation

arXiv - AI · 3 min ·
[2603.27626] Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents
Llms

[2603.27626] Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents

Abstract page for arXiv paper 2603.27626: Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents

arXiv - AI · 3 min ·
[2603.27624] Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling
Machine Learning

[2603.27624] Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling

Abstract page for arXiv paper 2603.27624: Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynam...

arXiv - AI · 4 min ·
[2603.27593] STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding
Llms

[2603.27593] STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding

Abstract page for arXiv paper 2603.27593: STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding

arXiv - AI · 3 min ·
[2603.27563] InnerPond: Fostering Inter-Self Dialogue with a Multi-Agent Approach for Introspection
Ai Agents

[2603.27563] InnerPond: Fostering Inter-Self Dialogue with a Multi-Agent Approach for Introspection

Abstract page for arXiv paper 2603.27563: InnerPond: Fostering Inter-Self Dialogue with a Multi-Agent Approach for Introspection

arXiv - AI · 4 min ·