[R] VOID: Video Object and Interaction Deletion (physically-consistent video inpainting)
We present VOID, a model for video object removal that aims to handle *physical interactions*, not just appearance. Most existing video i...
ML algorithms, training, and inference
We present VOID, a model for video object removal that aims to handle *physical interactions*, not just appearance. Most existing video i...
I sketched a cow and tested how different models interpret it into a realistic image for downstream 3D generation, turns out some models ...
Abstract page for arXiv paper 2603.24936: TIGFlow-GRPO: Trajectory Forecasting via Interaction-Aware Flow Matching and Reward-Driven Opti...
Abstract page for arXiv paper 2603.24626: A Large-Scale Comparative Analysis of Imputation Methods for Single-Cell RNA Sequencing Data
Abstract page for arXiv paper 2603.24617: Multi-LLM Query Optimization
Abstract page for arXiv paper 2603.24898: Sovereign AI at the Front Door of Care: A Physically Unidirectional Architecture for Secure Cli...
Abstract page for arXiv paper 2603.24598: Response-Aware Risk-Constrained Control Barrier Function With Application to Vehicles
Abstract page for arXiv paper 2603.25699: Neural Network Conversion of Machine Learning Pipelines
Abstract page for arXiv paper 2603.24891: Surrogates, Spikes, and Sparsity: Performance Analysis and Characterization of SNN Hyperparamet...
Abstract page for arXiv paper 2603.25692: A Unified Memory Perspective for Probabilistic Trustworthy AI
Abstract page for arXiv paper 2603.25687: On Neural Scaling Laws for Weather Emulation through Continual Training
Abstract page for arXiv paper 2603.25673: Longitudinal Digital Phenotyping for Early Cognitive-Motor Screening
Abstract page for arXiv paper 2603.24857: AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective
Abstract page for arXiv paper 2603.25635: Anchored-Branched Steady-state WInd Flow Transformer (AB-SWIFT): a metamodel for 3D atmospheric...
Abstract page for arXiv paper 2603.24846: NeuroVLM-Bench: Evaluation of Vision-Enabled Large Language Models for Clinical Reasoning in Ne...
Abstract page for arXiv paper 2603.25614: Social Hippocampus Memory Learning
Abstract page for arXiv paper 2603.25561: An Integrative Genome-Scale Metabolic Modeling and Machine Learning Framework for Predicting an...
Abstract page for arXiv paper 2603.25562: Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
Abstract page for arXiv paper 2603.24821: Generative Adversarial Perturbations with Cross-paradigm Transferability on Localized Crowd Cou...
Abstract page for arXiv paper 2603.24806: FODMP: Fast One-Step Diffusion of Movement Primitives Generation for Time-Dependent Robot Actions
Abstract page for arXiv paper 2603.25495: Interpretable PM2.5 Forecasting for Urban Air Quality: A Comparative Study of Operational Time-...
Abstract page for arXiv paper 2603.24804: GoldiCLIP: The Goldilocks Approach for Balancing Explicit Supervision for Language-Image Pretra...
Get the latest news, tools, and insights delivered to your inbox.
Daily or weekly digest • Unsubscribe anytime