[2510.18573] Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model
Computer Science > Computer Vision and Pattern Recognition
arXiv:2510.18573 (cs)
[Submitted on 21 Oct 2025 (v1), last revised 4 Mar 2026 (this version, v2)]
Title: Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model
Authors: Zhenxing Zhang, Jiayan Teng, Zhuoyi Yang, Tiankun Cao, Cheng Wang, Xiaotao Gu, Jie Tang, Dan Guo, Meng Wang
Abstract: We present Kaleido, a subject-to-video (S2V) generation framework that synthesizes subject-consistent videos conditioned on multiple reference images of target subjects. Despite recent progress in S2V generation models, existing approaches remain inadequate at maintaining multi-subject consistency and at disentangling subjects from their backgrounds, often resulting in low reference fidelity and semantic drift under multi-image conditioning. These shortcomings can be attributed to several factors. Primarily, the training data lack diversity, high-quality samples, and cross-paired data, i.e., paired samples whose components originate from different instances. In addition, the current mechanism for integrating multiple reference images is suboptimal and can lead to confusion between subjects. To overcome these limitations, we propose a dedicated data construction pipeline, incorporating low-quality sample filte...
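The "cross-paired data" idea from the abstract, where a reference crop and its target clip come from different instances of the same subject so the model cannot simply copy the reference background, can be illustrated with a minimal sketch. Note this is a hypothetical reconstruction of the data-pairing step, not the paper's actual pipeline; all names (`build_cross_pairs`, the `subject_id`/`clip_id` fields) are illustrative assumptions.

```python
# Hypothetical sketch of cross-paired sample construction for S2V training:
# pair a reference crop from one clip with a target clip of the *same subject*
# taken from a *different* instance, decoupling subject identity from background.
import random
from collections import defaultdict

def build_cross_pairs(samples, seed=0):
    """samples: list of dicts with 'subject_id', 'clip_id',
    'ref_crop', 'target_clip'. Returns (ref, target) pairs whose
    components come from different clips of the same subject."""
    rng = random.Random(seed)
    by_subject = defaultdict(list)
    for s in samples:
        by_subject[s["subject_id"]].append(s)
    pairs = []
    for subject, group in by_subject.items():
        if len(group) < 2:
            continue  # need at least two instances to cross-pair
        for s in group:
            # donate the reference crop from a different clip of the same subject
            donor = rng.choice([g for g in group if g["clip_id"] != s["clip_id"]])
            pairs.append({"subject_id": subject,
                          "ref": donor["ref_crop"],
                          "target": s["target_clip"]})
    return pairs
```

Subjects with only one available clip are skipped, since a same-instance pair would reintroduce the background shortcut the cross-pairing is meant to remove.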