[2602.11776] MUSE: Multi-Tenant Model Serving With Seamless Model Updates

[2602.11776] MUSE: Multi-Tenant Model Serving With Seamless Model Updates

arXiv - Machine Learning 4 min read Article

Summary

The paper presents MUSE, a framework for multi-tenant model serving that allows seamless updates of machine learning models, optimizing decision thresholds and reducing operational costs.

Why It Matters

MUSE addresses the challenges of model recalibration in multi-tenant environments, which is crucial for businesses relying on accurate predictions. By decoupling model scores from client-specific decision boundaries, it enhances model resilience and operational efficiency, significantly impacting industries like finance and fraud detection.

Key Takeaways

  • MUSE enables seamless updates of machine learning models in multi-tenant environments.
  • It decouples model scores from client decision thresholds, facilitating easier recalibration.
  • The framework has been deployed at scale, processing over 55 billion events in the past year.
  • MUSE reduces model lead time from weeks to minutes, enhancing operational efficiency.
  • It helps save millions in fraud losses and operational costs.

Computer Science > Machine Learning arXiv:2602.11776 (cs) [Submitted on 12 Feb 2026 (v1), last revised 24 Feb 2026 (this version, v2)] Title:MUSE: Multi-Tenant Model Serving With Seamless Model Updates Authors:Cláudio Correia, Alberto E. A. Ferreira, Lucas Martins, Miguel P. Bento, Sofia Guerreiro, Ricardo Ribeiro Pereira, Ana Sofia Gomes, Jacopo Bono, Hugo Ferreira, Pedro Bizarro View a PDF of the paper titled MUSE: Multi-Tenant Model Serving With Seamless Model Updates, by Cl\'audio Correia and 9 other authors View PDF HTML (experimental) Abstract:In binary classification systems, decision thresholds translate model scores into actions. Choosing suitable thresholds relies on the specific distribution of the underlying model scores but also on the specific business decisions of each client using that model. However, retraining models inevitably shifts score distributions, invalidating existing thresholds. In multi-tenant Score-as-a-Service environments, where decision boundaries reside in client-managed infrastructure, this creates a severe bottleneck: recalibration requires coordinating threshold updates across hundreds of clients, consuming excessive human hours and leading to model stagnation. We introduce MUSE, a model serving framework that enables seamless model updates by decoupling model scores from client decision boundaries. Designed for multi-tenancy, MUSE optimizes infrastructure re-use by sharing models via dynamic intent-based routing, combined with a two-le...

Related Articles

Machine Learning

[D] When to transition from simple heuristics to ML models (e.g., DensityFunction)?

Two questions: What are the recommendations around when to transition from a simple heuristic baseline to machine learning ML models for ...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] ICML 2026 Average Score

Hi all, I’m curious about the current review dynamics for ICML 2026, especially after the rebuttal phase. For those who are reviewers (or...

Reddit - Machine Learning · 1 min ·
Machine Learning

[R] VOID: Video Object and Interaction Deletion (physically-consistent video inpainting)

We present VOID, a model for video object removal that aims to handle *physical interactions*, not just appearance. Most existing video i...

Reddit - Machine Learning · 1 min ·
Machine Learning

FLUX 2 Pro (2026) Sketch to Image

I sketched a cow and tested how different models interpret it into a realistic image for downstream 3D generation, turns out some models ...

Reddit - Artificial Intelligence · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime