Holotron-12B - High Throughput Computer Use Agent
About this article
A Blog post by H company on Hugging Face
Back to Articles Holotron-12B - High Throughput Computer Use Agent Team Article Published March 17, 2026 Upvote 12 +6 Pierre-Louis Cedoz plcedoz38 Follow Hcompany Hamza Benchekroun hamza-hcompany Follow Hcompany Aurélien Lac h-aurelien-lac Follow Hcompany delfosse aureliendelfosseathai Follow Hcompany Tony Wu h-tonywu Follow Hcompany Mats L. Richter MatsLRichter Follow Hcompany Antoine Bonnet ABonnetH Follow Hcompany Kai Yuan h-kaiy Follow Hcompany Aleix Cambray (H-AI) h-aleixcambray Follow Hcompany Alexandra a-constantinou Follow Hcompany We're thrilled to release Holotron-12B, a multimodal computer-use model from H Company. Post-trained from the open NVIDIA Nemotron-Nano-2 VL model on H Company’s proprietary data mixture, Holotron-12B is the result of a close collaboration between our research labs to engineer a new type of model optimized primarily for scale and performance in production. H Company is part of the NVIDIA Inception Program. The model is now available on Hugging Face. Why We Built Holotron-12B Most multimodal models today optimize primarily for static vision or following instructions. Holotron-12B, just like our Holo2 model, however, has a different goal: serving as a policy model for computer-use agents that must perceive, decide, and act efficiently in interactive environments. With Holotron-12B, we wanted to create a model that could efficiently and effectively scale in production while handling long contexts with multiple images, and still perform well...