[2603.04134] InstMeter: An Instruction-Level Method to Predict Energy

[2603.04134] InstMeter: An Instruction-Level Method to Predict Energy and Latency of DL Model Inference on MCUs

arXiv - Machine Learning March 05, 2026 4 min read

About this article

Abstract page for arXiv paper 2603.04134: InstMeter: An Instruction-Level Method to Predict Energy and Latency of DL Model Inference on MCUs

Computer Science > Machine Learning arXiv:2603.04134 (cs) [Submitted on 4 Mar 2026] Title:InstMeter: An Instruction-Level Method to Predict Energy and Latency of DL Model Inference on MCUs Authors:Hao Liu, Qing Wang, Marco Zuniga View a PDF of the paper titled InstMeter: An Instruction-Level Method to Predict Energy and Latency of DL Model Inference on MCUs, by Hao Liu and 2 other authors View PDF Abstract:Deep learning (DL) models can now run on microcontrollers (MCUs). Through neural architecture search (NAS), we can search DL models that meet the constraints of MCUs. Among various constraints, energy and latency costs of the model inference are critical metrics. To predict them, existing research relies on coarse proxies such as multiply-accumulations (MACs) and model's input parameters, often resulting in inaccurate predictions or requiring extensive data collection. In this paper, we propose InstMeter, a predictor leveraging MCUs' clock cycles to accurately estimate the energy and latency of DL models. Clock cycles are fundamental metrics reflecting MCU operations, directly determining energy and latency costs. Furthermore, a unique property of our predictor is its strong linearity, allowing it to be simple and accurate. We thoroughly evaluate InstMeter under different scenarios, MCUs, and software settings. Compared with state-of-the-art studies, InstMeter can reduce the energy and latency prediction errors by $3\times$ and $6.5\times$, respectively, while requiring ...

Originally published on March 05, 2026. Curated by AI News.

Machine Learning

I tried building a memory-first AI… and ended up discovering smaller models can beat larger ones

Dataset Model Acc F1 Δ vs Log Δ vs Static Avg Params Peak Params Steps Infer ms Size Banking77-20 Logistic TF-IDF 92.37% 0.9230 +0.00pp +...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Llms

[D] Howcome Muon is only being used for Transformers?

Muon has quickly been adopted in LLM training, yet we don't see it being talked about in other contexts. Searches for Muon on ConvNets tu...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

[P] Run Karpathy's Autoresearch for $0.44 instead of $24 — Open-source parallel evolution pipeline on SageMaker Spot

TL;DR: I built an open-source pipeline that runs Karpathy's autoresearch on SageMaker Spot instances — 25 autonomous ML experiments for $...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min · about 1 hour ago

[2603.04134] InstMeter: An Instruction-Level Method to Predict Energy and Latency of DL Model Inference on MCUs

About this article

Related Articles

I tried building a memory-first AI… and ended up discovering smaller models can beat larger ones

[D] Howcome Muon is only being used for Transformers?

[P] Run Karpathy's Autoresearch for $0.44 instead of $24 — Open-source parallel evolution pipeline on SageMaker Spot

Improving AI models’ ability to explain their predictions

No comments

Stay updated with AI News