[2603.21749] Model selection in hybrid quantum neural networks with applications to quantum transformer architectures
About this article
Abstract page for arXiv paper 2603.21749: Model selection in hybrid quantum neural networks with applications to quantum transformer architectures
Quantum Physics arXiv:2603.21749 (quant-ph) [Submitted on 23 Mar 2026] Title:Model selection in hybrid quantum neural networks with applications to quantum transformer architectures Authors:Harsh Wadhwa, Rahul Bhowmick, Naipunnya Raj, Rajiv Sangle, Ruchira V. Bhat, Krishnakumar Sabapathy View a PDF of the paper titled Model selection in hybrid quantum neural networks with applications to quantum transformer architectures, by Harsh Wadhwa and 5 other authors View PDF Abstract:Quantum machine learning models generally lack principled design guidelines, often requiring full resource-intensive training across numerous choices of encodings, quantum circuit designs and initialization strategies to find effective configuration. To address this challenge, we develope the Quantum Bias-Expressivity Toolbox ($\texttt{QBET}$), a framework for evaluating quantum, classical, and hybrid transformer architectures. In this toolbox, we introduce lean metrics for Simplicity Bias ($\texttt{SB}$) and Expressivity ($\texttt{EXP}$), for comparing across various models, and extend the analysis of $\texttt{SB}$ to generative and multiclass-classification tasks. We show that $\texttt{QBET}$ enables efficient pre-screening of promising model variants obviating the need to execute complete training pipelines. In evaluations on transformer-based classification and generative tasks we employ a total of $18$ qubits for embeddings ($6$ qubits each for query, key, and value). We identify scenarios in whic...