[2603.00180] NNiT: Width-Agnostic Neural Network Generation with Structurally Aligned Weight Spaces
Computer Science > Machine Learning
arXiv:2603.00180 (cs)
[Submitted on 26 Feb 2026]

Title: NNiT: Width-Agnostic Neural Network Generation with Structurally Aligned Weight Spaces
Authors: Jiwoo Kim, Swarajh Mehta, Hao-Lun Hsu, Hyunwoo Ryu, Yudong Liu, Miroslav Pajic

Abstract: Generative modeling of neural network parameters is often tied to architectures because standard parameter representations rely on known weight-matrix dimensions. Generation is further complicated by permutation symmetries that allow networks to model similar input-output functions while having widely different, unaligned parameterizations. In this work, we introduce Neural Network Diffusion Transformers (NNiTs), which generate weights in a width-agnostic manner by tokenizing weight matrices into patches and modeling them as locally structured fields. We establish that Graph HyperNetworks (GHNs) with a convolutional neural network (CNN) decoder structurally align the weight space, creating the local correlation necessary for patch-based processing. Focusing on MLPs, where permutation symmetry is especially apparent, NNiT generates fully functional networks across a range of architectures. Our approach jointly models discrete architecture tokens and continuous weight patches within a single sequence model. On ManiSkill3 robotic...
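The width-agnostic tokenization the abstract describes, splitting weight matrices of arbitrary shape into fixed-size patches, can be sketched as follows. This is a minimal illustration only: the function names, the patch size, and the zero-padding scheme are assumptions for exposition, not the paper's actual implementation.

```python
import numpy as np

def tokenize_weights(W, patch=4):
    """Split a weight matrix of arbitrary shape into fixed-size
    patch tokens, zero-padding the ragged edges so any width works.
    Returns the tokens plus the patch-grid shape needed to rebuild W."""
    pad_r = -W.shape[0] % patch          # rows needed to reach a multiple of `patch`
    pad_c = -W.shape[1] % patch          # columns likewise
    Wp = np.pad(W, ((0, pad_r), (0, pad_c)))
    gr, gc = Wp.shape[0] // patch, Wp.shape[1] // patch
    tokens = (Wp.reshape(gr, patch, gc, patch)   # carve into a grid of patches
                .transpose(0, 2, 1, 3)           # group the two grid axes together
                .reshape(gr * gc, patch * patch))  # one flat token per patch
    return tokens, (gr, gc)

def detokenize(tokens, grid, shape, patch=4):
    """Inverse of tokenize_weights: reassemble patches, then crop padding."""
    gr, gc = grid
    Wp = (tokens.reshape(gr, gc, patch, patch)
                .transpose(0, 2, 1, 3)
                .reshape(gr * patch, gc * patch))
    return Wp[:shape[0], :shape[1]]

# Round trip on a non-square, non-multiple-of-patch matrix:
W = np.arange(70, dtype=float).reshape(7, 10)
tokens, grid = tokenize_weights(W, patch=4)
print(tokens.shape)   # (6, 16): a 2x3 grid of 4x4 patches, each flattened
```

Because every matrix, whatever its width, becomes a sequence of same-sized tokens, a single sequence model can consume networks of varying architecture; the paper's observation is that a GHN-with-CNN-decoder weight space makes neighboring entries correlated, so such patches carry coherent local structure.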