News Score: Score the News, Sort the News, Rewrite the Headlines

The Universal Weight Subspace Hypothesis

Abstract: We show that deep neural networks trained across diverse tasks exhibit remarkably similar low-dimensional parametric subspaces. We provide the first large-scale empirical evidence that demonstrates that neural networks systematically converge to shared spectral subspaces regardless of initialization, task, or domain. Through mode-wise spectral analysis of over 1100 models (including 500 Mistral-7B LoRAs, 500 Vision Transformers, and 50 LLaMA-8B models) we ...
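
As a rough illustration of what a "shared low-dimensional subspace" could look like, here is a minimal Python sketch on synthetic data. It is not the paper's method or data: the dimensions `d`, `k`, `n_models`, the noise level, and the plain SVD of stacked weight vectors are all assumptions chosen for the example. It builds weight vectors that lie near a common `k`-dimensional subspace, then checks how much spectral energy the top `k` singular directions capture and how closely they align with the planted subspace.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: 50 "models", each a 64-dim weight vector,
# all lying near a shared 4-dim subspace (synthetic, for illustration only).
d, k, n_models = 64, 4, 50

# Planted orthonormal basis (d x k) for the shared subspace.
basis, _ = np.linalg.qr(rng.standard_normal((d, k)))

# Each model = random combination of the shared directions + small noise.
coeffs = rng.standard_normal((n_models, k))
weights = coeffs @ basis.T + 0.01 * rng.standard_normal((n_models, d))

# Spectral analysis of the stacked weights (n_models x d).
_, s, vt = np.linalg.svd(weights, full_matrices=False)
explained = np.cumsum(s**2) / np.sum(s**2)

# Fraction of total spectral energy in the top-k singular directions.
top_k_energy = explained[k - 1]

# Cosines of the principal angles between planted and recovered subspaces:
# values near 1 mean the two subspaces nearly coincide.
cosines = np.linalg.svd(basis.T @ vt[:k].T, compute_uv=False)

print(f"energy in top {k} directions: {top_k_energy:.4f}")
print(f"smallest principal-angle cosine: {cosines.min():.4f}")
```

On real checkpoints the weights would be tensors rather than short vectors, and whether a comparably sharp spectral drop-off appears is exactly the empirical question the abstract describes.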

Read more at arxiv.org
