Llms Machine Learning Ai Safety Ai Startups Data Science

[2602.17532] Systematic Evaluation of Single-Cell Foundation Model Interpretability Reveals Attention Captures Co-Expression Rather Than Unique Regulatory Signal

arXiv - AI February 20, 2026 3 min read Article

Summary

This article evaluates the interpretability of single-cell foundation models, revealing that attention mechanisms capture co-expression rather than unique regulatory signals, with implications for predictive modeling in genomics.

Why It Matters

Understanding how single-cell models interpret biological data is crucial for advancing genomic research. This study highlights the limitations of attention mechanisms in predictive accuracy, guiding future model development and quality control in genomics.

Key Takeaways

Attention patterns in models like scGPT and Geneformer encode biological information but lack predictive value.
Gene-level baselines outperform attention mechanisms in perturbation prediction.
The relationship between attention and correlation is context-dependent across different cell types.
Cell-State Stratified Interpretability (CSSI) improves gene regulatory network recovery significantly.
The study establishes quality-control standards for interpretability in single-cell models.

Quantitative Biology > Genomics arXiv:2602.17532 (q-bio) [Submitted on 19 Feb 2026] Title:Systematic Evaluation of Single-Cell Foundation Model Interpretability Reveals Attention Captures Co-Expression Rather Than Unique Regulatory Signal Authors:Ihor Kendiukhov View a PDF of the paper titled Systematic Evaluation of Single-Cell Foundation Model Interpretability Reveals Attention Captures Co-Expression Rather Than Unique Regulatory Signal, by Ihor Kendiukhov View PDF HTML (experimental) Abstract:We present a systematic evaluation framework - thirty-seven analyses, 153 statistical tests, four cell types, two perturbation modalities - for assessing mechanistic interpretability in single-cell foundation models. Applying this framework to scGPT and Geneformer, we find that attention patterns encode structured biological information with layer-specific organisation - protein-protein interactions in early layers, transcriptional regulation in late layers - but this structure provides no incremental value for perturbation prediction: trivial gene-level baselines outperform both attention and correlation edges (AUROC 0.81-0.88 versus 0.70), pairwise edge scores add zero predictive contribution, and causal ablation of regulatory heads produces no degradation. These findings generalise from K562 to RPE1 cells; the attention-correlation relationship is context-dependent, but gene-level dominance is universal. Cell-State Stratified Interpretability (CSSI) addresses an attention-specif...

Read Original Article

[2602.17532] Systematic Evaluation of Single-Cell Foundation Model Interpretability Reveals Attention Captures Co-Expression Rather Than Unique Regulatory Signal

Summary

Why It Matters

Key Takeaways

Related Articles

A robot car with a Claude AI brain started a YouTube vlog about its own existence

Study: LLMs Able to De-Anonymize User Accounts on Reddit, Hacker News & Other "Pseudonymous" Platforms; Report Co-Author Expands, Advises

do you guys actually trust AI tools with your data?

[P] Remote sensing foundation models made easy to use.

No comments

Stay updated with AI News