[2603.02123] Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

[2603.02123] Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

arXiv - AI 4 min read

About this article

Abstract page for arXiv paper 2603.02123: Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

Computer Science > Artificial Intelligence arXiv:2603.02123 (cs) [Submitted on 2 Mar 2026] Title:Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy Authors:Jiahao Huang, Fengyan Lin, Xuechao Yang, Chen Feng, Kexin Zhu, Xu Yang, Zhide Chen View a PDF of the paper titled Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy, by Jiahao Huang and 6 other authors View PDF HTML (experimental) Abstract:The development of affective multimodal language models (MLMs) has long been constrained by a gap between low-level perception and high-level interaction, leading to fragmented affective capabilities and limited generalization. To bridge this gap, we propose a cognitively inspired three-level hierarchy that organizes affective tasks according to their cognitive depth-perception, understanding, and interaction-and provides a unified conceptual foundation for advancing affective modeling. Guided by this hierarchy, we introduce Nano-EmoX, a small-scale multitask MLM, and P2E (Perception-to-Empathy), a curriculum-based training framework. Nano-EmoX integrates a suite of omni-modal encoders, including an enhanced facial encoder and a fusion encoder, to capture key multimodal affective cues and improve cross-task transferability. The outputs are projected into a unified language space via heterogeneous adapters, empowering a lightweight language model to tackle diverse affective tasks. Concurrently, P2E progressively cultivates emo...

Originally published on March 03, 2026. Curated by AI News.

Related Articles

Llms

The Claude Code leak accidentally published the first complete blueprint for production AI agents. Here's what it tells us about where this is all going.

Most coverage of the Claude Code leak focuses on the drama or the hidden features. But the bigger story is that this is the first time we...

Reddit - Artificial Intelligence · 1 min ·
AI can push your Stream Deck buttons for you | The Verge
Llms

AI can push your Stream Deck buttons for you | The Verge

The Stream Deck 7.4 software update introduces MCP support, allowing AI assistants to find and activate Stream Deck actions on your behalf.

The Verge - AI · 4 min ·
Llms

[For Hire] Junior AI/ML Engineer | RAG · LLMs · FastAPI · Vector DBs | Remote

Posting this for a friend who isn't on Reddit. A recent graduate, entry level, no commercial production experience but spent the past yea...

Reddit - ML Jobs · 1 min ·
I Asked ChatGPT What WIRED’s Reviewers Recommend—Its Answers Were All Wrong | WIRED
Llms

I Asked ChatGPT What WIRED’s Reviewers Recommend—Its Answers Were All Wrong | WIRED

Want to know what our reviewers have actually tested and picked as the best TVs, headphones, and laptops? Ask ChatGPT, and it'll give you...

Wired - AI · 8 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime