[2603.01055] MMCOMET: A Large-Scale Multimodal Commonsense Knowledge Graph for Contextual Reasoning
Computer Science > Artificial Intelligence
arXiv:2603.01055 (cs)
[Submitted on 1 Mar 2026]

Title: MMCOMET: A Large-Scale Multimodal Commonsense Knowledge Graph for Contextual Reasoning
Authors: Eileen Wang, Hiba Arnaout, Dhita Pratama, Shuo Yang, Dangyang Liu, Jie Yang, Josiah Poon, Jeff Pan, Caren Han

Abstract: We present MMCOMET, the first multimodal commonsense knowledge graph (MMKG) that integrates physical, social, and eventive knowledge. MMCOMET extends the ATOMIC2020 knowledge graph with a visual dimension through an efficient image retrieval process, resulting in over 900K multimodal triples. This new resource addresses a major limitation of existing MMKGs in supporting complex reasoning tasks such as image captioning and storytelling. Through a standard visual storytelling experiment, we show that our holistic approach enables the generation of richer, more coherent, and more contextually grounded stories than those produced using text-only knowledge. This resource establishes a new foundation for multimodal commonsense reasoning and narrative generation.

Subjects: Artificial Intelligence (cs.AI)
Cite as: arXiv:2603.01055 [cs.AI] (or arXiv:2603.01055v1 [cs.AI] for this version)
DOI: https://doi.org/10.48550/arXiv.2603.01055
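
Note: the abstract does not specify how the image retrieval or the multimodal triples are implemented. As a rough, hypothetical sketch only, the Python snippet below shows one plausible way a triple extending an ATOMIC2020 relation (here `xEffect`) could be represented and paired with images by cosine similarity over precomputed embeddings. The class and function names, the example events, and the random embeddings standing in for a real text/image encoder are all assumptions, not the authors' method.

```python
from dataclasses import dataclass, field
import numpy as np


@dataclass
class MultimodalTriple:
    """A text triple in ATOMIC2020 style, grounded with retrieved images."""
    head: str                      # e.g. "PersonX bakes bread"
    relation: str                  # e.g. "xEffect"
    tail: str                      # e.g. "PersonX feels satisfied"
    image_ids: list = field(default_factory=list)


def retrieve_images(text_emb: np.ndarray,
                    image_embs: np.ndarray,
                    image_ids: list,
                    top_k: int = 3) -> list:
    """Return the top_k image ids whose embeddings are most similar
    (cosine similarity) to the triple's text embedding."""
    text_emb = text_emb / np.linalg.norm(text_emb)
    image_embs = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    scores = image_embs @ text_emb
    best = np.argsort(-scores)[:top_k]
    return [image_ids[i] for i in best]


# Hypothetical usage: random vectors stand in for encoder outputs.
rng = np.random.default_rng(0)
text_emb = rng.normal(size=512)
image_embs = rng.normal(size=(1000, 512))
ids = [f"img_{i:04d}" for i in range(1000)]

triple = MultimodalTriple(
    head="PersonX bakes bread",
    relation="xEffect",
    tail="PersonX feels satisfied",
    image_ids=retrieve_images(text_emb, image_embs, ids),
)
print(triple)
```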