[2601.11556] CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning

[2601.11556] CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning

arXiv - Machine Learning 4 min read

About this article

Abstract page for arXiv paper 2601.11556: CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning

Computer Science > Machine Learning arXiv:2601.11556 (cs) [Submitted on 16 Dec 2025 (v1), last revised 27 Feb 2026 (this version, v2)] Title:CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning Authors:Boyang Wang, Yash Vishe, Xin Xu, Zachary Novack, Xunyi Jiang, Julian McAuley, Junda Wu View a PDF of the paper titled CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning, by Boyang Wang and 6 other authors View PDF HTML (experimental) Abstract:Natural language information needs over symbolic music scores rarely reduce to a single step lookup. Many queries require compositional Music Information Retrieval (MIR) that extracts multiple pieces of evidence from structured notation and aggregates them to answer the question. This setting remains challenging for Large Language Models due to the mismatch between natural language intents and symbolic representations, as well as the difficulty of reliably handling long structured contexts. Existing benchmarks only partially capture these retrieval demands, often emphasizing isolated theoretical knowledge or simplified settings. We introduce CSyMR-Bench, a benchmark for compositional MIR in symbolic music reasoning grounded in authentic user scenarios. It contains 126 multiple choice questions curated from community discussions and professional examinations, where each item requires chaining multiple atomic analyses over a score to derive implicit musical eviden...

Originally published on March 02, 2026. Curated by AI News.

Related Articles

Llms

Combining the robot operating system with LLMs for natural-language control

Over the past few decades, robotics researchers have developed a wide range of increasingly advanced robots that can autonomously complet...

Reddit - Artificial Intelligence · 1 min ·
Llms

Which LLM is the best for writing a scientific paper?

I'll need to write a scientifc research paper for university. We're allowed and encouraged to use AI for our work. Be it for language or ...

Reddit - Artificial Intelligence · 1 min ·
Llms

Anthropic is training Claude to recognize when its own tools are trying to manipulate it

One thing from Claude Code's source that I think is underappreciated. There's an explicit instruction in the system prompt: if the AI sus...

Reddit - Artificial Intelligence · 1 min ·
Llms

The Claude Code leak accidentally published the first complete blueprint for production AI agents. Here's what it tells us about where this is all going.

Most coverage of the Claude Code leak focuses on the drama or the hidden features. But the bigger story is that this is the first time we...

Reddit - Artificial Intelligence · 1 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime