[2601.11556] CSyMR: Benchmarking Compositional Music Information

[2601.11556] CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning

arXiv - Machine Learning March 02, 2026 4 min read

About this article

Abstract page for arXiv paper 2601.11556: CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning

Computer Science > Machine Learning arXiv:2601.11556 (cs) [Submitted on 16 Dec 2025 (v1), last revised 27 Feb 2026 (this version, v2)] Title:CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning Authors:Boyang Wang, Yash Vishe, Xin Xu, Zachary Novack, Xunyi Jiang, Julian McAuley, Junda Wu View a PDF of the paper titled CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning, by Boyang Wang and 6 other authors View PDF HTML (experimental) Abstract:Natural language information needs over symbolic music scores rarely reduce to a single step lookup. Many queries require compositional Music Information Retrieval (MIR) that extracts multiple pieces of evidence from structured notation and aggregates them to answer the question. This setting remains challenging for Large Language Models due to the mismatch between natural language intents and symbolic representations, as well as the difficulty of reliably handling long structured contexts. Existing benchmarks only partially capture these retrieval demands, often emphasizing isolated theoretical knowledge or simplified settings. We introduce CSyMR-Bench, a benchmark for compositional MIR in symbolic music reasoning grounded in authentic user scenarios. It contains 126 multiple choice questions curated from community discussions and professional examinations, where each item requires chaining multiple atomic analyses over a score to derive implicit musical eviden...

Originally published on March 02, 2026. Curated by AI News.

Llms

Combining the robot operating system with LLMs for natural-language control

Over the past few decades, robotics researchers have developed a wide range of increasingly advanced robots that can autonomously complet...

Reddit - Artificial Intelligence · 1 min · 29 minutes ago

Llms

Which LLM is the best for writing a scientific paper?

I'll need to write a scientifc research paper for university. We're allowed and encouraged to use AI for our work. Be it for language or ...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

Anthropic is training Claude to recognize when its own tools are trying to manipulate it

One thing from Claude Code's source that I think is underappreciated. There's an explicit instruction in the system prompt: if the AI sus...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

The Claude Code leak accidentally published the first complete blueprint for production AI agents. Here's what it tells us about where this is all going.

Most coverage of the Claude Code leak focuses on the drama or the hidden features. But the bigger story is that this is the first time we...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

[2601.11556] CSyMR: Benchmarking Compositional Music Information Retrieval in Symbolic Music Reasoning

About this article

Related Articles

Combining the robot operating system with LLMs for natural-language control

Which LLM is the best for writing a scientific paper?

Anthropic is training Claude to recognize when its own tools are trying to manipulate it

The Claude Code leak accidentally published the first complete blueprint for production AI agents. Here's what it tells us about where this is all going.

No comments

Stay updated with AI News