Skip to main content
Society Logo
Journal Name Logo

Transactions of the International Society for Music Information Retrieval

CCMusic: An Open and Diverse Database for Chinese Music Information Retrieval Research

DATASET ARTICLE

Authors
  • Monan Zhou
  • Shenyang Xu
  • Zhaorui Liu
  • Zhaowen Wang
  • Feng Yu
  • Wei Li
  • Baoqiang Han

Abstract

Data are crucial in various computer‑related fields, including music information retrieval (MIR), an interdisciplinary area bridging computer science and music. This paper introduces CCMusic, an open and diverse database comprising multiple datasets specifically designed for tasks related to Chinese music, highlighting our focus on this culturally rich domain. The database integrates both published and unpublished datasets, with steps taken such as data cleaning, label refinement, and data structure unification to ensure data consistency and create ready‑to‑use versions. We conduct benchmark evaluations for all datasets using a unified evaluation framework developed specifically for this purpose. This publicly available framework supports both classification and detection tasks, ensuring standardized and reproducible results across all datasets. The database is hosted on HuggingFace and ModelScope, two open and multifunctional data and model hosting platforms, ensuring ease of accessibility and usability.

Year: 2025
Volume: 8 Issue: 1
Page/Article: 22–38
DOI: 10.5334/tismir.194
Accepted on Feb 21, 2025
Published on Mar 24, 2025
Peer Reviewed

Metrics

Click on the tabs below to view various metrics for this article.
Loading metrics