Public Domain Music Dataset

Creators:
Phillip Long, Zachary Novack, Julian McAuley, Taylor Berg-Kirkpatrick
Publication Date:
2024-09-16
Data Category:
Dataset Description:
PDMX is a large-scale public domain MusicXML dataset for symbolic music research and analysis, consisting of over 250,000 MusicXML scores sourced from MuseScore. Each score is accompanied by metadata describing attributes such as user interactions, ratings, and licensing information, which can be used to filter and analyze the dataset for specific research needs. Each observation is one score, so the dataset contains over 250,000 observations, and its total size is 1.6 GB. The dataset is organized so that each row represents a different song and includes paths to the song's MusicRender JSON file and associated metadata, along with attributes such as user status, ratings, and licensing information.
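As a rough illustration of filtering on the per-score metadata, the sketch below selects rows by rating and license from a small inline table. The column names (`path`, `rating`, `license`), the license values, and the sample rows are hypothetical stand-ins, not the dataset's actual schema; check the real column headers before adapting it.

```python
import csv
import io

# Tiny stand-in for the dataset's per-song table. The column names
# ("path", "rating", "license") and all values are hypothetical.
SAMPLE_CSV = """path,rating,license
data/a.json,4.8,publicdomain
data/b.json,0,cc-zero
data/c.json,4.1,publicdomain
"""


def filter_scores(csv_file, min_rating=0.0, license_name=None):
    """Return rows rated at least min_rating and, if given, matching license_name."""
    selected = []
    for row in csv.DictReader(csv_file):
        if float(row["rating"]) < min_rating:
            continue  # rating too low
        if license_name is not None and row["license"] != license_name:
            continue  # wrong license
        selected.append(row)
    return selected


rows = filter_scores(io.StringIO(SAMPLE_CSV), min_rating=4.0, license_name="publicdomain")
paths = [row["path"] for row in rows]
print(paths)  # ['data/a.json', 'data/c.json']
```

To run the same filter on a real file, pass an open file handle in place of the `io.StringIO` object and rename the columns to match the file's header.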
Variables:
Details:
