entertainment


Million Song Dataset

Creators: Thierry Bertin-Mahieux and Daniel P.W. Ellis from Columbia University, along with Brian Whitman and Paul Lamere from The Echo Nest.
Publication Date: 2011

The Million Song Dataset (MSD) is a large-scale music dataset created by The Echo Nest and LabROSA to advance research in music information retrieval and recommendation systems. It covers one million contemporary popular music tracks up to its release in 2011 and features 44,745 unique artists. For each track it provides metadata (song title, artist, album, release year, and genre) and audio features (tempo, loudness, key, time signature, and mode) that support in-depth musical analysis. At this size, the MSD enables the development and evaluation of algorithms that can scale to commercial music collections.

The dataset comprises approximately 280 GB of data and is organized in the HDF5 file format, which offers efficient storage of and access to large amounts of data. Each track’s data is stored as a separate HDF5 file containing the following key variables:

  • Metadata:

    • Song ID: A unique identifier for each track.

    • Title: The name of the song.

    • Artist: The performing artist or band.

    • Album: The album on which the song was released.

    • Release Year: The year the song was released.

    • Genre: The musical genre classification.

  • Audio Features:

    • Tempo: The speed of the song in beats per minute (BPM).

    • Loudness: The average volume level in decibels (dB).

    • Key: The musical key of the song (e.g., C major, G minor).

    • Time Signature: The meter of the song, indicating beats per measure (e.g., 4/4, 3/4).

    • Mode: The tonal mode, typically major or minor.
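As an illustration of how the variables listed above fit together, here is a minimal Python sketch of one track as an in-memory record. The `Track` class, its field names, and the sample values are hypothetical and are not the dataset's actual HDF5 field names. The sketch also assumes that key is stored as a pitch-class integer (0 = C through 11 = B) and mode as 1 for major and 0 for minor; check the dataset's own documentation before relying on that encoding.

```python
from dataclasses import dataclass

# Pitch-class names indexed 0-11; this integer encoding is an assumption.
PITCH_CLASSES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


@dataclass
class Track:
    """Hypothetical in-memory view of one track's variables."""
    song_id: str
    title: str
    artist: str
    album: str
    release_year: int
    genre: str
    tempo: float          # beats per minute
    loudness: float       # decibels
    key: int              # assumed pitch class, 0 = C ... 11 = B
    time_signature: int   # beats per measure
    mode: int             # assumed 1 = major, 0 = minor


def key_label(track: Track) -> str:
    """Combine key and mode into a readable label such as 'G minor'."""
    quality = "major" if track.mode == 1 else "minor"
    return f"{PITCH_CLASSES[track.key % 12]} {quality}"


def beat_seconds(track: Track) -> float:
    """Duration of one beat in seconds, derived from the tempo in BPM."""
    return 60.0 / track.tempo


# Made-up values for illustration only; not a real dataset entry.
example = Track(
    song_id="EXAMPLE0001", title="Example Song", artist="Example Artist",
    album="Example Album", release_year=2005, genre="rock",
    tempo=120.0, loudness=-7.5, key=7, time_signature=4, mode=0,
)
print(key_label(example))     # G minor
print(beat_seconds(example))  # 0.5
```

Keeping key and mode as separate fields mirrors the list above, where they appear as two distinct variables; the readable label is derived only when needed.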

Popular Movies of TMDb

Creators: Mondal, Sankha Subhra
Publication Date: 2020

This dataset of the 10,000 most popular movies worldwide was fetched through TMDb’s read API.
TMDb’s free API lets developers and their teams programmatically fetch and use TMDb’s data.
The API is free to use as long as you attribute TMDb as the source of the data and/or images, and it is updated from time to time. The dataset is 3.2 MB in size and offers insights into global cinematic trends and preferences.

Each movie entry in the dataset includes the following attributes:

  • title: The name of the movie.
  • overview: A brief summary of the movie’s plot.
  • original_language: The language in which the movie was originally produced.
  • vote_average: The average user rating of the movie on TMDb.
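The attributes listed above lend themselves to simple aggregate analysis. The following Python sketch averages `vote_average` per `original_language`. It assumes the data is available as a CSV file whose column names match the attribute names above, which the listing does not confirm. The rows in `SAMPLE_CSV` and the helper `mean_rating_by_language` are made up for illustration.

```python
import csv
import io
from collections import defaultdict

# Made-up rows for illustration only; not taken from the dataset.
SAMPLE_CSV = """title,overview,original_language,vote_average
Example Film A,A placeholder plot summary.,en,7.0
Example Film B,Another placeholder summary.,en,8.0
Example Film C,A third placeholder summary.,fr,6.5
"""


def mean_rating_by_language(csv_file):
    """Return {language: mean vote_average} for the rows in csv_file."""
    ratings = defaultdict(list)
    for row in csv.DictReader(csv_file):
        ratings[row["original_language"]].append(float(row["vote_average"]))
    return {lang: sum(votes) / len(votes) for lang, votes in ratings.items()}


averages = mean_rating_by_language(io.StringIO(SAMPLE_CSV))
print(averages)  # {'en': 7.5, 'fr': 6.5}
```

To run this against a real file, pass an open file handle (for example `open(path, newline="", encoding="utf-8")`) instead of the `io.StringIO` wrapper.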
