

Note that although there's a free tier for EC2 processors, AmazonĬharges for EBS usage this 500G partition costs something like The 493G partition at the end (of which only 272G used) is the MSD
What is the million song dataset license#
Just have to mount sudo mkdir sudo mount -t ext4 /dev/xvdf ls /mnt/snapĪdditionalFiles data LICENSE lost+found df -hįilesystem Size Used Avail Use% Mounted on Virtual machine, it appears as /dev/xvdf from within Ubuntu. Snap-5178cf30 (I think this means your EC2 virtual machine has to be inįor me, when I launch an EC2 virtual machine running Ubuntu, thenĬreate an EBS instance from that snapshot, then attach the EBS to the You simply set up an EBS disk instance from The dataset is available as an Amazon Public Dataset snapshot which can easily be attached to an Amazon EC2 virtual machine to run your experiments in the cloud. The following universities should have a copy: Drexel, Ithaca College, QMUL, NYU, UCSD, UPF. If you want the whole dataset, check to see if you know someone that has it already.

What is the million song dataset download#
You can download the corresponding raw HDF5 file here: TRAXLZU12903D05F94.h5.

Here is a page showing the contents of a single example file. We do, however, provide a directly-downloadable subset for a quick look.īefore you start, you might want to review exactly what the dataset contains. The logistics of distributing a 300 GB dataset are a little more complicated than for smaller collections. If you're looking to download listenable audio, don't bother with this data. The closest we get is per-beat 12-dimensional chroma and timbre vectors. Important note: There is no audio included in the dataset.
