Inference with In and Out of Domain Samples

Jasmine and Jim

Replicating the paper’s experiment with in and out of domain samples with the pretrained model and .wav samples we collected on our own

Summary

The purpose of this experiment was to somewhat tangentially replicate the paper’s experiment with in and out of domain samples with the pretrained model and .wav samples we collected on our own, so this would be easily comparable to our trained model and so we can more objectively test this on our own/compare results to those of the paper. We ran samples through the encoder and decoder to get outputs, then ran the outputs through the encoder again to compare with the encoder outputs from the original file.

Samples

We collected 10 short (~10 sec) music samples on Youtube of various samples/domains not included in the training dataset for the pretrained model, including classical solo music on various instruments, Chinese classical music, jazz, and rock. In general, the model performed well for classical music in instruments it was trained on, and decently on instruments it had never seen, provided that the notes/pitches were relatively distinct. However, it performed poorly on audio clips containing non-pitched instruments or other irregular sounds, such as heavy metal and swing jazz.

Results (to piano)

Original violin

Violin to piano

Original trumpet

Trumpet to piano

Original swing jazz

Swing jazz to piano

Original saxophone

Saxophone to piano

Original piano

Piano to piano

Original orchestra

Violin to piano

Original metal

Metal to piano

Original marimba

Marima to piano

Original Chinese

Chinese to piano

Original bassoon

Bassoon to piano

Written on March 29, 2020