Jazz Bass Transcription Using a U-Net Architecture

Abeßer J, Müller M (2021)


Publication Type: Journal article

Publication year: 2021

Journal

Book Volume: 10

Pages Range: 670

Issue: 6

Journal Issue: 6

DOI: 10.3390/electronics10060670

Abstract

In this paper, we adapt a recently proposed U-net deep neural network architecture from melody to bass transcription. We investigate pitch shifting and random equalization as data augmentation techniques. In a parameter importance study, we study the influence of the skip connection strategy between the encoder and decoder layers, the data augmentation strategy, as well as of the overall model capacity on the system’s performance. Using a training set that covers various music genres and a validation set that includes jazz ensemble recordings, we obtain the best transcription performance for a downscaled version of the reference algorithm combined with skip connections that transfer intermediate activations between the encoder and decoder. The U-net based method outperforms previous knowledge-driven and data-driven bass transcription algorithms by around five percentage points in overall accuracy. In addition to a pitch estimation improvement, the voicing estimation performance is clearly enhanced.

Authors with CRIS profile

Involved external institutions

How to cite

APA:

Abeßer, J., & Müller, M. (2021). Jazz Bass Transcription Using a U-Net Architecture. Electronics, 10(6), 670. https://doi.org/10.3390/electronics10060670

MLA:

Abeßer, Jakob, and Meinard Müller. "Jazz Bass Transcription Using a U-Net Architecture." Electronics 10.6 (2021): 670.

BibTeX: Download