gulaerchen.github.io

MIR-SingerSeparation Dataset & Auto-selection

Dataset Description

The dataset consists of 3 types (Please refer to paper to introduce various categories.) of singer separation datasets, each track 10 seconds long, segments from 476 English and 500 Chinese songs, and male/female vocalist ratio for English songs was 269:207, while that for Chinese songs was 223:277. The tracks are all 8kHz Mono 16-bit audio files in .wav format.

Versions

1.0.0 (default): No release notes.

System Demo

Singer Separation

Download datasets

Download size

Dataset size

Splits

Auto Selection

Citation

Please cite the paper to use the dataset.

@misc{chen2021singer,
    title={Singer separation for karaoke content generation},
    author={Hsuan-Yu Chen and Xuanjun Chen and Jyh-Shing Roger Jang},
    year={2021},
    eprint={2110.06707},
    archivePrefix={arXiv},
    primaryClass={cs.SD}
}