The dataset consists of three types of singer separation datasets (see the paper for a description of each category). Each track is a 10-second segment drawn from 476 English and 500 Chinese songs. The male-to-female vocalist ratio is 269:207 for the English songs and 223:277 for the Chinese songs. All tracks are 8 kHz, mono, 16-bit audio files in .wav format.
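As a quick sanity check after downloading, the stated audio format (8 kHz, mono, 16-bit, about 10 seconds) can be verified with Python's standard `wave` module. The sketch below is not part of the dataset's own tooling. The function name `check_track` and the duration tolerance are assumptions made for illustration.

```python
import wave

# Format stated in the dataset description.
EXPECTED_RATE = 8000        # samples per second (8 kHz)
EXPECTED_CHANNELS = 1       # mono
EXPECTED_SAMPLE_WIDTH = 2   # bytes per sample (16-bit)
EXPECTED_SECONDS = 10.0     # nominal track length


def check_track(path, tolerance=0.05):
    """Return a list of mismatches between a .wav file and the stated format.

    `tolerance` (in seconds) is a hypothetical allowance for small
    differences in track length; an empty list means the file matches.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
        width = wav.getsampwidth()
        duration = wav.getnframes() / rate

    problems = []
    if rate != EXPECTED_RATE:
        problems.append(f"sample rate is {rate} Hz, expected {EXPECTED_RATE} Hz")
    if channels != EXPECTED_CHANNELS:
        problems.append(f"{channels} channels, expected {EXPECTED_CHANNELS}")
    if width != EXPECTED_SAMPLE_WIDTH:
        problems.append(f"sample width is {width} bytes, expected {EXPECTED_SAMPLE_WIDTH}")
    if abs(duration - EXPECTED_SECONDS) > tolerance:
        problems.append(f"duration is {duration:.2f} s, expected {EXPECTED_SECONDS} s")
    return problems
```

Calling `check_track` on each downloaded file and printing any non-empty result is usually enough to catch resampled or truncated tracks before training.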
1.0.0 (default): No release notes.
python auto_selection.py
If you use this dataset, please cite the paper:
@misc{chen2021singer,
title={Singer separation for karaoke content generation},
author={Hsuan-Yu Chen and Xuanjun Chen and Jyh-Shing Roger Jang},
year={2021},
eprint={2110.06707},
archivePrefix={arXiv},
primaryClass={cs.SD}
}