KunquDB
a large-scale, well-annotated audio-visual Kunqu Opera dataset
The Kunqu Opera Art Canon encompasses the most significant literary, musical, and audiovisual materials from over 600 years of Kunqu Opera, embodying the essence of the art form. Specifically, it contains over 22.3 million words of textual documentation, 396 sets of reprinted documents totaling over 70,000 pages, 127 hours of recorded audio, over 400 hours of video recordings, and over 6,000 images, all compiled into 149 volumes. For more details about this book, you can visit the official website of its publisher or check out its introduction on Douban.
After purchasing the book, we negotiated with the publisher and secured their authorization to use it in Kunqu Opera research. The publisher explicitly stated that the book's digital resources may be used solely for scholarly or research purposes, and only with the publisher's approval. They may not be illegally disseminated or used for commercial purposes.
Here’s a screenshot of the annotation:
Due to data restrictions, we do not publicly release data from the KunquDB dataset as examples. Instead, we showcase similar annotations for online Kunqu Opera videos here.
KunquDB: An Attempt for Speaker Verification in the Chinese Opera Scenario
If our work is useful for your research, please consider citing:
@article{zhou2024kunqudb,
title={KunquDB: An Attempt for Speaker Verification in the Chinese Opera Scenario},
author={Zhou, Huali and Lin, Yuke and Liu, Dong and Li, Ming},
journal={arXiv preprint arXiv:2403.13356},
year={2024}
}
Researchers can gain access to the source video data by purchasing Kunqu yishu dadian (the Kunqu Opera Art Canon). It is the user's responsibility to obtain the publisher's approval to conduct research for non-commercial purposes. We only provide our annotation dataset and processing scripts. To obtain the annotation dataset, please email us at huali.zhou@dukekunshan.edu.cn or ming.li369@dukekunshan.edu.cn, and include your affiliation and the publisher's consent.
The dataset is licensed under the CC BY-NC-SA 4.0 license. This means that you can share and adapt the dataset for non-commercial purposes as long as you provide appropriate attribution and distribute your contributions under the same license. Detailed terms can be found in LICENSE.
This research is funded by the Kunshan Municipal Government Research Funding under the project "Deep Learning based Singing Voice Synthesis for Kun Opera". We want to thank the publisher for allowing us to conduct research on their data and DKU library staff members for their coordination. Special thanks to Xiaoyi Qin for his assistance.