Crowdsourced high-quality Burmese speech data set.
Identifier: SLR80
Summary: Data set which contains recordings of Burmese.
Category: Speech
License: Attribution-ShareAlike 4.0 International
Downloads (use a mirror closer to you):
about.html [1.4K] (Information about the data set
) Mirrors:
[US]
[EU]
[CN]
LICENSE [20K] (License information for the data set
) Mirrors:
[US]
[EU]
[CN]
line_index_female.tsv [541K] (All utterances for the female speakers.
) Mirrors:
[US]
[EU]
[CN]
my_mm_female.zip [948M] (Archive file with all audio for the female speakers.
) Mirrors:
[US]
[EU]
[CN]
About this resource:
The data set has been manually quality checked, but there might still be errors.
Please report any issues in the following issue tracker on GitHub. https://github.com/googlei18n/language-resources/issues
See LICENSE file for license information.
Copyright 2018, 2019 Google, Inc.
If you use this data in publications, please cite it as follows:
@inproceedings{oo-etal-2020-burmese,
title = {{Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech}},
author = {Oo, Yin May and Wattanavekin, Theeraphol and Li, Chenfang and De Silva, Pasindu and Sarin, Supheakmungkol and Pipatsrisawat, Knot and Jansche, Martin and Kjartansson, Oddur and Gutkin, Alexander},
booktitle = {Proceedings of The 12th Language Resources and Evaluation Conference (LREC)},
month = may,
year = {2020},
pages = "6328--6339",
address = {Marseille, France},
publisher = {European Language Resources Association (ELRA)},
url = {https://www.aclweb.org/anthology/2020.lrec-1.777},
ISBN = {979-10-95546-34-4},
}