Text to Speech for Dzongkha Language

Authors

  • Yeshi Wangchuk Information Technology Department, College of Science and Technology, Royal University of Bhutan, Rinchending, Phuentsholing, 21002, Bhutan.
  • Kamal K. Chapagai Electronics and Communication Engineering Department, College of Science and Technology, Royal University of Bhutan, Rinchending, Phuentsholing, 21002, Bhutan.
  • Pema Galey Information Technology Department, College of Science and Technology, Royal University of Bhutan, Rinchending, Phuentsholing, 21002, Bhutan.
  • Yeshi Jamtsho Information Technology Department, College of Science and Technology, Royal University of Bhutan, Rinchending, Phuentsholing, 21002, Bhutan.

DOI:

https://doi.org/10.9734/bpi/ratmcs/v4/5490C

Keywords:

Natural Language processing (NLP), Dzongkha, Text to speech (TTS) system, Statistical speech synthesis, phoneme, corpus, transcription

Abstract

Text to Speech plays a vital role in imparting information to the general population who have difficulty reading text but can understand spoken language. In Bhutan, such system were not available and building a system will leverage the use of the language by different segment of people. This article describes an attempt to create a functioning model of a Text to Speech system for the Dzongkha language by creating a transcription or grapheme table for phonetic transcription from Dzongkha text to its comparable phone set. The transcription tables for consonants and vowels have been produced in such a way that they allow for improved computer compatibility. 3000 phrases were painstakingly transcribed and recorded using a single male voice. On the FESTIVAL platform, the voice synthesis was based on a statistical approach with concatenative speech creation. The model is generated using the two variants CLUSTERGEN and CLUNITS of the FESTIVAL speech tools FESTVOX where the earlier method produce more natural speech than the later for the large data set. The development of system prototype is of the first kind for the Dzongkha language in spite of attempts being made by researchers.

Published

2023-09-09

How to Cite

Yeshi Wangchuk, Kamal K. Chapagai, Pema Galey, & Yeshi Jamtsho. (2023). Text to Speech for Dzongkha Language. Research and Applications Towards Mathematics and Computer Science Vol. 4, 86–95. https://doi.org/10.9734/bpi/ratmcs/v4/5490C