Nepali Text to Speech using Time Domain Pitch Synchronous Overlap Add Method

dc.contributor.author: Malla, Pratistha
dc.date.accessioned: 2022-01-30T05:48:21Z
dc.date.available: 2022-01-30T05:48:21Z
dc.date.issued: 2015-02
dc.description: Nepali text to speech has a range of applications. A Nepali text-to-speech synthesizer helps differently abled and illiterate people with reading. (en_US)
dc.description.abstract: Nepali text to speech has a range of applications. A Nepali text-to-speech synthesizer helps differently abled and illiterate people with reading. Text to speech is the process of converting written Nepali text to speech through text analysis and speech synthesis. Since Nepali is a phonetically rich language, the system uses a concatenative approach. Time Domain Pitch Synchronous Overlap Add (TD-PSOLA) is a simple and efficient concatenative synthesis method. It takes pre-recorded samples, cuts them into small pieces, and recombines these to form new speech. The raw Nepali text undergoes letter-to-sound conversion, which is based on diphone concatenation. The diphone dictionary is created from the pre-recorded speech signals. The speech database consists of the diphones together with the start and end positions of each speech segment extracted from the pre-recorded speech. For each diphone signal, the pitch is determined using the autocorrelation method. Hanning and Hamming windows have been used for windowing the speech signal. In the TD-PSOLA method, the windowed signals are overlapped and added to generate the synthetic speech signal. Hence, the TD-PSOLA concatenative method has been implemented to generate synthetic speech for Nepali text. (en_US)
dc.identifier.citation: MASTER OF SCIENCE IN COMPUTER SYSTEM AND KNOWLEDGE ENGINEERING (en_US)
dc.identifier.uri: https://hdl.handle.net/20.500.14540/7809
dc.language.iso: en (en_US)
dc.publisher: Pulchowk Campus (en_US)
dc.subject: Diphone (en_US)
dc.subject: Text to Speech (TTS) (en_US)
dc.subject: Speech Synthesis (en_US)
dc.subject: TD-PSOLA (en_US)
dc.title: Nepali Text to Speech using Time Domain Pitch Synchronous Overlap Add Method (en_US)
dc.type: Thesis (en_US)
local.academic.level: Masters (en_US)
local.affiliatedinstitute.title: Pulchowk Campus (en_US)
local.institute.title: Institute of Engineering (en_US)
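
The abstract names three signal-processing steps: autocorrelation pitch detection, Hanning windowing, and pitch-synchronous overlap-add. The following is a minimal pure-Python sketch of those steps, not the thesis implementation. The function names, the 60-400 Hz search range, the evenly spaced pitch marks, and the synthetic test tone are all assumptions made for illustration. This simplified version only re-spaces the grains, so it changes duration along with pitch; a fuller TD-PSOLA would also repeat or drop grains to preserve duration.

```python
import math


def hann(n):
    """Symmetric Hann (Hanning) window with n points."""
    if n < 2:
        return [1.0] * n
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)) for i in range(n)]


def pitch_period(x, sr, fmin=60.0, fmax=400.0):
    """Estimate the pitch period in samples from the autocorrelation peak.

    The search range fmin..fmax (Hz) is an assumed range for speech.
    """
    lo = max(1, int(sr / fmax))
    hi = min(int(sr / fmin), len(x) - 1)
    best_lag, best_r = lo, float("-inf")
    for lag in range(lo, hi + 1):
        r = sum(x[i] * x[i + lag] for i in range(len(x) - lag))
        if r > best_r:
            best_lag, best_r = lag, r
    return best_lag


def td_psola(x, period, new_period):
    """Overlap-add two-period Hann grains at a new pitch-mark spacing.

    Pitch marks are assumed to be evenly spaced every `period` samples,
    which is only reasonable for a steady voiced segment.
    """
    marks = list(range(period, len(x) - period, period))
    if not marks:
        return []
    win = hann(2 * period + 1)
    out = [0.0] * (2 * period + new_period * (len(marks) - 1) + 1)
    for k, m in enumerate(marks):
        centre = period + new_period * k  # grain centre in the output
        for j in range(-period, period + 1):
            out[centre + j] += x[m + j] * win[j + period]
    return out


# Hypothetical input: a 100 Hz tone sampled at 8 kHz stands in for a diphone.
sr = 8000
x = [math.sin(2 * math.pi * 100 * n / sr) for n in range(800)]
p = pitch_period(x, sr)
y = td_psola(x, p, int(p * 0.75))  # closer grain spacing raises the pitch
```

Using grains two periods long with a Hann window means that, when the spacing is left unchanged, adjacent windows sum to one and the interior of the signal is reconstructed unchanged, which is a convenient sanity check for the overlap-add step.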

Files

Original bundle

Name: 069MSCS660.pdf
Size: 1.82 MB
Format: Adobe Portable Document Format