VOICE MORPHING BY LINEAR PREDICTIVE CODING COEFFICEINTS MAPPING USING ARTIFICIAL NEURAL NETWORK

dc.contributor.authorBasnet, Binod
dc.date.accessioned2022-01-23T05:54:25Z
dc.date.available2022-01-23T05:54:25Z
dc.date.issued2015-11
dc.descriptionVoice Morphing (VM) modifies speaker voice (source speaker) to be perceived as if another speaker (target speaker) had uttered it.en_US
dc.description.abstractVoice Morphing (VM) modifies speaker voice (source speaker) to be perceived as if another speaker (target speaker) had uttered it. The voice morphing has been done by two methods a) Voice Morphing based on LPC mapping and PSOLA b) Voice Morphing based on LPC mapping using NN. The Voice conversion in first method is done by PSOLA (Pitch Synchronous Overlap and Add) on source based on target speech to obtain an intermediate speech followed by residual or excitation signal extraction that approximates the target speaker excitation that finally combines with mapped spectral/LPC parameters (LPC coefficients, Formants) of the target speaker to produce the Morphed speech. Whereas the Voice morphing using LPC mapping by Artificial Neural Network is based on the training and finding the transformation network that transformed the source speaker LPC or Vocal Tract parameters and excitation into the targeted speaker Vocal Tract Parameters. The Parameters are further used to resynthesize the target speech. The Artificial Neural Networks (ANN) approximates the mapping function that predicts the highly nonlinear relationship between vocal tracts and pitch of a source speaker to that of a target speaker. The results of voice Morphing are evaluated among various 12-24-12, 14-28-14 and 16-32-16 Neural Network architectures. Also the best NN architecture is compared with PSOLA based VM architecture. The subjective and objective measure is used to perform evaluation. The transformation results of ANN architectures and PSOLA based Voice Morphing system are compared based on the quality, SNR and spectral properties of the converted target speech.en_US
dc.identifier.citationMASTER OF SCIENCE IN COMPUTER SYSTEM AND KNOWLEDGE ENGINEERINGen_US
dc.identifier.urihttps://hdl.handle.net/20.500.14540/7605
dc.language.isoenen_US
dc.publisherPulchowk Campusen_US
dc.subjectVoice Morphingen_US
dc.subjectNeural Networken_US
dc.titleVOICE MORPHING BY LINEAR PREDICTIVE CODING COEFFICEINTS MAPPING USING ARTIFICIAL NEURAL NETWORKen_US
dc.typeThesisen_US
local.academic.levelMastersen_US
local.affiliatedinstitute.titlePulchowk Campusen_US
local.institute.titleInstitute of Engineeringen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
069MSCS668.zip
Size:
2 MB
Format:
Unknown data format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: