NEURAL AUDIO CODEC

BARAL, SUBODH; PANDEY, TAPENDRA; BURLAKOTI, ACHYUT; BARAL, SIJAL

NEURAL AUDIO CODEC

dc.contributor.author	BARAL, SUBODH
dc.contributor.author	PANDEY, TAPENDRA
dc.contributor.author	BURLAKOTI, ACHYUT
dc.contributor.author	BARAL, SIJAL
dc.date.accessioned	2023-07-31T05:58:27Z
dc.date.available	2023-07-31T05:58:27Z
dc.date.issued	2023-04-30
dc.description	Neural audio codecs that use end-to-end approaches have gained popularity due to their ability to learn efficient audio representations through data-driven methods, without relying on handcrafted signal processing components.	en_US
dc.description.abstract	Neural audio codecs that use end-to-end approaches have gained popularity due to their ability to learn efficient audio representations through data-driven methods, without relying on handcrafted signal processing components. This research paper evaluates the performance of Neural Audio Codec in comparison to traditional audio codecs Opus and EVS in terms of audio quality and efficiency. The study highlights the limitations of existing audio codecs in leveraging the abundant data available in the audio compression pipeline and proposes deep learning-based models as a potential solution. The paper reviews recent advancements in deep learning-based audio synthesis and representation learning and explores the potential of deep learning-based audio codecs in enhancing compression efficiency. The study also addresses the limitations of existing models, including slower training times and increased memory requirements, by releasing open-source code and pre-trained models for further research and improvement. Experimental results show that our approach has comparable performance to widely used commercial codec OPUS at low bitrate, and a slight drop in performance compared to current deep learning-based frameworks but at the expense of significant improvement in speed and memory requirements. We have released our code and pre-trained models at https://github.com/AchyutBurlakoti/Neural-Audio-Compression for further research and improvement.	en_US
dc.identifier.uri	https://hdl.handle.net/20.500.14540/18840
dc.language.iso	en	en_US
dc.publisher	I.O.E. Pulchowk Campus	en_US
dc.subject	Audio Compression,	en_US
dc.subject	Deep Learning,	en_US
dc.subject	Audio Codec	en_US
dc.title	NEURAL AUDIO CODEC	en_US
dc.type	Report	en_US
local.academic.level	Bachelor	en_US
local.affiliatedinstitute.title	Pulchowk Campus	en_US
local.institute.title	Institute of Engineering	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Subodh Baral et al. be report computer apr2023.pdf
Size:: 1.19 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer Engineering