NEURAL AUDIO CODEC

dc.contributor.authorBARAL, SUBODH
dc.contributor.authorPANDEY, TAPENDRA
dc.contributor.authorBURLAKOTI, ACHYUT
dc.contributor.authorBARAL, SIJAL
dc.date.accessioned2023-07-31T05:58:27Z
dc.date.available2023-07-31T05:58:27Z
dc.date.issued2023-04-30
dc.descriptionNeural audio codecs that use end-to-end approaches have gained popularity due to their ability to learn efficient audio representations through data-driven methods, without relying on handcrafted signal processing components.en_US
dc.description.abstractNeural audio codecs that use end-to-end approaches have gained popularity due to their ability to learn efficient audio representations through data-driven methods, without relying on handcrafted signal processing components. This research paper evaluates the performance of Neural Audio Codec in comparison to traditional audio codecs Opus and EVS in terms of audio quality and efficiency. The study highlights the limitations of existing audio codecs in leveraging the abundant data available in the audio compression pipeline and proposes deep learning-based models as a potential solution. The paper reviews recent advancements in deep learning-based audio synthesis and representation learning and explores the potential of deep learning-based audio codecs in enhancing compression efficiency. The study also addresses the limitations of existing models, including slower training times and increased memory requirements, by releasing open-source code and pre-trained models for further research and improvement. Experimental results show that our approach has comparable performance to widely used commercial codec OPUS at low bitrate, and a slight drop in performance compared to current deep learning-based frameworks but at the expense of significant improvement in speed and memory requirements. We have released our code and pre-trained models at https://github.com/AchyutBurlakoti/Neural-Audio-Compression for further research and improvement.en_US
dc.identifier.urihttps://hdl.handle.net/20.500.14540/18840
dc.language.isoenen_US
dc.publisherI.O.E. Pulchowk Campusen_US
dc.subjectAudio Compression,en_US
dc.subjectDeep Learning,en_US
dc.subjectAudio Codecen_US
dc.titleNEURAL AUDIO CODECen_US
dc.typeReporten_US
local.academic.levelBacheloren_US
local.affiliatedinstitute.titlePulchowk Campusen_US
local.institute.titleInstitute of Engineeringen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Subodh Baral et al. be report computer apr2023.pdf
Size:
1.19 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: