Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/18557
Full metadata record
DC Field | Value | Language
dc.contributor.author | Lal, Janak Kumar | -
dc.date.accessioned | 2023-07-17T10:12:38Z | -
dc.date.available | 2023-07-17T10:12:38Z | -
dc.date.issued | 2022-09 | -
dc.identifier.uri | https://elibrary.tucl.edu.np/handle/123456789/18557 | -
dc.description | Malcolm Gladwell, in his book “Outliers: The Story of Success” [1], writes that it takes 10,000 hours of intensive practice to achieve mastery of complex skills. Can an amateur violinist become an expert by playing the same song for 10,000 hours? Does a person who has tossed an unbiased coin 10,000 times predict the next outcome more accurately than a person who has never tossed a coin? With an unbiased coin, no matter how many times a person has tossed it, the outcome of the next toss is random and unpredictable. | en_US
dc.description.abstract | This study adopts the Double Deep Q-learning algorithm to design trading strategies for the stocks of four commercial banks listed on NEPSE. The reinforcement learning agent takes discrete actions and receives a positive or negative reward from the environment. A CNN is used to form the policy network. A target network is used to mitigate the training instability of the Deep Q-Network. Experience replay is used to randomly sample batches of experience from memory and train the network. The performance of the Double Deep Q-learning agent was compared with various baseline trading strategies in terms of annualised expected trade return. The maximum annualised expected trade return obtained with the traditional baseline methods was 103% on the NABIL test data, while on the same data the reinforcement learning agent using the Double Deep Q-learning algorithm obtained an annualised expected trade return of 114.44%. The experiments showed that the Double Deep Q-learning agent with experience replay had a higher annualised expected trade return than the baseline trading strategies. | en_US
dc.language.iso | en | en_US
dc.publisher | IOE Pulchowk Campus | en_US
dc.relation.ispartofseries | THESIS NO: 075/MSICE/009; | -
dc.subject | Reinforcement learning | en_US
dc.subject | Double Deep Q-Learning | en_US
dc.title | “Design of Stock Trading Agent Using Deep Reinforcement Learning” | en_US
dc.type | Thesis | en_US
local.institute.title | Institute of Engineering | en_US
local.academic.level | Masters | en_US
local.affiliatedinstitute.title | Pulchowk Campus | en_US
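
The abstract names three ingredients: experience replay, a target network, and the Double Deep Q-learning update. The sketch below illustrates how they commonly fit together; it is not the thesis's implementation. The names `ReplayBuffer` and `double_dqn_target`, the discount factor `gamma`, and the three-action example are assumptions made for illustration, and plain Python lists stand in for the outputs of the CNN policy and target networks.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size memory of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity):
        # deque with maxlen silently drops the oldest experience when full
        self.memory = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.memory.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random sampling breaks the correlation between
        # consecutive time steps in the training batch
        return random.sample(list(self.memory), batch_size)


def double_dqn_target(reward, next_q_online, next_q_target, done, gamma=0.99):
    """Double DQN target for one transition.

    next_q_online: Q-values of the next state from the online (policy) network
    next_q_target: Q-values of the next state from the target network
    Both are plain lists here; in practice they would be network outputs.
    """
    if done:
        return reward
    # The online network selects the action ...
    best_action = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    # ... and the target network evaluates it
    return reward + gamma * next_q_target[best_action]


# Hypothetical example with three discrete actions
buffer = ReplayBuffer(capacity=3)
for step in range(5):
    buffer.push(step, 0, 1.0, step + 1, False)
batch = buffer.sample(2)
target = double_dqn_target(1.0, [0.2, 0.9, 0.1], [0.5, 0.3, 0.8], False, gamma=0.5)
```

In the example, the online values pick the action at index 1, so the target uses the target network's value 0.3 for that action rather than its own maximum of 0.8. Separating action selection from action evaluation in this way is what distinguishes Double Deep Q-learning from the standard Deep Q-Network target.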
Appears in Collections: Electronics and Computer Engineering

Files in This Item:
File | Description | Size | Format
Janak_075MSICE009 Master electronics and computer.pdf | - | 2.96 MB | Adobe PDF

