“Design of Stock Trading Agent Using Deep Reinforcement Learning”

Lal, Janak Kumar

Please use this identifier to cite or link to this item: https://elibrary.tucl.edu.np/handle/123456789/18557

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lal, Janak Kumar	-
dc.date.accessioned	2023-07-17T10:12:38Z	-
dc.date.available	2023-07-17T10:12:38Z	-
dc.date.issued	2022-09	-
dc.identifier.uri	https://elibrary.tucl.edu.np/handle/123456789/18557	-
dc.description	Malcom Gladwell in his book “Outliers: The Story of Success” [1] writes that it takes 10,000 hours of intensive practice to achieve mastery of complex skills. Can an amature violinist be an expert by playing the same song for 10,000 hours? Does a person who has tossed an unbiased coin 10,000 times predict the next outcome more accurately than a person who has not tossed a coin? In case of tossing an unbiased coin no matter how many times a person has tossed a coin the outcome of the next toss will be random and unpredictable	en_US
dc.description.abstract	This study adopts Double Deep Q learning algorithm to design trading strategies to trade stocks of four commercial banks listed in NEPSE. The reinforcement learning agent takes discrete actions and gets negative or positive reward from the environment. CNN is utilized to form the policy network. A target network is used to mitigate instability due to Deep Q Network. The concept of experience replay is used to randomly sample the batches of experience from the memory and train the network. The performance of Double Deep Q learning agent was compared with various baseline trading strategies in terms of annualised expected trade return. The maximum annualised expected trade return obtained with traditional baseline methods was 103% for testing data of NABIL, while for the same data the reinforcement learning agent using double deep Q learning algorithm obtained annualized expected trade return of 114.44%. The experiments showed that, Double Deep Q learning agent with experience replay had higher annualised expected trade return compared to baseline trading strategies.	en_US
dc.language.iso	en	en_US
dc.publisher	IOE Pulchowk Campus	en_US
dc.relation.ispartofseries	THESIS NO: 075/MSICE/009;	-
dc.subject	Reinforcement learning	en_US
dc.subject	Double Deep Q-Learning	en_US
dc.title	“Design of Stock Trading Agent Using Deep Reinforcement Learning”	en_US
dc.type	Thesis	en_US
local.institute.title	Institute of Engineering	en_US
local.academic.level	Masters	en_US
local.affiliatedinstitute.title	Pulchowk Campus	en_US
Appears in Collections:	Electronics and Computer Engineering

Files in This Item:

File	Description	Size	Format
Janak_075MSICE009 Master electronics and computer.pdf		2.96 MB	Adobe PDF	View/Open

Show simple item record

TUCL eLibrary

Easy and open access to all types of digital resources of TUCL