Rayment, George and Kampouridis, Michael (2024) Enhancing High-Frequency Trading with Deep Reinforcement Learning using Advanced Positional Awareness Under a Directional Changes Paradigm. In: International Conference on Machine Learning and Applications, 2024-12-18 - 2024-12-20, Miami, USA. (In Press)
Rayment, George and Kampouridis, Michael (2024) Enhancing High-Frequency Trading with Deep Reinforcement Learning using Advanced Positional Awareness Under a Directional Changes Paradigm. In: International Conference on Machine Learning and Applications, 2024-12-18 - 2024-12-20, Miami, USA. (In Press)
Rayment, George and Kampouridis, Michael (2024) Enhancing High-Frequency Trading with Deep Reinforcement Learning using Advanced Positional Awareness Under a Directional Changes Paradigm. In: International Conference on Machine Learning and Applications, 2024-12-18 - 2024-12-20, Miami, USA. (In Press)
Abstract
Deep reinforcement learning (DRL) offers the po- tential to make intelligent trading decisions in high frequency trading strategies in the foreign exchange (FX) market at a fraction of the time it takes humans. In this work, we use an event-based time sampling method referred to as directional changes (DC), which samples data only when there is a significant change in price, to build a DRL-based trading system. The enhanced price representation through DC sampling, combined with positional features reflecting the agent’s trading account, provides the DRL agent with information about its exposure to market changes. With this representation of the environment we can train a DRL agent to profitably trade the FX market at high frequencies. Tick data from fourteen FX currency pairs is sampled using the DC framework and then split into windows to form 784 datasets. The novel trading system called PADRL, uses Proximal Policy Optimisation (PPO) to train agents that can autonomously generate considerable levels of profit without rule-based interjections. The resultant agents are compared to four different benchmarks including Buy and Hold, an existing successful DC-based DRL strategy (FDRL) and two popular technical analysis based strategies (MACD and RSI). Strategy performance is measured across three different performance metrics (namely Total Return, Maximum Drawdown and Calmar Ratio), with the novel PADRL system significantly outperforming them all for Total Return and Calmar Ratio.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | directional changes, high frequency trading, machine learning, deep reinforcement learning |
Divisions: | Faculty of Science and Health Faculty of Science and Health > Computer Science and Electronic Engineering, School of |
SWORD Depositor: | Unnamed user with email elements@essex.ac.uk |
Depositing User: | Unnamed user with email elements@essex.ac.uk |
Date Deposited: | 02 Oct 2024 08:57 |
Last Modified: | 02 Oct 2024 08:59 |
URI: | http://repository.essex.ac.uk/id/eprint/39192 |
Available files
Filename: Enhancing_HFT_with_DRL_using_Advanced_Positional_Awareness_Under_a_DC_Paradigm___ICMLA_24___SUBMISSION.pdf