Deep Reinforcement Learning
MSc Course | University of Twente —
Course Overview
Advanced course on deep reinforcement learning with applications to trading, portfolio management, and financial decision-making. This course explores how intelligent agents learn to make sequential decisions in uncertain financial environments, combining theoretical foundations with hands-on implementation.
Information Systems for the Financial Services Industry, University of Twente, Netherlands, Spring 2025. Coordinator. Developed curriculum.
Topics Covered
Foundations
- Markov Decision Processes (MDPs) – States, actions, transitions, rewards
- Value Functions – State-value and action-value functions
- Bellman Equations – Optimality conditions and dynamic programming
- Policy Evaluation and Improvement – Iterative methods for policy optimization
Deep Q-Learning
- DQN – Deep Q-Networks with experience replay
- Double DQN – Addressing overestimation bias
- Dueling DQN – Separate value and advantage streams
- Prioritized Experience Replay – Efficient sample utilization
Policy Gradient Methods
- REINFORCE – Monte Carlo policy gradient
- A2C / A3C – Advantage actor-critic with parallel environments
- PPO – Proximal Policy Optimization for stable training
- SAC – Soft Actor-Critic with entropy regularization
- TD3 – Twin Delayed Deep Deterministic Policy Gradient
Advanced Topics
- Multi-Agent RL – Market simulation with competing agents
- Model-Based RL – Learning environment dynamics
- Explainable RL – Interpretable policies for compliance with the European AI Act
- Offline RL – Learning from historical trading data
Financial Applications
Students implement RL agents for real-world financial problems:
- Stock Trading Strategies – Learning buy/sell/hold policies from market data
- Portfolio Rebalancing – Dynamic asset allocation with transaction costs
- Market Making – Optimal bid-ask spread management
- Risk Management – Hedging strategies and drawdown control
- Optimal Execution – Minimizing market impact in large orders
Doctoral Training
RL methods are also covered at the doctoral level:
- Reinforcement Learning for Finance, University of Twente, Netherlands, June 2024. Co-Organizer and Trainer.
- European Summer School in Financial Mathematics, TU Delft, Netherlands, September 2023. Lecturer on Deep Reinforcement Learning.
Prerequisites
- Machine Learning fundamentals
- Python programming (NumPy, PyTorch or TensorFlow)
- Probability and statistics
- Linear algebra
Assessment
- Implementation project (50%)
- Written report (30%)
- Presentation (20%)