首页 | 本学科首页   官方微博 | 高级检索  
     


Learning equilibrium mean-variance strategy
Authors:Min Dai  Yuchao Dong  Yanwei Jia
Affiliation:1. Department of Applied Mathematics, Faculty of Science, and School of Accounting and Finance, Faculty of Business, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong;2. School of Mathematical Sciences, Tongji University, Shanghai, China;3. Department of Industrial Engineering and Operations Research, Columbia University, New York, New York, USA
Abstract:
We study a dynamic mean-variance portfolio optimization problem under the reinforcement learning framework, where an entropy regularizer is introduced to induce exploration. Due to the time–inconsistency involved in a mean-variance criterion, we aim to learn an equilibrium policy. Under an incomplete market setting, we obtain a semi-analytical, exploratory, equilibrium mean-variance policy that turns out to follow a Gaussian distribution. We then focus on a Gaussian mean return model and propose a reinforcement learning algorithm to find the equilibrium policy. Thanks to a thoroughly designed policy iteration procedure in our algorithm, we prove the convergence of our algorithm under mild conditions, despite that dynamic programming principle and the usual policy improvement theorem failing to hold for an equilibrium policy. Numerical experiments are given to demonstrate our algorithm. The design and implementation of our reinforcement learning algorithm apply to a general market setup.
Keywords:asset allocation  equilibrium mean variance analysis  entropy regularized exploration-exploitation  reinforcement learning
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号