×

Constrained decoupling adaptive dynamic programming for a partially uncontrollable time-delayed model of energy systems. (English) Zbl 1542.90244

Summary: The lack of feedback-loop design and uncontrollable state reconstruction, such as load demand and renewable generation, are key factors to hinder the development of energy control. Adaptive dynamic programming is an efficient method to break through the bottleneck. However, most current works assume that the evolution of uncontrollable state satisfies Markovian property. This paper proposes a time-delayed adaptive dynamic programming framework for optimal energy control, utilizing more comprehensive historical data for state reconstruction. The uncontrollable state is reconstructed by an attention mechanism-based Transformer network based on online measured data. Then, a self-learning iterative algorithm is developed to obtain the optimal state feedback control policy under a discounted performance index function. The problem of solving iterative control policy, which is a mixed-integer nonlinear programming problem with an enormous computational burden, is decomposed into three sub-problems, i.e. nonlinear programming with no discrete variable, which is solved by the Projected Newton method, mixed-integer quadratic programming, and quadratic programming. Numerical results illustrate that the developed control algorithm minimizes the electricity cost in the long term and avoids peak load.

MSC:

90C39 Dynamic programming
90C35 Programming involving graphs or networks
90C90 Applications of mathematical programming

Software:

BERT; Tensor2Tensor
Full Text: DOI

References:

[1] Pinzon, J. A.; Vergara, P. P.; da Silva, L. C.P.; Rider, M. J., Optimal management of energy consumption and comfort for smart buildings operating in a microgrid, IEEE Transactions on Smart Grid, 10, 3, 3236-3247 (2019)
[2] Farzin, H.; Fotuhi-Firuzabad, M.; Moeini-Aghtaie, M., A stochastic multi-objective framework for optimal scheduling of energy storage systems in microgrids, IEEE Transactions on Smart Grid, 8, 1, 117-127 (2017)
[3] Zhao, Z.; Guo, J.; Lai, C. S.; Xiao, H.; Zhou, K.; Lai, L. L., Distributed model predictive control strategy for islands multimicrogrids based on noncooperative game, IEEE Transactions on Industrial Informatics, 17, 6, 3803-3814 (2021)
[4] Zhang, G.; Yuan, J.; Li, Z.; Yu, S. S.; Chen, S.-Z.; Trinh, H.; Zhang, Y., Forming a reliable hybrid microgrid using electric spring coupled with non-sensitive loads and ess, IEEE Transactions on Smart Grid, 11, 4, 2867-2879 (2020)
[5] Zhong, W.; Xie, K.; Liu, Y.; Yang, C.; Xie, S.; Zhang, Y., Online control and near-optimal algorithm for distributed energy storage sharing in smart grid, IEEE Transactions on Smart Grid, 11, 3, 2552-2562 (2020)
[6] Ha, M.; Wang, D.; Liu, D., Event-triggered constrained control with dhp implementation for nonaffine discrete-time systems, Information Sciences, 519, 110-123 (2020) · Zbl 1461.93306
[7] Ji, L.; Wang, C.; Zhang, C.; Wang, H.; Li, H., Optimal consensus model-free control for multi-agent systems subject to input delays and switching topologies, Information Sciences, 589, 497-515 (2022) · Zbl 1536.93798
[8] Peng, Z.; Zhao, Y.; Hu, J.; Ghosh, B. K., Data-driven optimal tracking control of discrete-time multi-agent systems with two-stage policy iteration algorithm, Information Sciences, 481, 189-202 (2019) · Zbl 1451.93228
[9] Wang, D.; Ha, M.; Qiao, J., Data-driven iterative adaptive critic control toward an urban wastewater treatment plant, IEEE Transactions on Industrial Electronics, 68, 8, 7362-7369 (2021)
[10] Wang, D.; Qiao, J.; Cheng, L., An approximate neuro-optimal solution of discounted guaranteed cost control design, IEEE Transactions on Cybernetics, 52, 1, 77-86 (2022)
[11] Fuselli, D.; De Angelis, F.; Boaro, M.; Squartini, S.; Wei, Q.; Liu, D.; Piazza, F., Action dependent heuristic dynamic programming for home energy resource scheduling, International Journal of Electrical Power & Energy Systems, 48, 148-160 (2013)
[12] Liu, D.; Xu, Y.; Wei, Q.; Liu, X., Residential energy scheduling for variable weather solar energy based on adaptive dynamic programming, IEEE/CAA Journal of Automatica Sinica, 5, 1, 36-46 (2018)
[13] Zhao, Z.; Keerthisinghe, C., A fast and optimal smart home energy management system: State-space approximate dynamic programming, IEEE Access, 8, 184151-184159 (2020)
[14] Yuan, J.; Yu, S. S.; Zhang, G.; Lim, C. P.; Trinh, H.; Zhang, Y., Design and hil realization of an online adaptive dynamic programming approach for real-time economic operations of household energy systems, IEEE Transactions on Smart Grid, 13, 1, 330-341 (2022)
[15] Shuai, H.; Fang, J.; Ai, X.; Tang, Y.; Wen, J.; He, H., Stochastic optimization of economic dispatch for microgrid based on approximate dynamic programming, IEEE Transactions on Smart Grid, 10, 3, 2440-2452 (2019)
[16] Li, H.; Wan, Z.; He, H., Real-time residential demand response, IEEE Transactions on Smart Grid, 11, 5, 4144-4154 (2020)
[17] Shang, Y.; Wu, W.; Guo, J.; Ma, Z.; Sheng, W.; Lv, Z.; Fu, C., Stochastic dispatch of energy storage in microgrids: An augmented reinforcement learning approach, Applied Energy, 261, Article 114423 pp. (2020)
[18] Shuai, H.; He, H., Online scheduling of a residential microgrid via monte-carlo tree search and a learned model, IEEE Transactions on Smart Grid, 12, 2, 1073-1087 (2021)
[19] Wei, Q.; Liu, D.; Shi, G., A novel dual iterative q-learning method for optimal battery management in smart residential environments, IEEE Transactions on Industrial Electronics, 62, 4, 2509-2518 (2015)
[20] Wei, Q.; Liu, D.; Liu, Y.; Song, R., Optimal constrained self-learning battery sequential management in microgrid via adaptive dynamic programming, IEEE/CAA Journal of Automatica Sinica, 4, 2, 168-176 (2017)
[21] Wei, Q.; Shi, G.; Song, R.; Liu, Y., Adaptive dynamic programming-based optimal control scheme for energy storage systems with solar renewable energy, IEEE Transactions on Industrial Electronics, 64, 7, 5468-5478 (2017)
[22] Wei, Q.; Liao, Z.; Song, R.; Zhang, P.; Wang, Z.; Xiao, J., Self-learning optimal control for ice-storage air conditioning systems via data-based adaptive dynamic programming, IEEE Transactions on Industrial Electronics, 68, 4, 3599-3608 (2021)
[23] Wei, Q.; Liao, Z.; Shi, G., Generalized actor-critic learning optimal control in smart home energy management, IEEE Transactions on Industrial Informatics, 17, 10, 6614-6623 (2021)
[24] Shuai, H.; Fang, J.; Ai, X.; Wen, J.; He, H., Optimal real-time operation strategy for microgrid: An adp-based stochastic nonlinear optimization approach, IEEE Transactions on Sustainable Energy, 10, 2, 931-942 (2019)
[25] Zhu, Y.; Zhao, D.; Li, X.; Wang, D., Control-limited adaptive dynamic programming for multi-battery energy storage systems, IEEE Transactions on Smart Grid, 10, 4, 4235-4244 (2019)
[26] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, L. Kaiser, I. Polosukhin, Attention is all you need (2017). arXiv:1706.03762.
[27] J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, Bert: Pre-training of deep bidirectional transformers for language understanding (2019). arXiv:1810.04805.
[28] H. Zhou, S. Zhang, J. Peng, S. Zhang, J. Li, H. Xiong, W. Zhang, Informer: Beyond efficient transformer for long sequence time-series forecasting (2021). arXiv:2012.07436.
[29] Lewis, F. L.; Vrabie, D.; Vamvoudakis, K. G., Reinforcement learning and feedback control: Using natural decision methods to design optimal adaptive controllers, IEEE Control Systems Magazine, 32, 6, 76-105 (2012) · Zbl 1395.93584
[30] Si, J.; Wang, Y.-T., Online learning control by association and reinforcement, IEEE Transactions on Neural Networks, 12, 2, 264-276 (2001)
[31] Tassa, Y.; Mansard, N.; Todorov, E., Control-limited differential dynamic programming, IEEE International Conference on Robotics and Automation (ICRA), 2014, 1168-1175 (2014)
[32] Dataport. [Online]. Available: https://www.dataport.cloud (2019).
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.