|
Understanding World Models through Multi-Step Pruning Policy via Reinforcement Learning
Zhiqiang He,
Wen Qiu,
Wei Zhao,
Xun Shao,
Zhi Liu
Information Sciences, 2025. (IF=8.1, Q1)
Source Code |
Download PDF
Parallel multi-step pruning policies enhance sampling diversity. (Includes a convergence analysis of MSPP and its policy gradient (PG) theorem.)
|
|
Erlang planning network: An iterative model-based reinforcement learning with multi-perspective
Jiao Wang,
Lemin Zhang,
Zhiqiang He,
Can Zhu,
Zihui Zhao
Pattern Recognition, 2022. (IF=8.5, Q1)
Source Code |
Download PDF
Bi-level reinforcement learning within model-based reinforcement learning.
|
|
Control Strategy of Speed Servo Systems Based on Deep Reinforcement Learning
Pengzhan Chen,
Zhiqiang He,
Chuanxi Chen,
Jiahong Xu
Algorithms, 2018
Source Code |
Download PDF (Cited 53 times)
First paper to apply reinforcement learning to a jump speed servo system.
|
|