Experience
Between June 2022 and May 2023, I served as a Reinforcement Learning Algorithms Engineer at
InspirAI. I put forward and optimized a general artificial intelligence modeling paradigm suitable for card games,
which was successfully deployed in Hearthstone, Dou Dizhu (defeated professional players), and
Guan Dan. Notably, The Doudizhu AI has been launched on the Taptop platform.
In the summer of 2021, I had the opportunity to intern as a Research Engineer at Baidu AI Cloud in
Beijing. I developed an innovative
multi-agent cooperative adversarial algorithm, which we termed Expert Data-Assisted Multi-Agent
Proximal Policy Optimization (EDA-MAPPO). Our work finally released a video
showing the performance of our algorithm, which has been published in Bilibili
and the accompanying Source Code.
At the same time, we called "superfly" team completed a machine learning for combinatorial optimization competition (9/23).
|
|