zh

Reinforcement Learning · UEC Tokyo · 2026

Building reliable agents under uncertainty.

I'm Zhiqiang He (何志强) — a Ph.D. researcher at UEC Tokyo, working on plasticity, world models, and large-scale RL systems deployed in real environments.

Scroll

42.8

Highest impact factor (IEEE COMST 2025)

7

Q1 papers across IEEE TMM, PR, InfoSci, CTR

10K

Followers on Zhihu

¥2.2M

JST Next-Generation Researcher (2025-27)

Selected venues · reviewing & publishing

Research pillars

Four threads, one goal: agents that don't break.

Read all themes

Selected publications

Recent work I'm most proud of.

All publications

News & milestones

Selected moments.

  1. Jan 2026

    Paper accepted to IEEE Transactions on Multimedia (PA-MoE).

  2. Sep 2025

    Works on multi-agent RL for traffic, and DRL-based UAV communications accepted to leading journals.

  3. Apr 2025

    Recognized as JST Next-Generation Researcher (¥2.2M / year, 2025–2027).

  4. Apr 2024

    Started Ph.D. at the University of Electro-Communications (UEC), Tokyo with Prof. Zhi Liu.

  5. May 2023

    Concluded role as Reinforcement Learning Algorithms Engineer at InspirAI (Top-Performing Team Prize).

  6. Jun 2021

    Joined Baidu (Beijing) as Reinforcement Learning Research Intern — Super Special Offer.

  7. Jun 2019

    Selected as Outstanding Graduate (Top 1%) at East China Jiaotong University.

Open to collaboration

Let's build agents that don't break.

Always happy to discuss reinforcement learning — data efficiency, stability, large-scale deployment, continual learning. Drop a note with a brief intro.