Reinforcement Learning · UEC Tokyo · 2026

Building reliable agents under uncertainty.

I'm Zhiqiang He (何志强) — a Ph.D. researcher at UEC Tokyo, working on plasticity, world models, and large-scale RL systems deployed in real environments.

Read research View publications Get in touch

Scroll

42.8

Highest impact factor (IEEE COMST 2025)

Q1 papers across IEEE TMM, PR, InfoSci, CTR

10K

Followers on Zhihu

¥2.2M

JST Next-Generation Researcher (2025-27)

Selected venues · reviewing & publishing

IEEE Transactions on Multimedia ◆ IEEE Communications Surveys & Tutorials ◆ Pattern Recognition ◆ Information Sciences ◆ Communications in Transportation Research ◆ IEEE Internet of Things Journal ◆ IEEE TNSE ◆ ACM Multimedia ◆ Algorithms ◆ IEEE Transactions on Multimedia ◆ IEEE Communications Surveys & Tutorials ◆ Pattern Recognition ◆ Information Sciences ◆ Communications in Transportation Research ◆ IEEE Internet of Things Journal ◆ IEEE TNSE ◆ ACM Multimedia ◆ Algorithms ◆

Research pillars

Four threads, one goal: agents that don't break.

Read all themes

Adapting under shift

Recent work I'm most proud of.

All publications

IEEE TMM · 2026 Q1 · IF 9.7

Plasticity-Aware Mixture of Experts for Learning Under QoE Shifts in Adaptive Video Streaming

Zhiqiang He , Zhi Liu

Accepted

InfoSci · 2024 Q1 · IF 8.1

Understanding World Models through Multi-Step Pruning Policy via Reinforcement Learning

Zhiqiang He , Wen Qiu , Wei Zhao , Xun Shao , Zhi Liu

Published

IEEE COMST · 2025 Q1 · IF 42.8

A Survey on DRL based UAV Communications and Networking: DRL Fundamentals, Applications and Implementations

Wei Zhao , Shaoxin Cui , Wen Qiu ^*, Zhiqiang He ^*, Zhi Liu , Xiao Zheng , Bomin Mao , Nei Kato

Published

News & milestones

Selected moments.

Jan 2026

Paper accepted to IEEE Transactions on Multimedia (PA-MoE).
Sep 2025

Works on multi-agent RL for traffic, and DRL-based UAV communications accepted to leading journals.
Apr 2025

Recognized as JST Next-Generation Researcher (¥2.2M / year, 2025–2027).
Apr 2024

Started Ph.D. at the University of Electro-Communications (UEC), Tokyo with Prof. Zhi Liu.
May 2023

Concluded role as Reinforcement Learning Algorithms Engineer at InspirAI (Top-Performing Team Prize).
Jun 2021

Joined Baidu (Beijing) as Reinforcement Learning Research Intern — Super Special Offer.
Jun 2019

Selected as Outstanding Graduate (Top 1%) at East China Jiaotong University.

Open to collaboration

Let's build agents that don't break.

Always happy to discuss reinforcement learning — data efficiency, stability, large-scale deployment, continual learning. Drop a note with a brief intro.

Email me GitHub Google Scholar

Building reliable agents under uncertainty.

Four threads, one goal: agents that don't break.

Plasticity

World Models

Multi-Agent RL

Real-System RL