Research Interests
I am broadly interested in reinforcement learning (RL) and sequential decision making, with an emphasis on building
more general and reliable AI agents. A long-term goal of my research is to use AI to conduct scientific research itself,
enabling agents that can discover, test, and refine hypotheses and ultimately reshape how we do science.
The RL questions I currently focus on are:
- Data-efficient reinforcement learning under limited samples
- Stable and robust training for long-horizon control
- Large-scale RL and deployment in real systems
- Continual / sustainable learning under non-stationary environments
So far, much of my work has used concrete application scenarios (e.g., communication networks, traffic systems, multimedia)
as testbeds for these RL questions. Going forward, I plan to gradually move from "RL for complex control"
to AI for scientific discovery, starting from well-defined controlled systems and then expanding to broader scientific domains.
|
Publication / Preprint
For the full list of my publications, please visit the Publications page.
|
Collaboration
I am always happy to discuss ideas and potential collaborations on reinforcement learning, especially topics related to
data efficiency, stability, large-scale deployment, and continual learning.
If you are interested in working together (e.g., on a joint paper or project), feel free to email me with a brief introduction,
your background, and the kinds of problems you would like to work on.
|
News
- 2026: Paper on a plasticity-aware mixture of experts for adaptive video streaming accepted to IEEE Transactions on Multimedia.
- 2025: Recognized as JST Next-Generation Researcher.
- 2025: Several papers on world models, multi-agent RL for traffic, and DRL-based UAV communications accepted to leading journals.
- 2019: Selected as Outstanding Graduate (Top 1%).
|
Teaching & Service
I am committed to contributing to the research community and mentoring young researchers.
Journal Reviewing:
IEEE Transactions on Network Science and Engineering;
IEEE Internet of Things Journal;
IEEE Open Journal of the Computer Society.
Mentoring:
I regularly mentor junior students and collaborators on topics related to reinforcement learning and its applications.
|