About Jing-Cheng Pang (庞竟成):


I am currently an AI Researcher at Huawei, working on data for building Domain Agents with reinforcement learning and large language models. I joined Huawei in July 2025 through the TopMinds Talent Program. Previously, I received my PhD in June 2025 from Nanjing University (LAMDA Group), where I was very fortunate to be supervised by Prof. Yang Yu in my research on reinforcement learning. From July to October 2024, I was a visiting scholar with Professor Masashi Sugiyama's team at RIKEN-AIP in Tokyo, Japan. Prior to that, I obtained my BSc from the University of Electronic Science and Technology of China (UESTC) in June 2019.

Research Overview

During my PhD, my research focused on connecting humans and intelligent agents through natural language. By integrating reinforcement learning (RL) and large language models (LLMs), I aim to develop systems that not only interpret human intent but also act autonomously and learn iteratively in dynamic environments. In particular, my work covers:

Reinforcement Learning: language-conditioned RL, optimization algorithms, imitation learning, generalist agents;
Large Language Models: model training, inference-time optimization, LLM-based agents;
Embodied AI: home-service robots, sim2real policy learning.

Currently, I am exploring how to leverage RL and LLMs to build Domain Agents that can autonomously perform complex tasks in specific domains (e.g., Wireless, Datacom) by understanding and executing human instructions. Specifically:

Data Research: data construction and orchestration for building domain agents.
Model Optimization: effective mechanisms for optimizing LLMs using RL.

Feel free to contact or follow me if you are interested in my work.


Selected Publications

Jing-Cheng Pang, Nan Tang, Kaiyuan Li, Yuting Tang, Xin-Qiang Cai, Zhen-Yu Zhang, Gang Niu, Masashi Sugiyama and Yang Yu. Learning View-invariant World Models for Visual Robotic Manipulation. In: ICLR, 2025. [paper]
Peng-Yuan Wang, Jing-Cheng Pang, Chen-Yang Wang, Xu-Hui Liu, Tian-Shuo Liu, Si-Hang Yang, Hong Qian and Yang Yu. InCLET: In-context Learning from Language Models can Improve Embodied Instruction-following. In: AAMAS (Oral), 2025. [paper]
Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang and Yang Yu. KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts. In: NeurIPS, 2024. [paper]
Jing-Cheng Pang, Peng-Yuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang and Yang Yu. Language Model Self-improvement by Reinforcement Learning Contemplation. In: ICLR, 2024. [paper]
Jing-Cheng Pang, Xinyu Yang, Si-Hang Yang, Xiong-Hui Chen and Yang Yu. Natural Language Instruction-following with Task-related Language Development and Translation. In: NeurIPS, 2023. [paper]
Jing-Cheng Pang, Tian Xu, Shengyi Jiang, Yu-Ren Liu and Yang Yu. Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization. TNNLS, to appear. [paper]
Shengyi Jiang, Jing-Cheng Pang and Yang Yu. Offline Imitation Learning with a Misspecified Simulator. In: NeurIPS, 2020. [paper]

[Full publication list][Google Scholar]


Projects

Awesome-Papers-Autonomous-Agent
A collection of recent papers on building autonomous agents. It covers two topics: RL-based agents and LLM-based agents.
ImagineBench
A benchmark for evaluating RL algorithms that train policies using both real data and imaginary rollouts from LLMs.
