Research
My research interests focus on LLM agents, multi-agent reinforcement learning, and their applications in decision-making and robotics.
|
|
Fine-tuning LLM Agents with Retrospective In-Context Online Learning
Wen-Tse Chen,
Jiayu Chen,
Fahim Tajwar,
Hao Zhu,
Xintong Duan,
Russ Salakhutdinov,
Jeff Schneider
NeurIPS Adaptive Foundation Models Workshop, 2024 (Oral presentation)
Presented a sample-efficient method for online fine-tuning LLM agents by using in-context learning to convert sparse feedback into dense signals, enabling LLMs to adapt to dynamic environments with minimal data.
|
|
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization
Wen-Tse Chen,
Shiyu Huang,
Jeff Schneider,
Enhanced QMIX with maximum entropy RL for better exploration, ensuring locally optimal actions match global optima via an order preserving transformation, achieving SOTA performance on SAMC-v2 benchmark.
|
|
OpenRL: an open-source reinforcement learning research framework
Shiyu Huang,
Wen-Tse Chen,
Yiwen Sun,
Fuqing Bie,
Wei-Wei Tu
An open source framework supports single-agent RL, multi-agent RL, RLHF, and self-play training.
|
|
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Wen-Tse Chen,
Shiyu Huang,
Yuan Chiang,
Tim Pearce,
Wei-Wei Tu,
Chen Ting,
Zhu Jun,
The 38th Annual AAAI Conference on Artificial Intelligence (AAAI2024)
Proposed an on-policy framework for discovering multiple diverse optimal strategies for the same task in a single training process.
|
|
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play
Fanqi Lin*,
Shiyu Huang*,
Tim Pearce,
Wen-Tse Chen,
Wei-Wei Tu
The 22nd International Conference on Autonomous Agents and Multiagent Systems(AAMAS2023)
Created an on-policy MARL algorithm, along with an adaptive curriculum learning approach and a unique self-play strategy, for excelling in the Google Research Football game.
|
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations
Shiyu Huang*,
Wen-Tse Chen*,
Longfei Zhang,
Shizhen Xu,
Ziyang Li,
Fengming Zhu,
Deheng Ye,
Ting Chen,
Jun Zhu
NeurIPS-21 Workshop: 2nd Offline Reinforcement Learning Workshop
Developed a distributed learning system and new offline algorithms to learn a powerful multi-agent AI from the fixed single-agent dataset.
|
|