Yiqun Chen

Ph.D. Candidate at GSAI, RUC.

personal_picture.jpg

Gaoling School of AI

Renmin University of China

Beijing, China


chenyiqun990321
@{ruc.edu.cn, gmail.com}

13853687820

@chenyiqun223336

My name is Yiqun Chen (ι™ˆι€ΈηΎ€). Currently, I am pursuing my Ph.D. at the Gaoling School of Artificial Intelligence, Renmin University of China (RUC), under the guidance of Prof. Jiaxin Mao.

πŸ”¬ Research Interests

My research interests primarily lie in Multi-Agent Reinforcement Learning and Agentic Search:

  • LLM Agent & Reinforcement Learning:
  • AI Search:
    • Retrieval-Augmented Generation (RAG)
    • Agentic Search & Deep Search/Research
  • Information Retrieval (IR):
    • Large Language Models for Ranking (LLM4Ranking)
    • Application of Reinforcement Learning for IR (e.g., RL for Diversified Search)

🏒 Industry Collaboration & Leadership

πŸš€ Recent Focus: Multi-Agent/Agent-Swarm Joint Optimization (RL)

Recently, I have maintained close collaborations with leading tech companies on LLM-based Multi-Agent RL, leading the development of UnityMAS-O, a Ray + veRL-based multi-agent reinforcement learning framework that supports customizable agent workflows, flexible agent-to-model mapping, and scalable distributed PPO optimization across shared, partially shared, or independent models.

🌟 Previous Internships My internship experiences include:

  • XiaoHongShu (Dots Agent & AI Search) (✨Ace Top Intern Program): End-to-end Multi-Agent RL optimization and full-link Agent research.
  • Baidu (Search Dept. & Intelligent Cloud): Agentic Search, Dumate Agent research.
  • ByteDance (Feishu/Lark): Memory-augmented AI search.
  • Huawei (Noah’s Ark Decision Making & Reasoning Lab): Multi-Agent Reinforcement Learning (MARL).
  • DiDi Chuxing (Ride-hailing Dept.): Pick-up/Drop-off location recommendation.

πŸ‘¨β€πŸŽ“ Job Market: Fall 2026 Internship

As a prospective Ph.D. graduate (Class of 2027), I am actively seeking a Fall 2026 Internship (targeting the 2027 campus recruitment season).

🀝 Why me? My mission is to build robust, scalable Multi-Agent paradigms and efficient infra/training framework. I prioritize practical utility over theoretical narratives (rejecting mere β€œstorytelling”). I am dedicated to bringing tangible performance gains and genuine, deployable innovation to industrial scenarios.

If you are looking for a researcher who focuses on what actually works, please contact me!


πŸŽ“ Education

  • Ph.D. Candidate in Artificial Intelligence Gaoling School of Artificial Intelligence (GSAI), Renmin University of China (RUC) 2023 - 2027 (Expected)

  • M.Sc. in Pattern Recognition and Intelligent Systems Institute of Automation, Chinese Academy of Sciences (CASIA) 2020 - 2023

  • B.Sc. in Automation Shandong University (SDU) 2016 - 2020


πŸ“° News

  • 2026.5: πŸŽ‰ One paper is accepted by ICML 2026.
  • 2025.12: πŸ”₯ We released a comprehensive survey: Deep Research: A Systematic Survey.
  • 2025.9: πŸŽ‰πŸŽ‰ Two papers are accepted by NeurIPS 2025.
  • 2025.8: πŸŽ‰ One paper is accepted by CIKM 2025.
  • 2025.7: πŸŽ‰ One paper is accepted by MM 2025.
  • 2025.6: πŸ”₯ Our AI Search Paradigm paper is publicly available.
  • 2025.1: πŸŽ‰πŸŽ‰ Two first-author papers are accepted by WWW 2025.
  • 2024.4: πŸŽ‰ One first-author paper is accepted by IJCAI 2024.
  • 2023.9: I joined Renmin University of China to pursue my Ph.D.
  • 2023.4: I joined the Search Department of Baidu Inc. as an algorithm intern.


πŸ—ΊοΈ Visitors


selected publications

  1. arXiv
    UnityMAS-O.png
    UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems
    Yiqun Chen, Wei Yang, Erhan Zhang, and 14 more authors
    arXiv preprint arXiv:2605.26646, 2026
  2. arXiv
    Tournament-GRPO.png
    Tournament-GRPO: Group-Wise Tournament Rewards for Reinforcement Learning in Open-Ended Long-Form Generation
    Zixuan Yang*, Yiqun Chen*, Wei Yang, and 7 more authors
    arXiv preprint arXiv:2605.26958, 2026
  3. arXiv
    PRAISE.png
    PRAISE: Prefix-Based Rollout Reuse in Agentic Search Training
    Erhan Zhang*, Yiqun Chen*, Zechun Niu, and 6 more authors
    arXiv:2604.03675, 2026
  4. ICML 2026
    JADE.png
    JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG
    Yiqun Chen, Erhan Zhang, Tianyi Hu, and 8 more authors
    arXiv preprint arXiv:2601.21916, 2026
  5. arXiv
    SCMA.png
    Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning
    Yiqun Chen, Jinyuan Feng, Wei Yang, and 9 more authors
    arXiv preprint arXiv:2601.21919, 2026
  6. arXiv
    M-ASK.png
    Beyond Monolithic Architectures: A Multi-Agent Search and Knowledge Optimization Framework for Agentic Search
    Yiqun Chen, Lingyong Yan, Zixuan Yang, and 5 more authors
    arXiv preprint arXiv:2601.04703, 2026
  7. arXiv
    deep_research.png
    Deep Research: A Systematic Survey
    Zhengliang Shi#, Yiqun Chen#, Haitao Li, and 23 more authors
    arXiv preprint arXiv:2512.02038, 2025
  8. NeurIPS 2025
    mmoa-rag.png
    Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
    Yiqun Chen, Lingyong Yan, Weiwei Sun, and 6 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), Dec 2025
  9. NeurIPS 2025
    ssr.png
    Structured Spectral Reasoning for Frequency-Adaptive Multimodal Recommendation
    Wei Yang*, Rui Zhong*, Yiqun Chen*, and 2 more authors
    In Advances in Neural Information Processing Systems (NeurIPS), Dec 2025
  10. arXiv
    mao-arag.png
    MAO-ARAG: Multi-Agent Orchestration for Adaptive Retrieval-Augmented Generation
    Yiqun Chen, Erhan Zhang, Lingyong Yan, and 4 more authors
    arXiv preprint arXiv:2508.01005, Dec 2025
  11. Baidu
    baidu_search.png
    Towards AI Search Paradigm (Technical Report of Baidu AI Search)
    Yuchen Li, Hengyi Cai, Rui Kong, and 18 more authors
    arXiv preprint arXiv:2506.17188, Dec 2025
  12. WWW 2025
    ma4div.png
    MA4DIV: Multi-Agent Reinforcement Learning for Search Result Diversification (Oral Presentation (Rate: Β 6%))
    Yiqun Chen, Jiaxin Mao, Yi Zhang, and 7 more authors
    In Proceedings of the ACM on Web Conference (WWW), Dec 2025
  13. WWW 2025
    tourrank.png
    TourRank: Utilizing Large Language Models for Documents Ranking with a Tournament-Inspired Strategy (Oral Presentation (Rate: Β 6%))
    Yiqun Chen, Qi Liu, Yi Zhang, and 6 more authors
    In Proceedings of the ACM on Web Conference (WWW), Dec 2025
  14. IJCAI 2024
    ptde.png
    PTDE: Personalized Training with Distilled Execution for Multi-Agent Reinforcement Learning
    Yiqun Chen, Hangyu Mao, Jiaxin Mao, and 5 more authors
    In International Joint Conference on Artificial Intelligence (IJCAI), Dec 2024
  15. IJCNN 2022
    csrl.png
    Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems
    Yiqun Chen, Wei Yang, Tianle Zhang, and 2 more authors
    In International Joint Conference on Neural Networks (IJCNN), Dec 2022
  16. ICONIP 2022
    Multi-Agent Hyper-Attention Policy Optimization
    Bin Zhang*, Zhiwei Xu*, Yiqun Chen*, and 4 more authors
    In International Conference on Neural Information Processing (ICONIP), Dec 2022