Zhuosheng Zhang

Tenure-Track Assistant Professor
School of Computer Science
Shanghai Jiao Tong University
Email: zhangzs@sjtu.edu.cn
Office: School of Software 5213
800 Dongchuan Road, Shanghai

Profile

I am a tenure-track assistant professor at Shanghai Jiao Tong University. I received my Ph.D. degree and my M.S. degree from Shanghai Jiao Tong University in 2023 and 2020, respectively. I was an intern at Amazon Web Services, Microsoft Research Redmond, Langboat Tech, NICT (Japan), and IBM. I have served as an action editor for ACL Rolling Review, and a (senior) area chair for ACL, NeurIPS, and EMNLP.

My research interests include natural language processing, LLM reasoning, LLM agents, and LLM safety. I have published over 100 papers in top-tier conferences and journals, including Nature Communications, TPAMI, ICML, ICLR, ACL, AAAI, EMNLP, TNNLS, TASLP, and COLING. I have won 1st place in various language understanding and reasoning leaderboards, such as SQuAD2.0, MuTual, RACE, ShARC, and CMRC. I was awarded as an Academic Star at Shanghai Jiao Tong University and was selected as one of the Global Top 100 Chinese Rising Stars in Artificial Intelligence. I won the Excellent Doctoral Thesis of Chinese Information Processing Society (CIPS), WAIC 2024 Youth Outstanding Paper Award, WAIC 2024 YunFan Award: Bright Star, and Baidu Scholarship.

Recent Projects

Prospective students: We are actively looking for undergraduate interns at SJTU. We expect applicants to have some prior experience in AI/NLP/ML (prior research experience is not required), and a minimum of 10 hours per week commitment to research. Please email me with your CV if you are interested.

Teaching

Courses:
    • NIS3353: Artificial Intelligence Security
      Undergraduate, Shanghai Jiao Tong University, 2024-
    • NIS8021: Frontier Technology in Natural Language Processing
      Graduate, Shanghai Jiao Tong University, 2024-
Tutorials:
    • For Beginners: Dive into LLMs《动手学大模型》系列编程实践教程 New Updates! (May 2025)
    • CVPR 2024: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond
      Hao Fei, Yuan Yao, Ao Zhang, Haotian Liu, Fuxiao Liu, Zhuosheng Zhang, Shuicheng Yan.
      Seattle WA, USA
      [Website]
    • LREC-COLING 2024: From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and Beyond
      Hao Fei, Yuan Yao, Zhuosheng Zhang, Fuxiao Liu, Ao Zhang, Tat-Seng Chua.
      Torino, Italia
      [Website]
    • IJCNLP-AACL 2023: Learning WHO Saying WHAT to WHOM in Multi-Party Conversations
      Jia-Chen Gu, Zhuosheng Zhang, and Zhen-Hua Ling.
      Bali, Indonesia.
      [Website]
    • IJCAI 2021: Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
      Zhuosheng Zhang and Hai Zhao.
      Montreal, Canada (Virtual)
      [Website]

Recent Talks

  • 2026/04: Keynote at ICLR 2026 MemAgents Workshop. [slides]
  • 2026/02: Talk "面向AI数字分身的个性化建模与流通" at CCF秀湖会议
  • 2025/11: Talk "大模型智能体推理机制分析:从推理泛化到言行合一" at LMG 2025大模型深度推理论坛 [slides]
  • 2025/10: Talk "智能体系统的技术架构、能力演化与全景评估" at CNCC 2025 AI Agent关键技术与应用论坛 [slides]
  • 2025/10: Talk "从被动工具到主动伙伴:探索心智驱动的OS Agent" at CNCC 2025 面向移动生态的Agentic AI论坛 [slides]
  • 2025/09: Talk "迈向可信赖的AI智能体:从隐式意图理解到拟人行为分析" at CIPS大模型前沿技术报告 [slides]
  • 2025/07: Talk "大模型时代的智能交互:OS Agent技术与挑战" at 上海交通大学大模型智能体暑期研学营 [slides]
  • 2024/09: Keynote "Caution for the environment: Multimodal Agents are Susceptible to Environmental Distractions" at CJNLP 2024. [slides]
  • 2024/08: Keynote "Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities" at Knowledge-Augmented NLP Workshop @ ACL 2024. [slides]

Selected Publications

Discover google scholar | semantic scholar | dblp.
[Preprint]
    • ColorAgent: Building A Robust, Personalized, and Interactive OS Agent
      Ning Li, Qiqiang Lin, Zheng Wu, Xiaoyun Mo, Weiming Zhang, Yin Zhao, Xiangmou Qu, Jiamu Zhou, Jun Wang, Congmin Zheng, Yuanyi Song, Hongjiang Chen, Heyuan Huang, Jihong Wang, Jiaxin Yin, Jingwei Yu, Junwei Liao, Qiuying Peng, Xingyu Lou, Jun Wang, Weiwen Liu*, Zhuosheng Zhang*, Weinan Zhang
      Preprint, 2025
      [PDF] [Abstract]
    • ColorBench: Benchmarking Mobile Agents with Graph-Structured Framework for Complex Long-Horizon Tasks
      Yuanyi Song, Heyuan Huang, Qiqiang Lin, Yin Zhao, Xiangmou Qu, Jun Wang, Xingyu Lou, Weiwen Liu, Zhuosheng Zhang, Jun Wang, Yong Yu, Weinan Zhang, Zhaoxiang Wang
      Preprint, 2025
      [PDF] [Abstract]
    • ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in Massive-agent Ecosystem
      Fangwen Wu, Zheng Wu, Jihong Wang, Yunku Chen, Ruiguang Pei, Heyuan Huang, Xin Liao, Xingyu Lou, Huarong Deng, Zhihui Fu, Weiwen Liu, Zhuosheng Zhang, Weinan Zhang, Jun Wang
      Preprint, 2025
      [PDF] [Abstract]
    • The Hunger Game Debate: On the Emergence of Over-Competition in Multi-Agent Systems
      Xinbei Ma, Ruotian Ma, Xingyu Chen, Zhengliang Shi, Mengru Wang, Jen-tse Huang, Qu Yang, Wenxuan Wang, Fanghua Ye, Qingxuan Jiang, Mengfei Zhou, Zhuosheng Zhang*, Rui Wang, Hai Zhao, Zhaopeng Tu*, Xiaolong Li, Linus
      Preprint, 2025
      [PDF] [Abstract]
    • VeriOS: Query-Driven Proactive Human-Agent-GUI Interaction for Trustworthy OS Agents
      Zheng Wu, Heyuan Huang, Xingyu Lou, Xiangmou Qu, Pengzhou Cheng, Zongru Wu, Weiwen Liu, Weinan Zhang, Jun Wang, Zhaoxiang Wang*, Zhuosheng Zhang*
      Preprint, 2025
      [PDF] [Abstract]
    • Agent-ScanKit: Unraveling Memory and Reasoning of Multimodal Agents via Sensitivity Perturbations
      Pengzhou Cheng, Lingzhong Dong, Zeng Wu, Zongru Wu, Zhuosheng Zhang*, Gongshen Liu*
      Preprint, 2025
      [PDF] [Abstract]
    • Say One Thing, Do Another? Diagnosing Reasoning-Execution Gaps in VLM-Powered Mobile-Use Agents
      Lingzhong Dong, Ziqi Zhou, Shuaibo Yang, Haiyue Sheng, Pengzhou Cheng, Zongru Wu, Zheng Wu, Gongshen Liu*, Zhuosheng Zhang*
      Preprint, 2025
      [PDF] [Abstract]
[2026]
    • See, Think, Act: Teaching Multimodal Agents to Effectively Interact with GUI by Identifying Toggles
      Zongru Wu, Rui Mao, Zhiyuan Tian, Pengzhou Cheng, Tianjie Ju, Zheng Wu, Lingzhong Dong, Haiyue Sheng, Zhuosheng Zhang*, Gongshen Liu*.
      CVPR, 2026
      [PDF] [Abstract]
    • Training High-Level Schedulers with Execution-Feedback Reinforcement Learning for Long-Horizon GUI Automation
      Zehao Deng, Tianjie Ju, Zheng Wu, Zhuosheng Zhang*, Gongshen Liu.
      CVPR, 2026
      [PDF] [Abstract]
    • LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
      Zihe Yan, Zhuosheng Zhang*, Jiaping Gui, Gongshen Liu.
      CVPR, 2026
      [PDF] [Abstract]
    • DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
      Zhiwei He, Tian Liang, Jiahao Xu, Qiuzhi Liu, Xingyu Chen, Yue Wang, Linfeng Song, Dian Yu, Zhenwen Liang, Wenxuan Wang, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu.
      ICLR, 2026
      [PDF] [Abstract]
      • Auditing Partial Dataset Usage in Large Language Models Via Fuzzy Membership Aggregation
        Hongyu Zhu, Sichu Liang, Bofan Chen, Shilin Wang, Zhuosheng Zhang, Weiping Ding.
        IEEE Transactions on Fuzzy Systems, 2026
        [PDF] [Abstract]
        • Generalizable and Adaptive Continual Learning Framework for AI-generated Image Detection
          Hanyi Wang, Jun Lan, Yaoyu Kang, Huijia Zhu, Weiqiang Wang, Zhuosheng Zhang, Shilin Wang.
          IEEE Transactions on Multimedia, 2026
          [PDF] [Abstract]
        • GEM: Gaussian Embedding Modeling for Out-of-Distribution Detection in GUI Agents
          Zheng Wu, Pengzhou Cheng, Zongru Wu, Lingzhong Dong, Zhuosheng Zhang*
          AAAI, 2026
          [PDF] [Abstract]
        • An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains
          Zihe Yan, Kai Luo, Haoyu Yang, Yang Yu, Zhuosheng Zhang*, Guancheng Li*
          AAAI, 2026
          [PDF] [Abstract]
      [2025 & Before]
        • Discourse-Aware Language Representation
          Zhuosheng Zhang#, Siru Ouyang#, Hai Zhao*
          TPAMI, 2025
          [PDF] [Abstract]
        • Universal Multimodal Representation for Language Understanding
          Zhuosheng Zhang#, Kehai Chen, Rui Wang#, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao
          TPAMI, 2023
          "Let's retrieve images to overcome the lack of large-scale bilingual pairs."
          [PDF] [Abstract]
        • SG-Net: Syntax Guided Transformer for Language Representation
          Zhuosheng Zhang, Yuwei Wu, Junru Zhou, Sufeng Duan, Hai Zhao, Rui Wang.
          TPAMI, 2022
          [PDF] [Abstract]
        • Text Compression-aided Transformer Encoding
          Zuchao Li, Zhuosheng Zhang, Hai Zhao, Rui Wang, Kehai Chen, Masao Utiyama, Eiichiro Sumita.
          TPAMI, 2022
          [PDF] [Abstract]
      • Risks of AI Scientists: Prioritizing Safeguarding Over Autonomy
        Xiangru Tang, Qiao Jin, Kunlun Zhu, Tongxin Yuan, Yichi Zhang, Wangchunshu Zhou, Meng Qu, Yilun Zhao, Jian Tang, Zhuosheng Zhang, Arman Cohan, Zhiyong Lu, Mark Gerstein.
        Nature Communications, 2025
        [PDF] [Abstract]
      • Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
        Zhuosheng Zhang#, Yao Yao#, Aston Zhang, Xiangru Tang, Xinbei Ma, Zhiwei He, Yiming Wang, Mark Gerstein, Rui Wang, Gongshen Liu, Hai Zhao.
        ACM Computing Surveys, 2025
        "Join us on an exciting journey from chain-of-thought reasoning to language agent!"
        [PDF] [Abstract]
      • Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities
        Tianjie Ju, Yiting Wang, Xinbei Ma, Pengzhou Cheng, Haodong Zhao, Yulong Wang, Lifeng Liu, Jian Xie, Zhuosheng Zhang*, Gongshen Liu*.
        SCIS, 2025
        [PDF] [Abstract]
      • Do NOT Think That Much for 2+ 3=? On the Overthinking of o1-Like LLMs
        Xingyu Chen, Jiahao Xu, Tian Liang, Zhiwei He, Jianhui Pang, Dian Yu, Linfeng Song, Qiuzhi Liu, Mengfei Zhou, Zhuosheng Zhang, Rui Wang, Zhaopeng Tu, Haitao Mi, Dong Yu.
        ICML, 2025
        [PDF] [Abstract]
      • Caution for the Environment: LLM Agents are Susceptible to Environmental Distractions
        Xinbei Ma, Yiting Wang, Yao Yao, Tongxin Yuan, Aston Zhang, Zhuosheng Zhang*, Hai Zhao*.
        ACL, 2025
        [PDF] [Abstract]
      • You Only Look at Screens: Multimodal Chain-of-Action Agents
        Zhuosheng Zhang, Aston Zhang.
        ACL-Findings, 2024
        "Perform a task on smart phones? Train an agent using screenshots."
        [PDF] [Abstract] [slides]
      • Multimodal Chain-of-Thought Reasoning in Language Models
        Zhuosheng Zhang, Aston Zhang, Mu Li, Hai Zhao, George Karypis, Alex Smola.
        TMLR, 2024
        "Imagine learning a textbook with no figures: Multimodal-CoT surpasses humans on ScienceQA."
        Featured in Dive into Deep Learning (Adopted at 500 universities from 70 countries)
        [Top Trending Research on paperswithcode] [Idea Inspiration] [PDF] [Abstract]
      • Automatic Chain of Thought Prompting in Large Language Models
        Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola.
        ICLR, 2023
        "Let's think not just step by step, but also one by one."
        Featured in Dive into Deep Learning (Adopted at 400 universities from 60 countries)
        [PDF] [Abstract] [bilibili] [slides]

      Shared Tasks

      [May 2022] HellaSwag Leaderboard on Commonsense Reasoning
        [January 2021] ShARC Leaderboard on Conversational Question Answering
        [September 2020] MuTual Leaderboard on Dialogue Reasoning Challenge
        [July 2019] SQuAD2.0 Leaderboard on Machine Reading Comprehension
        • The best models for both single and ensemble settings among all submissions (2020.01).
        • The first to surpass human benchmark on both EM and F1 scores with a single model (from 2019.07-09).
        • The first time to exceed 90% F1 score with ensemble models.
          [Leaderboard] [Paper] [Report]
        [March 2019] RACE Leaderboard on Machine Reading Comprehension
        [April 2019] SNLI Leaderboard on Language Inference [March 2019] GLUE Leaderboard on Language Understanding
        • The 3rd best among all submissions.
        • The best among all academic submissions.
          [Leaderboard] [Paper]
        [August 2017] Chinese Machine Reading Comprehension (CCL-CMRC 2017)

      Awards & Honors

        • 2024: WAIC Youth Outstanding Paper Award, World Artificial Intelligence Conference.

        • 2024: WAIC YunFan Award: Bright Star, World Artificial Intelligence Conference.

        • 2023: Excellent Doctoral Thesis of Chinese Information Processing Society (CIPS).

        • 2023: Shanghai Outstanding Doctoral Graduate.

        • 2022: Academic Stars of Graduate Students (10 recipients), Shanghai Jiao Tong University.

        • 2021: Global Top 100 Chinese Rising Stars in Artificial Intelligence (Top 10 recommended), Baidu Research.

        • 2021: Baidu Scholarship (10 recipients, worldwide), Baidu.

        • 2020: National Scholarship of China, Ministry of Education of the P.R. China.

        • 2019: Yang Yuanqing Education Fund, The foundation of Class 1988 in CS @ Shanghai Jiao Tong University.

        • 2018: Academic Stars of Graduate Students (The only master student awardee), Shanghai Jiao Tong University.

        • 2016: National Figures Nomination of College Students (20 total recipients), Ministry of Education of the P.R. China.

        • 2015: CCF Elite Collegiate Award, China Computer Federation.

      Academic Service

      Organization: (Senior) Area Chair / Action Editor/ SPC:
        • AAAI 2026
        • ACL Rolling Review
        • NeurIPS 2025
        • EMNLP 2025
        • ACL 2025
        • LREC-COLING 2024
        • IJCAI 2024
        • ICLR 2023 TinyPapers
      Program Committee Member:
        • ML/AI conferences: ICLR, ICML, NeurIPS, AAAI, IJCAI, etc.
        • CL/NLP conferences: ARR, ACL, EMNLP, COLING, NAACL, AACL, NLPCC, CCL, etc.

      Journal Reviewer:
        • Artificial Intelligence, IEEE/ACM TASLP, IEEE TNNLS, IEEE TETCI, IEEE Communications Magazine, ACM TALLIP, ACM TOIS, TMLR, Neurocomputing, Multimedia Systems, Neural Computing and Applications, Expert Systems With Applications.

      Experience

        • Jul. 2022 - Aug. 2023, Amazon Web Services AI, CA, USA.
          Applied Scientist Intern, advised by Dr. Aston Zhang, Mu Li, Alex Smola.
        • Feb. 2022 - June. 2022, Microsoft Cognitive Services Research Group, WA, USA.
          Research Intern, advised by Dr. Shuohang Wang.
        • Mar. 2021 - Dec. 2021, Langboat Tech, Beijing, China.
          Research Intern, advised by Prof. Ming Zhou.
        • Jun. 2019 - Jul. 2020, NICT, Kyoto, Japan.
          Internship Research Fellow, advised by Prof. Rui Wang, Kehai Chen, Masao Utiyama, and Eiichiro Sumita.

      Education

        • Sept. 2020 - Sept. 2023
          Ph.D., Dept. of Computer Science and Engineering, Shanghai Jiao Tong University, advised by Prof. Hai Zhao.
        • Sept. 2016 - Mar. 2020
          M.S., Dept. of Computer Science and Engineering, Shanghai Jiao Tong University, advised by Prof. Hai Zhao.
        • Sept. 2012 - Jun. 2016
          B.S., Dept. of Computer Science and Engineering, Wuhan University, advised by Prof. Haojun Ai.

      Research Team

      PhD Students: Master Students: Alumni: