Xiangxin Zhou

I am a fourth-year Ph.D. student at University of Chinese Academy of Sciences (UCAS) and Institute of Automation, Chinese Academy of Sciences (CASIA), advisded by Liang Wang.

Previously, I received my B.Eng. degree from Tsinghua University in 2021.

My research interests include LLM reasoning and AI for Drug Discovery (AIDD).


Education
  • Institute of Automation, Chinese Academy of Sciences
    Institute of Automation, Chinese Academy of Sciences
  • University of Chinese Academy of Sciences
    University of Chinese Academy of Sciences
    School of Artificial Intelligence
    Ph.D. Student
    Sep. 2021 - present
  • Tsinghua University
    Tsinghua University
    B.Eng. in Electronic Engineering
    Sep. 2016 - Jul. 2021
Experience
  • Xiaohongshu Hi Lab
    Xiaohongshu Hi Lab
    RedStar Intern
    Aug. 2025 - Present
  • ByteDance Seed
    ByteDance Seed
    Research Intern
    May. 2025 - Jul. 2025
  • ByteDance AI Lab
    ByteDance AI Lab
    Research Intern
    May. 2023 - May. 2025
  • ByteDance AML
    ByteDance AML
    Research Intern
    Sep. 2022 - May. 2023
Selected Publications (view all )
Reinforcing General Reasoning without Verifiers
Reinforcing General Reasoning without Verifiers

Xiangxin Zhou*, Zichen Liu*, Anya Sims*, Haonan Wang, Tianyu Pang, Chongxuan Li, Liang Wang, Min Lin, Chao Du (* equal contribution)

Preprint. 2025

VeriFree is a verifier-free method that bypasses answer verification and instead uses RL to directly maximize the probability of generating the reference answer.

Reinforcing General Reasoning without Verifiers

Xiangxin Zhou*, Zichen Liu*, Anya Sims*, Haonan Wang, Tianyu Pang, Chongxuan Li, Liang Wang, Min Lin, Chao Du (* equal contribution)

Preprint. 2025

VeriFree is a verifier-free method that bypasses answer verification and instead uses RL to directly maximize the probability of generating the reference answer.

Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling
Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling

Xiangxin Zhou*, Mingyu Li*, Yi Xiao, Jiahan Li, Dongyu Xue, Zaixiang Zheng, Jianzhu Ma, Quanquan Gu (* equal contribution)

International Conference on Machine Learning (ICML) 2025

CpSDE is a generative algorithm capable of generating diverse types of cyclic peptides given 3D receptor structures.

Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling

Xiangxin Zhou*, Mingyu Li*, Yi Xiao, Jiahan Li, Dongyu Xue, Zaixiang Zheng, Jianzhu Ma, Quanquan Gu (* equal contribution)

International Conference on Machine Learning (ICML) 2025

CpSDE is a generative algorithm capable of generating diverse types of cyclic peptides given 3D receptor structures.

Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

Xiangxin Zhou*, Yi Xiao*, Haowei Lin, Xinheng He, Jiaqi Guan, Yang Wang, Qiang Liu, Feng Zhou, Liang Wang, Jianzhu Ma (* equal contribution)

International Conference on Learning Representations (ICLR) 2025

DynamicFlow is a full-atom (stochastic) flow model that learns to transform apo pockets and noisy ligands into holo pockets and corresponding 3D ligand molecules.

Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

Xiangxin Zhou*, Yi Xiao*, Haowei Lin, Xinheng He, Jiaqi Guan, Yang Wang, Qiang Liu, Feng Zhou, Liang Wang, Jianzhu Ma (* equal contribution)

International Conference on Learning Representations (ICLR) 2025

DynamicFlow is a full-atom (stochastic) flow model that learns to transform apo pockets and noisy ligands into holo pockets and corresponding 3D ligand molecules.

Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design
Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design

Xiangxin Zhou, Jiaqi Guan, Yijia Zhang, Xingang Peng, Liang Wang, Jianzhu Ma

Conference on Neural Information Processing Systems (NeurIPS) 2024

DualDiff generates dual-target ligand molecules via compositional sampling based on single-target diffusion models.

Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design

Xiangxin Zhou, Jiaqi Guan, Yijia Zhang, Xingang Peng, Liang Wang, Jianzhu Ma

Conference on Neural Information Processing Systems (NeurIPS) 2024

DualDiff generates dual-target ligand molecules via compositional sampling based on single-target diffusion models.

Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization

Xiangxin Zhou*, Dongyu Xue*, Ruizhe Chen*, Zaixiang Zheng, Liang Wang, Quanquan Gu (* equal contribution)

Conference on Neural Information Processing Systems (NeurIPS) 2024

Direct energy-based preference optimzation guides the generation of antibodies with both rational structures and considerable binding affinities to given antigens.

Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization

Xiangxin Zhou*, Dongyu Xue*, Ruizhe Chen*, Zaixiang Zheng, Liang Wang, Quanquan Gu (* equal contribution)

Conference on Neural Information Processing Systems (NeurIPS) 2024

Direct energy-based preference optimzation guides the generation of antibodies with both rational structures and considerable binding affinities to given antigens.

Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process
Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process

Xiangxin Zhou, Liang Wang, Yichi Zhou

International Conference on Machine Learning (ICML) 2024

Policy gradients in data-scarce regions are ill-defined, leading to instability. Consistency ensured via score matching allows us to correctly estimate the policy gradients with sufficient data that can be efficiently sampled from the forward SDE (i.e., perturbation).

Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process

Xiangxin Zhou, Liang Wang, Yichi Zhou

International Conference on Machine Learning (ICML) 2024

Policy gradients in data-scarce regions are ill-defined, leading to instability. Consistency ensured via score matching allows us to correctly estimate the policy gradients with sufficient data that can be efficiently sampled from the forward SDE (i.e., perturbation).

Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization
Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization

Xiangxin Zhou*, Xiwei Cheng*, Yuwei Yang, Yu Bao, Liang Wang, Quanquan Gu (* equal contribution)

International Conference on Learning Representations (ICLR) 2024

DecompOpt a structure-based molecular optimization method based on a controllable and decomposed diffusion model.

Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization

Xiangxin Zhou*, Xiwei Cheng*, Yuwei Yang, Yu Bao, Liang Wang, Quanquan Gu (* equal contribution)

International Conference on Learning Representations (ICLR) 2024

DecompOpt a structure-based molecular optimization method based on a controllable and decomposed diffusion model.

DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design
DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design

Jiaqi Guan*, Xiangxin Zhou*#, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu# (* equal contribution, # corresponding author)

International Conference on Machine Learning (ICML) 2023

DecompDiff is a diffusion model for SBDD with decomposed priors over arms and scaffold, equipped with bond diffusion and additional validity guidance.

DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design

Jiaqi Guan*, Xiangxin Zhou*#, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu# (* equal contribution, # corresponding author)

International Conference on Machine Learning (ICML) 2023

DecompDiff is a diffusion model for SBDD with decomposed priors over arms and scaffold, equipped with bond diffusion and additional validity guidance.

All publications