Publications
Selected Works
Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training
Junkai Zhang*, Zihao Wang*, Lin Gui*, Swarnashree Mysore Sathyendra, Jaehwan Jeong, Victor Veitch, Wei Wang, Yunzhong He, Bing Liu, Lifeng Jin
ICLR 2026
Paper
Code
Dataset
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
Yihe Deng*, Yu Yang*, Junkai Zhang*, Wei Wang, Bo Li
AISTATS 2026
Paper
Code
Model
Reinforcement Learning
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
Junkai Zhang*, Weitong Zhang*, Dongruo Zhou, Quanquan Gu
ICML 2024
Paper
Code
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
Junkai Zhang, Weitong Zhang, Quanquan Gu
ICML 2023
Paper
Optimization Methods
Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Zixiang Chen*, Junkai Zhang*, Yiwen Kou, Xiangning Chen, Cho-Jui Hsieh, Quanquan Gu
NeurIPS 2023
Paper
AI for Science
MatSciBench: Benchmarking the Reasoning Ability of LLM in Material Science
Junkai Zhang*, Jingru Gan*, Zian Jia, Changquan Gu, Jianpeng Chen, Xiaoxuan Wang, Yanqiao Zhu, Mingyu Derek Ma, Dawei Zhou, Ling Li, Wei Wang
In Submission
Paper
Code
Dataset
MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design
Jingyuan Qi, Zian Jia, Minqian Liu, Wangzhi Zhan, Junkai Zhang, Xiaofei Wen, Jingru Gan, Jianpeng Chen, Qin Liu, Mingyu Derek Ma, Bangzheng Li, Haohui Wang, Adithya Kulkarni, Muhao Chen, Dawei Zhou, Ling Li, Wei Wang, Lifu Huang
NAACL 2025 Demo
Paper
Demo
Protein Large Language Models: A Comprehensive Survey
Yijia Xiao, Wanjia Zhao, Junkai Zhang, Yiqiao Jin, Han Zhang, Zhicheng Ren, Renliang Sun, Haixin Wang, Guancheng Wan, Pan Lu, Xiao Luo, Yu Zhang, James Zou, Yizhou Sun, Wei Wang
EMNLP 2025 Findings
Paper
MetamatBench: Integrating Heterogeneous Data, Computational Tools, and Visual Interface for Metamaterial Discovery
Jianpeng Chen, Wangzhi Zhan, Haohui Wang, Zian Jia, Jingru Gan, Junkai Zhang, Jingyuan Qi, Tingwei Chen, Lifu Huang, Muhao Chen, Ling Li, Wei Wang, Dawei Zhou
KDD 2025 Datasets and Benchmarks
Paper
Code
Neural network-assisted personalized handwriting analysis for Parkinson's disease diagnostics
Guorui Chen, Trinny Tat, Yihao Zhou, Zhaoqi Duan, Junkai Zhang, Kamryn Scott, Xun Zhao, Zeyang Liu, Wei Wang, Song Li, Katy A. Cross, Jun Chen
Nature Chemical Engineering
Paper
Self-powered in-stent restenosis diagnosis via magnetoelastic stents
Guorui Chen, Wi Jin Kim, Youcheng Yang, Yan-Ruide Li, Jing Tian, Junkai Zhang, Xun Zhao, Kamryn Scott, Lily G. Defelice, Zeyang Liu, Jing Xu, Tzuchun Chung, Jarod Carol, Yihao Zhou, Anthony C. Wang, Olujimi A. Ajijola, Paul S. Weiss, Wei Wang, Song Li, Geoffrey P. Colby, Jun Chen
Nature Cardiovascular Research
Paper
Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems
Zijie Huang*, Jeehyun Hwang*, Junkai Zhang*, Jinwoo Baik, Weitong Zhang, Quanquan Gu, Dominik Wodarz, Yizhou Sun, Wei Wang
WWW 2024
Paper
Generative Models
Fast Sampling via De-randomization for Discrete Diffusion Models
Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu
NeurIPS 2024
Paper
Code