Junkai Zhang

Junkai Zhang 张峻恺

CS PhD Candidate @ UCLA

About

I'm a fourth-year Ph.D. candidate in Computer Science at UCLA, advised by Professor Wei Wang. I am broadly interested in Large Language Model post-training. I received my B.S. in Mathematics from Tsinghua University in 2022, with a minor in Philosophy.

Education

UCLA

Ph.D. in Computer Science

Tsinghua University

B.S. in Mathematics, Minor in Philosophy

Experience

Google

Student Researcher, Playa Vista, CA

Scale AI

Post-training Research Intern, San Francisco, CA

Publications

Selected Works

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

Junkai Zhang*, Zihao Wang*, Lin Gui*, Swarnashree Mysore Sathyendra, Jaehwan Jeong, Victor Veitch, Wei Wang, Yunzhong He, Bing Liu, Lifeng Jin

ICLR 2026

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

Yihe Deng*, Yu Yang*, Junkai Zhang*, Wei Wang, Bo Li

AISTATS 2026

Reinforcement Learning

Uncertainty-Aware Reward-Free Exploration with General Function Approximation

Junkai Zhang*, Weitong Zhang*, Dongruo Zhou, Quanquan Gu

ICML 2024

Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs

Junkai Zhang, Weitong Zhang, Quanquan Gu

ICML 2023

Optimization Methods

Why Does Sharpness-Aware Minimization Generalize Better Than SGD?

Zixiang Chen*, Junkai Zhang*, Yiwen Kou, Xiangning Chen, Cho-Jui Hsieh, Quanquan Gu

NeurIPS 2023

AI for Science

MatSciBench: Benchmarking the Reasoning Ability of LLM in Material Science

Junkai Zhang*, Jingru Gan*, Zian Jia, Changquan Gu, Jianpeng Chen, Xiaoxuan Wang, Yanqiao Zhu, Mingyu Derek Ma, Dawei Zhou, Ling Li, Wei Wang

In Submission

MetaScientist: A Human-AI Synergistic Framework for Automated Mechanical Metamaterial Design

Jingyuan Qi, Zian Jia, Minqian Liu, Wangzhi Zhan, Junkai Zhang, Xiaofei Wen, Jingru Gan, Jianpeng Chen, Qin Liu, Mingyu Derek Ma, Bangzheng Li, Haohui Wang, Adithya Kulkarni, Muhao Chen, Dawei Zhou, Ling Li, Wei Wang, Lifu Huang

NAACL 2025 Demo

Protein Large Language Models: A Comprehensive Survey

Yijia Xiao, Wanjia Zhao, Junkai Zhang, Yiqiao Jin, Han Zhang, Zhicheng Ren, Renliang Sun, Haixin Wang, Guancheng Wan, Pan Lu, Xiao Luo, Yu Zhang, James Zou, Yizhou Sun, Wei Wang

EMNLP 2025 Findings

MetamatBench: Integrating Heterogeneous Data, Computational Tools, and Visual Interface for Metamaterial Discovery

Jianpeng Chen, Wangzhi Zhan, Haohui Wang, Zian Jia, Jingru Gan, Junkai Zhang, Jingyuan Qi, Tingwei Chen, Lifu Huang, Muhao Chen, Ling Li, Wei Wang, Dawei Zhou

KDD 2025 Datasets and Benchmarks

Neural network-assisted personalized handwriting analysis for Parkinson's disease diagnostics

Guorui Chen, Trinny Tat, Yihao Zhou, Zhaoqi Duan, Junkai Zhang, Kamryn Scott, Xun Zhao, Zeyang Liu, Wei Wang, Song Li, Katy A. Cross, Jun Chen

Nature Chemical Engineering

Self-powered in-stent restenosis diagnosis via magnetoelastic stents

Guorui Chen, Wi Jin Kim, Youcheng Yang, Yan-Ruide Li, Jing Tian, Junkai Zhang, Xun Zhao, Kamryn Scott, Lily G. Defelice, Zeyang Liu, Jing Xu, Tzuchun Chung, Jarod Carol, Yihao Zhou, Anthony C. Wang, Olujimi A. Ajijola, Paul S. Weiss, Wei Wang, Song Li, Geoffrey P. Colby, Jun Chen

Nature Cardiovascular Research

Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems

Zijie Huang*, Jeehyun Hwang*, Junkai Zhang*, Jinwoo Baik, Weitong Zhang, Quanquan Gu, Dominik Wodarz, Yizhou Sun, Wei Wang

WWW 2024

Generative Models

Fast Sampling via De-randomization for Discrete Diffusion Models

Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu

NeurIPS 2024

Teaching

Teaching Assistant, UCLA

CS 31/32 (Intro to CS I, II), CS 180 (Algorithms and Complexity)

Services

Conference Reviewer: ICML [2025], ICLR [2024–2026], NeurIPS [2025], NAACL [2025], AAAI [2024–2025], AISTATS [2024]