I am a third year M.S. student at SIGS, Tsinghua University and I’m very fortune to be advised by Prof. Xueqian Wang. Before that, I received my bachelor’s degree in Electrical Engineering and Automation from Xi’an Jiaotong University in Jun. 2022. My currect research interest lies in RL and LLM, especially in alignment with AI feedback and AI safety.

Currently, I’m an intern at Tencent AI Lab, AI for Science Center, supervised by Peilin Zhao.

Publications & Preprints

Probing the Safety Response Boundary of Large Language Models via Unsafe Decoding Path Generation Probing the Safety Response Boundary of Large Language Models via
Unsafe Decoding Path Generation

Haoyu Wang, Bingzhe Wu, Yatao Bian, Yongzhe Chang, Xueqian Wang, Peilin Zhao
Under Review
Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping
Haoyu Wang, Guozheng Ma, Ziqiao Meng, Zeyu Qin, Li Shen, Zhong Zhang, Bingzhe Wu, Liu Liu, Yatao Bian, Tingyang Xu, Xueqian Wang, Peilin Zhao
MHFAIA Workshop at ICML 2024 & Under Review
Are Large Language Models Really Robust to Word-Level Perturbations? Are Large Language Models Really Robust to Word-Level Perturbations?
Haoyu Wang, Guozheng Ma, Cong Yu, Ning Gui, Linrui Zhang, Zhiqi Huang, Suwei Ma, Yongzhe Chang, Sen Zhang, Li Shen, Xueqian Wang, Peilin Zhao, Dacheng Tao
SoLaR Workshop at NeurIPS 2023 & Under Review
Learning better with less: effective augmentation for sample-efficient visual reinforcement learning Learning better with less: effective augmentation for sample-efficient
visual reinforcement learning

Guozheng Ma, Linrui Zhang, Haoyu Wang, Lu Li, Zilin Wang, Zhen Wang, Li Shen, Xueqian Wang, Dacheng Tao
Advances in Neural Information Processing Systems 2023

Education

·2022.09 - Now, Tsinghua University, Master, Big Data Program

·2018.09 - 2022.06, Xi’an Jiaotong University, Bachelor, Electrical Engineering and Automation

Internship

·2023.09 - Now, Tencent AI Lab, AI for Science Center, supervised by Peilin Zhao