Yu Bao (鲍宇)


Research Scientist @ ByteDance Seed: Multimodal, LLM, AI for Science, and NLP research

About Me

I am currently a Research Scientist at ByteDance Seed, specializing in Large Language Models (LLMs), AI for Science (AI4S), and Natural Language Processing (NLP).

Before that, I earned my Ph.D. in March 2022 from the Natural Language Processing Group of Nanjing University, co-supervised by Prof. Shujian Huang and Prof. Jiajun Chen. During my doctoral studies, I interned at ByteDance AI Lab, mentored by Prof. Hao Zhou and Prof. Lei Li, where I researched deep generative modeling (e.g., non-autoregressive text generation and latent variable modeling).

🔴 We are hiring on a long-term basis! Join us to build cutting-edge AI systems. Open positions include: (1) the Top Seed Program (rolling recruitment of Ph.D. candidates and recent graduates; internship and full-time roles available), reach out via baoyu.001@bytedance.com; or (2) direct applications via our job listings. Feel free to ask about role details anytime.

News

  • [2025.08] 📢 One paper was accepted to EMNLP 2025; our latest work on LLM preference optimization (DuPO) is now available as a preprint.
  • [2025.07] 🚀 Two key projects released: (1) Seed-X, a 7B-parameter multilingual translation LLM with open-sourced models and demos, available on Hugging Face (see also the Demo); (2) Seed LiveInterpret 2.0, an end-to-end simultaneous speech-to-speech translation system with 3-second latency (a 70% reduction over prior solutions) and voice cloning; see the Technical Report and Demo.

Awards

  • 2022, Excellent Doctoral Dissertation Award, Jiangsu Association of Artificial Intelligence (JSAI)
  • 2020, Outstanding Ph.D. Candidate, Nanjing University
  • 2019, Outstanding Graduate Student, Nanjing University

Technical Reports

  1. ByteDance Seed Team, Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice, [Homepage, Demo], 2025
  2. ByteDance Seed Team, Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters, [HF, Demo], 2025

Publications/Preprints

[Full list] [*: equal contribution] [interns/students I mentored]

  1. Shuaijie She, Yu Bao, Yu Lu, Lu Xu, Tao Li, Wenhao Zhu, Shujian Huang, Shanbo Cheng, Lu Lu, Yuxuan Wang, DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization, Preprint 2025
  2. Shimao Zhang, Yu Bao, Shujian Huang, EDT: Improving Large Language Models by Entropy-based Dynamic Temperature Sampling, Preprint 2024
  3. Xiangxin Zhou*, Xiwei Cheng*, Yuwei Yang, Yu Bao, Liang Wang, Quanquan Gu, DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization, ICLR 2024
  4. Jiaqi Guan*, Xiangxin Zhou*, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu, DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design, ICML 2023
  5. Min Liu, Yu Bao, Chengqi Zhao, Shujian Huang, Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation, AAAI 2023
  6. Yu Bao, Hao Zhou, Shujian Huang, Dongqi Wang, Lihua Qian, Xinyu Dai, Jiajun Chen, Lei Li, latent-GLAT: Glancing at Latent Variables for Parallel Text Generation, ACL 2022
  7. Yu Bao, Shujian Huang, Tong Xiao, Dongqi Wang, Xinyu Dai, Jiajun Chen, Non-Autoregressive Translation by Learning Target Categorical Codes, NAACL-HLT 2021
  8. Jiahuan Li*, Yu Bao*, Shujian Huang, Xinyu Dai, Jiajun Chen, Explicit Semantic Decomposition for Definition Generation, ACL 2020
  9. Yu Bao, Hao Zhou, Jiangtao Feng, Mingxuan Wang, Shujian Huang, Jiajun Chen, Lei Li, PNAT: Non-Autoregressive Transformer by Position Learning, Preprint 2019
  10. Yu Bao*, Hao Zhou*, Shujian Huang, Lei Li, Lili Mou, Olga Vechtomova, Xinyu Dai, Jiajun Chen, Generating Sentences from Disentangled Syntactic and Semantic Spaces, ACL 2019