Yu Bao (鲍宇)

Logo

Research Scientist at ByteDance Seed, specializing in research on Large Language Model, AI for Science, and Natural Language Processing

View My GitHub Profile

About Me

I am currently a Research Scientist at ByteDance Seed, specializing in research on Large Language Model (LLM), AI for Science (AI4S), and Natural Language Processing (NLP).

Before this, I completed my Ph.D. at the Natural Language Processing Group of Nanjing University, co-supervised by Prof. Shujian Huang and Prof. Jiajun Chen. During my doctoral studies, I interned at ByteDance AI Lab under the mentorship of Prof. Zhou Hao and Prof. Lei Li, where I laid the groundwork for generative modeling research, including non-autoregressive text generation and latent variable-based parallel text generation.

News

We’re hiring! Whether you’re seeking an internship or a full-time role, join us to build cutting-edge AI systems! Feel free to reach out via baoyu.001@bytedance.com for the Top Seed Program (internships & full-time positions available) or JD sites directly.

Selected Publications/Preprints [Full list]

[name*: equal contributions] [name: interns/students I mentored]

  1. ByteDance Seed Team, Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice, [HomepageDemo] Arxiv 2025
  2. ByteDance Seed Team, Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters, [HFDemo] Arxiv 2025
  3. Xiangxin Zhou*, Xiwei Cheng*, Yuwei Yang, Yu Bao, Liang Wang, Quanquan Gu, DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization ICLR 2024
  4. Jiaqi Guan*, Xiangxin Zhou*, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu, DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design ICML 2023
  5. Min Liu, Yu Bao, Chengqi Zhao, Shujian Huang, Selective Knowledge Distillation for Non-Autoregressive Neural Machine Translation AAAI 2023
  6. Yu Bao, Hao Zhou, Shujian Huang, Dongqi Wang, Lihua Qian, Xinyu Dai, Jiajun Chen, Lei Li, latent-GLAT: Glancing at Latent Variables for Parallel Text Generation ACL 2022
  7. Yu Bao, Shujian Huang, Tong Xiao, Dongqi Wang, Xinyu Dai, Jiajun Chen, Non-Autoregressive Translation by Learning Target Categorical Codes NAACL-HLT 2021
  8. Lihua Qian, Hao Zhou, Yu Bao, Mingxuan Wang, Lin Qiu, Weinan Zhang, Yong Yu, Lei Li, Glancing Transformer for Non-Autoregressive Neural Machine Translation ACL 2021
  9. Jiahuan Li*, Yu Bao*, Shujian Huang, Xinyu Dai, Jiajun Chen, Explicit Semantic Decomposition for Definition Generation ACL 2020
  10. Yu Bao, Hao Zhou, Jiangtao Feng, Mingxuan Wang, Shujian Huang, Jiajun Chen, Lei Li, PNAT: Non-Autoregressive Transformer by Position Learning Arxiv 2019
  11. Yu Bao*, Hao Zhou*, Shujian Huang, Lei Li, Lili Mou, Olga Vechtomova, Xinyu Dai, Jiajun Chen, Generating Sentences from Disentangled Syntactic and Semantic Spaces ACL 2019

Professional Services

Journal Reviewer of

Area Chair of

PC Member/Reviewer of