Hongpeng Guo is a Senior Research Scientist at ByteDance, specializing in large-scale machine learning systems. With a Ph. D. in Computer Science from the University of Illinois Urbana-Champaign, his expertise lies in developing infrastructure for reinforcement learning and foundation model training. His background includes roles at Anyscale, Google, and Jane Street.
He developed "Verl, " an open-source framework designed to make reinforcement learning for large language models more efficient and scalable.