Thomas Zhu

Hi! I am Thomas (Hanwen) Zhu. I am a research scientist at ByteDance Seed working on Seed-Prover.

I obtained MS in Machine Learning student at CMU, working on applying language models to formal theorem proving. I am honored to have been advised by Prof. Sean Welleck and Prof. Jeremy Avigad. I developed miniCTX, LeanHammer, and LeanArchitect at CMU.

I worked on Seed-Prover (V1), which solved 5/6 problems on IMO 2025 and set new state-of-the-art scores on all Lean benchmarks.

I received BA Mathematics and Computer Science from Oxford, where I graduated top first. I had the privilege to work with Ruining Li and Tomas Jakab at VGG in applying diffusion models to 3D generation of human-object interactions. I also worked at OxAI in developing a benchmark for gender bias in large vision-language models.

I was a 2026 Siebel Scholar and I received the highest 2024 Gibbs Prize in Computer Science at Oxford.

Please contact me at [email protected].

Publications

LeanArchitect: Automating Blueprint Generation for Humans and AI
Thomas Zhu, Pietro Monticone, Jeremy Avigad, Sean Welleck
Paper GitHub

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience
ByteDance Seed
Paper GitHub

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving
ByteDance Seed
Paper News GitHub

Premise Selection for a Lean Hammer
Thomas Zhu*, Joshua Clune*, Jeremy Avigad†, Albert Q. Jiang†, Sean Welleck†
Website Paper GitHub
In preprint

miniCTX: Neural Theorem Proving with (Long-)Contexts
Jiewen Hu, Thomas Zhu, Sean Welleck
Website Paper Dataset
ICLR 2025 Oral (Top 1.8%)

DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors
Thomas Zhu, Ruining Li*, Tomas Jakab*
Website Paper GitHub
ICCV HiGen Workshop 2025 Oral

VisoGender: A dataset for benchmarking gender bias in image-text pronoun resolution
Siobhan Mackenzie Hall, Fernanda Gonçalves Abrantes, Thomas Hanwen Zhu, Grace Sodunke, Aleksandar Shtedritski, Hannah Rose Kirk
Paper GitHub
NeurIPS Datasets and Benchmarks 2023

Blogs

May 4, 2025: DeepSeek-Prover-V2 Learned to Cheat.