news | Haowei Lin

Jul 10, 2026	Excited to share Hyra (Hunyuan Research Agent), an autonomous research agent I lead at Tencent Hunyuan. Hyra sets new state-of-the-art results on AI4AI, AI4Science, mathematics, engineering, and creative design (see results).
Jul 07, 2026	Two new benchmarks in the Terminal-Bench series are out: Harbor-Index, a challenging, diverse, and high-quality benchmark for evaluating frontier agents, which I co-lead; and Frontier-Bench (formerly Terminal-Bench 3.0), for which I served as a task contributor and reviewer.
Dec 07, 2025	Glad to launch a new blog on Scaling Law Discovery (SLD) (paper). We hope our work on SLD helps advance foundation model development and push the boundaries of AI Scientist. Code, dataset, benchmarks, and leaderboard are all publicly available.
Oct 21, 2025	Excited to co-lead adapters in Terminal-Bench & Harbor, which converts all agentic benchmarks into a unified format for T-Bench.
Oct 20, 2025	Our paper on AI for scientific discovery was published in Nature Machine Intelligence as a cover paper!