Skip to main content

Summer Deadline: Sunday, March 29 @ 11:59pm PT. Click to apply.

StealthEval: A Probe-Rewrite-Evaluate Workflow for Reliable Benchmarks

StealthEval: A Probe-Rewrite-Evaluate Workflow for Reliable Benchmarks

December 1, 2025

Abstract coming soon. This paper has been accepted but the arXiv preprint is not yet available.

Accepted to Reliable ML @ NeurIPS 2025

Authors: Lang Xiong, Nishant Bhargava, Jeremy Chang, Jianhang Hong

Abstract coming soon. This paper has been accepted but the arXiv preprint is not yet available.

Begin Your Journey

The application takes 10 minutes and is reviewed on a rolling basis. We look for strong technical signal—projects, coursework, or competition results—and a genuine curiosity to do real research.

If admitted, you will join a structured pipeline with direct mentorship to take your work from ideation to top conference submission at venues like NeurIPS, ACL, and EMNLP.