Skip to main content

Spring Deadline: Sunday, March 1 @ 11:59pm PT. Click here to apply.

Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization

Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization

December 1, 2025

Current test-time optimization methods require 10-100x more compute per query than standard decoding. We propose Amortized Latent Steering (ALS), which collapses iterative test-time optimization into ...

Accepted to ER @ NeurIPS 2025

Authors: Nathan Egbuna, Saatvik Gaur

Current test-time optimization methods require 10-100x more compute per query than standard decoding. We propose Amortized Latent Steering (ALS), which collapses iterative test-time optimization into a single offline-computed vector applied at constant cost during inference. ALS computes the mean difference between hidden states from successful versus unsuccessful generations, then uses this direction to calibrate the model hidden representations. Across GSM8K and MATH-500 benchmarks, ALS achieves 2-5x speedup over iterative methods while matching or surpassing greedy Chain-of-Thought and Self-Consistency baselines, yielding up to 101% improvement in efficiency-accuracy trade-off.

Begin Your Journey

The application takes 10 minutes and is reviewed on a rolling basis. We look for strong technical signal—projects, coursework, or competition results—and a genuine curiosity to do real research.

If admitted, you will join a structured pipeline with direct mentorship to take your work from ideation to top conference submission at venues like NeurIPS, ACL, and EMNLP.