🎧 Startup Growth Podcast, Ep. 25: Jay Ram | Beyond Evals: Build Environments That Make Agents Better
Jay Ram is Founder & CEO of Hud, the evaluation and RL platform for AI agents. Hud helps startups build RL environments, run fast reward loops, and plug into any RL backend—so teams can cut costs and push last-mile accuracy once they've hit PMF. Before Hud, Jay left a lucrative quant career, shipped an AI prank-calling app that briefly hit #1 on the App Store (≈500k calls), and decided he wanted harder problems and smarter customers. He's a YC W25 alum; Hud is already used by researchers at foundation labs and is expanding into enterprise environments.
Jay's catalyst was realizing he didn't want to just talk weekends—he wanted to build. He and his co-founders first tackled computer-use evals for labs. Inside that work, the language shifted: labs asking for "evals" really needed environments—places where you design rewards, iterate, and actually improve model behavior. Today, Jay frames Hud as the "Next.js of RL environments": opinionated lifecycle, backend-agnostic training, and infra that returns signal fast. Early on, use a foundation model; post-PMF, train your own with SFT/RL—that's where environments matter. Looking ahead, he sees post-training speciation: domain-tuned models for finance, accounting, creative tooling, and more—because teams will own more of their stack again.
Key Topics Covered:
· What Hud is: tools to set up your agent for RL, define tasks, shape rewards, and plug into RFT/other RL backends.
· From evals to environments: why scores measure but rewards improve—and how iteration loops change outcomes.
· Where it fits: use foundation models early; post-PMF train your own for cost leverage + last-mile gains.
· Design + infra: a new category needs opinionated UX and fast results; why lab researchers use Hud for computer-use evals.
· Market timing: the "DeepSeek moment" pulled RL from hobbyists into enterprise interest in 2025.
· Pre-train vs post-train: scale vs accuracy + domain depth—and why post-training is the real edge.
· Future of work: enterprises will own more of the stack; model speciation by domain.
· Reality check: agents ace toy DBs, struggle in production; modeling real environments is the unlock.
· YC W25 arc: vision matched the original app more than mid-batch; enterprise demand is catching up now.
· Finance stack aside: keep ops boring; focus cycles on shipping product
Chapters:
(00:15) Cold open — "We give you all the tools to set up your agent for RL."
(00:59) Intro — Jay Ram, Hud, and the origin story
(01:41) What Hud does — build RL environments; backend-agnostic (OpenAI RFT, Thinking Machines, etc.)
(02:12) Where environments fit — early: foundation models; post-PMF: train for cost + accuracy
(02:50) From quant to builder — leaving Wall Street to make things
(03:30) The prank-calling app — #1 on App Store; ≈500k calls; why the customers weren't it
(04:40) Evals → environments — labs' "eval" asks were really RL environments with rewards
(05:40) Evals vs RL — scores vs rewarded steps; how updates happen
(07:14) Hard parts — opinionated design + infra speed for researchers and teams
(08:08) Before Hud — no toolkit/standards; emerging gymnasium-style efforts vs Hud's opinionated path
(09:25) YC W25 — applying, partners (Aaron & Matt), why YC felt like "actual college"
(11:05) Vision vs timing — market caught up; enterprises now exploring environments
(12:20) Trend — teams rolling their own models post-PMF (SFT/RL)
(13:01) Today's fragmented stack — hosting, inference, data; Hud's role in the loop
(13:48) The "DeepSeek moment" — hobbyist RL → enterprise interest in 2025
(15:57) Future of agents — own the stack, post-training speciation
(18:26) Why end-to-end is hard — production data systems need real environments
(19:29) Forward-deployed labs — domain hires and environments; how Hud plugs into RFT
(20:15) Rapid wrap — it's early; the stack is shifting fast
Where to find Jay Ram:
X: @jayendra_ram
LinkedIn: www.linkedin.com/in/jay-ram-29003b198/
Website: hud.ai
Where to find Hud:
X: @hud_evals
Website: hud.ai
Where to find David Phillips:
X: @davj
LinkedIn: linkedin.com/in/davjphillips
Brought to you by:
Fondo — All-in-one accounting for startups: fondo.com
Need help with the upcoming tax deadline?
Take the stress out of bookkeeping, taxes, and tax credits with Fondo’s all-in-one accounting platform built for startups. Start saving time and money with our expert-backed solutions.
.png)
.png)
.png)
.png)
.png)










.png)

