SF Tensor Launches: Infrastructure for the Era of Large-Scale AI Training ⚡
"Building the future of high-performance compute"
TL;DR: SF Tensor lets AI researchers forget about the infrastructure layer and focus on their research. They automatically optimize kernels to run faster, find the cheapest GPUs across every provider and migrate your jobs when spot instances fail. Training AI should be about AI, not DevOps.
Founded by Ben Koska, Luk Koska & Tom Koska
Three brothers who have been working on artificial intelligence together for years, most recently training their own foundation world models. SF Tensor was born out of their own needs as AI researchers scaling up training runs to thousands of concurrent GPUs.
Ben has been publishing AI research since high school and has solo-trained models across 4,000 GPUs as co-PI on a 6-figure grant.
Tom and Luk (twins, btw) have been doing AI research for years, starting college alongside high school at age 14 and finishing their BSc in CS at age 16.

The Problem
Training AI should mean developing smarter architectures and finding better data. But right now, it doesn’t. Teams waste their time on everything but actual research:
- Optimizing code so that training runs don’t drain the bank
- Fighting cloud providers and scrambling for GPU availability
- Making distributed training work with reasonable MFU (Model FLOPs Utilization, the standard proxy for cost efficiency; see the arithmetic sketch below).
This drives up costs, frustrates everyone, and kills velocity. Infrastructure has quietly become the limiting factor for AI research labs, and it's slowing the whole field down.
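To make the MFU point concrete, here is the back-of-envelope arithmetic; the model size, throughput, and cluster numbers below are illustrative assumptions, not SF Tensor benchmarks:

```python
# Back-of-envelope MFU (Model FLOPs Utilization) estimate.
# All run-specific numbers below are illustrative assumptions.

# Standard approximation: training (forward + backward) costs
# roughly 6 * parameters * tokens FLOPs.
params = 7e9                   # 7B-parameter model
tokens_per_second = 250_000    # observed training throughput (hypothetical)
achieved_flops = 6 * params * tokens_per_second

# Peak hardware throughput of the cluster.
num_gpus = 64
peak_flops_per_gpu = 989e12    # NVIDIA H100 SXM, dense BF16 peak
cluster_peak_flops = num_gpus * peak_flops_per_gpu

mfu = achieved_flops / cluster_peak_flops
print(f"MFU: {mfu:.1%}")       # ~16.6% -- the rest is paid-for compute doing nothing
```

Well-optimized large training runs commonly report MFU in the 30–50% range; naive multi-node setups often land far below that, and the gap is pure money.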
They experienced this first-hand developing their own foundation models: what they expected to be AI research, experimentation, and iterative improvement turned out to be an ugly mix of writing CUDA, debugging driver mismatches, and optimizing inter-GPU collective operations. That's why they decided to solve the infrastructure layer, so other researchers can focus on research, not infrastructure.
Their Solution
SF Tensor is the "set it and forget it" infrastructure layer for anyone training or fine-tuning AI models. Hook up your repo, pick your GPU count and budget, and they deal with the rest (sketched in code after this list):
- Their automatic kernel optimizer analyzes your architecture and tunes execution for any hardware (NVIDIA, AMD or TPUs). No more having to drop down into custom CUDA because PyTorch doesn’t understand memory topology.
- They find the cheapest available compute across all clouds for your specific requirements and launch your training run.
- Automatic distributed training lets you scale from 1 to 10,000 GPUs without changing your code or killing your MFU.
- Everything else that you shouldn’t have to think about: Spot instance migration? Handled. Monitoring? Baked in. Logs and artifacts? Done.
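For illustration, here is roughly what that workflow could look like; SF Tensor's actual SDK isn't shown in this post, so the `sftensor` module and every name and parameter below are invented for this sketch:

```python
# Hypothetical sketch only -- the "sftensor" package, its functions,
# and its parameters are invented here; the real API may differ.
import sftensor

job = sftensor.launch(
    repo="github.com/my-lab/world-model",  # your training repo (hypothetical)
    entrypoint="train.py",
    gpus=256,            # target GPU count
    budget_usd=5_000,    # hard spend cap across all providers
    allow_spot=True,     # cheap spot capacity; migration on preemption is handled
)

# Logs, metrics, and artifacts are captured by the platform,
# so you stream status instead of babysitting the run.
for line in job.stream_logs():
    print(line)
```

The point is the shape of the interface: declare the run and the budget once, and placement, preemption recovery, and monitoring stop being your problem.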
Compute should be boring. Let’s make it boring.
Try them right now at sf-tensor.com or contact them at hello@sf-tensor.com to see how they can help with your infra pains.

Learn More
🌐 Visit sf-tensor.com to learn more.
👉 Ask: Know anyone training or fine-tuning AI models? They would be grateful for an intro! Reach out to the founders here.
⭐ Give SF Tensor a star on GitHub.
👣 Follow SF Tensor on LinkedIn & X.