Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way

March 23, 2026

TL;DR

Gimlet Labs secured an $80 million Series A led by Menlo Ventures to address the AI inference bottleneck.
The company's "multi-silicon inference cloud" software allows AI workloads to run concurrently across various hardware, including CPUs, GPUs, and high-memory systems.
This approach aims to make AI workloads up to 10x more efficient by using existing hardware more effectively, potentially saving billions in data center spending.
Gimlet Labs claims to reliably speed up AI inference by 3x to 10x at the same cost and power consumption.
The company has partnered with major chip makers, including NVIDIA, AMD, and Intel, and has already reached eight-figure revenue.
The founding team previously co-founded Pixie, an observability tool acquired by New Relic.
Including its earlier seed round, Gimlet Labs has raised $92 million in total.
