tech
March 23, 2026
Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way
Gimlet Labs just raised an $80 million Series A for tech that lets AI run across NVIDIA, AMD, Intel, ARM, Cerebras and d-Matrix chips, simultaneously.

TL;DR
- Gimlet Labs secured an $80 million Series A led by Menlo Ventures to address the AI inference bottleneck.
- The company's "multi-silicon inference cloud" software allows AI workloads to run concurrently across various hardware, including CPUs, GPUs, and high-memory systems.
- This approach aims to make AI workloads 10x more efficient and utilize existing hardware more effectively, potentially saving billions in data center spending.
- Gimlet Labs claims to reliably speed up AI inference by 3x to 10x at the same cost and power consumption.
- The company has partnered with major chip makers like NVIDIA, AMD, and Intel, and has already achieved eight-figure revenues.
- The founding team previously co-founded Pixie, an observability tool acquired by New Relic.
- With the previous seed funding, Gimlet Labs has raised a total of $92 million.
Continue reading the original article