tech

March 23, 2026

Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way

Gimlet Labs just raised an $80 million Series A for tech that lets AI run across NVIDIA, AMD, Intel, ARM, Cerebras and d-Matrix chips, simultaneously.

Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way

TL;DR

  • Gimlet Labs secured an $80 million Series A led by Menlo Ventures to address the AI inference bottleneck.
  • The company's "multi-silicon inference cloud" software allows AI workloads to run concurrently across various hardware, including CPUs, GPUs, and high-memory systems.
  • This approach aims to make AI workloads 10x more efficient and utilize existing hardware more effectively, potentially saving billions in data center spending.
  • Gimlet Labs claims to reliably speed up AI inference by 3x to 10x at the same cost and power consumption.
  • The company has partnered with major chip makers like NVIDIA, AMD, and Intel, and has already achieved eight-figure revenues.
  • The founding team previously co-founded Pixie, an observability tool acquired by New Relic.
  • With the previous seed funding, Gimlet Labs has raised a total of $92 million.

Continue reading the original article

Made withNostr