AI as Fast as Your Train of Thought

February 15, 2026

TL;DR

OpenAI's Codex-Spark model is optimized for speed, generating ~1,000 tokens per second.
It runs on Cerebras hardware, marking OpenAI's first use of non-Nvidia processors.
Spark is less capable on complex reasoning tasks compared to models like GPT-5.3 Codex and Opus 4.6.
Its speed is beneficial for tasks where maintaining user flow is critical, such as brainstorming and rapid iteration.
The model's rapid output may require new UI solutions to manage the volume of information.
Spark is currently available to Pro subscribers in the Codex app and command line interface, with limited API access.

Continue reading
the original article