GPT-4o mini: advancing cost-efficient intelligence

December 8, 2025

TL;DR

GPT-4o mini is OpenAI's most cost-efficient small model, aiming to make AI broadly accessible.
It scores 82% on MMLU and outperforms GPT-4 on chat preferences, and it is priced at 15 cents per 1M input tokens and 60 cents per 1M output tokens.
The model supports text and vision in the API, with support for image, video, and audio planned for the future, and it has a 128K-token context window.
GPT-4o mini excels in reasoning, math, coding, and multimodal tasks, outperforming models like Gemini Flash and Claude Haiku.
Safety measures are built in from pre-training through post-training alignment, including the instruction hierarchy method, which improves resistance to prompt manipulation.
It is available in the Assistants API, Chat Completions API, and Batch API, and for Free, Plus, and Team users of ChatGPT, replacing GPT-3.5.
OpenAI is committed to reducing AI model costs while enhancing capabilities; GPT-4o mini's cost per token is 99% lower than that of text-davinci-003.
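
To make the pricing concrete, here is a minimal sketch that estimates the cost of a single request from the two per-token prices quoted above. The function name `estimate_cost_usd` and the constant names are hypothetical, chosen for illustration; the sketch assumes flat per-token pricing and ignores any discounts or other billing details.

```python
from decimal import Decimal

# Prices quoted in the summary, in US dollars per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.15")
OUTPUT_USD_PER_MILLION = Decimal("0.60")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming flat per-token pricing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MILLION
        + Decimal(output_tokens) * OUTPUT_USD_PER_MILLION
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # A prompt of 128,000 tokens (the size of the full context window)
    # with a 1,000-token reply.
    print(estimate_cost_usd(128_000, 1_000))
```

Under these assumptions, a prompt that fills the 128K-token context window costs about 2 cents in input tokens. Decimal arithmetic is used instead of floats so that small per-request amounts add up exactly.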
