tech

December 8, 2025

GPT-4o mini: advancing cost-efficient intelligence

OpenAI is committed to making intelligence as broadly accessible as possible. Today, we're announcing GPT‑4o mini, our most cost-efficient small model. We expect GPT‑4o mini will significantly expand the range of applications built with AI by making intelligence much more affordable. GPT‑4o mini scores 82% on MMLU and currently outperforms GPT‑41 on chat preferences in LMSYS leaderboard⁠(opens in a new window). It is priced at 15 cents per million input tokens and 60 cents per million output tokens, an order of magnitude more affordable than previous frontier models and more than 60% cheaper than GPT‑3.5 Turbo.

GPT-4o mini: advancing cost-efficient intelligence

TL;DR

  • GPT-4o mini is OpenAI's most cost-efficient small model, aiming to make AI broadly accessible.
  • It scores 82% on MMLU and outperforms GPT-4 on chat preferences, with a pricing of 15 cents/1M input tokens and 60 cents/1M output tokens.
  • The model supports text and vision in the API, with future support for image, video, and audio, and has a 128K token context window.
  • GPT-4o mini excels in reasoning, math, coding, and multimodal tasks, outperforming models like Gemini Flash and Claude Haiku.
  • Safety features are integrated from pre-training through post-training alignment techniques, including instruction hierarchy for enhanced resistance to prompt manipulation.
  • It is available in the Assistants API, Chat Completions API, and Batch API, and for Free, Plus, and Team users of ChatGPT, replacing GPT-3.5.
  • OpenAI is committed to reducing AI model costs while enhancing capabilities, with GPT-4o mini showing a 99% cost reduction since text-davinci-003.

Continue reading
the original article

Made withNostr