OpenAI's Flex Processing: Affordable AI with Reduced Speed

04/18/2025 Artificial Intelligence

OpenAI is introducing a new API option called Flex processing, designed to offer more competitive pricing for AI model usage. This move appears to be a direct response to increasing competition from companies like Google, which are also focusing on more cost-effective AI solutions.

Flex processing is currently in beta and available for OpenAI's o3 and o4-mini reasoning models. The key trade-off is reduced cost in exchange for slower response times and potential resource unavailability. This makes it suitable for tasks that aren't time-sensitive or critical to production.

Specifically, OpenAI is targeting lower-priority and non-production tasks like model evaluations, data enrichment, and asynchronous workloads with Flex processing. By accepting slower processing, users can significantly reduce their API expenses.

Cost Savings with Flex Processing

The cost reduction is substantial. For the o3 model, Flex processing is priced at $5 per million input tokens and $20 per million output tokens. This is half the price of the standard rates, which are $10 and $40 respectively. Similarly, for the o4-mini model, the price drops from $1.10 to $0.55 per million input tokens and from $4.40 to $2.20 per million output tokens.

This pricing strategy is especially relevant as the cost of advanced AI continues to rise. Other companies, like Google, are releasing more efficient and budget-friendly models, like the Gemini 2.5 Flash, pushing OpenAI to find new ways to offer affordable AI solutions.

ID Verification for Tiered Access

In addition to Flex processing, OpenAI has announced that developers in tiers 1-3 of their usage tier system will need to complete ID verification to access the o3 model. These tiers are determined by the amount spent on OpenAI services. Features such as reasoning summaries and streaming API support are also gated behind this verification process.

OpenAI has stated that the ID verification process is intended to prevent misuse of its platform by malicious actors and ensure compliance with its usage policies. This move is part of a broader effort to maintain the integrity and safety of AI model usage.

The introduction of Flex processing and the implementation of ID verification represent significant steps by OpenAI to balance cost-effectiveness, performance, and security in the rapidly evolving AI landscape. As competition intensifies, these measures could prove crucial in maintaining OpenAI's position as a leading AI provider.

Source: TechCrunch