Google releases Gemini 2.5 Flash with a “thinking budget”

Google has released an early preview version of Gemini 2.5 Flash. With it, Google introduced a new “thinking budget” that lets developers trade off cost against quality.

The details

  • Gemini 2.5 Flash is a hybrid reasoning model that allows developers to turn thinking on and off.
  • Additionally, developers can set a “thinking budget” of up to 24k tokens. It caps how many tokens the model may spend on reasoning before it generates a response.
  • The model includes new features like Canvas, an interactive space for improving your documents and code. According to Google, it also has better reasoning capabilities.
  • Gemini 2.5 Flash is available in the Gemini app and through the Gemini API. Compared to other models with similar performance, Gemini 2.5 Flash is significantly cheaper.
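
To make the “thinking budget” more concrete, here is a minimal sketch of how a request body with a budget could be assembled in Python. It only builds the JSON payload and does not call the API. The helper name `build_request` is hypothetical, the upper limit assumes that “24k” means 24 × 1024 = 24,576 tokens, and the field names (`generationConfig`, `thinkingConfig`, `thinkingBudget`) reflect our understanding of the Gemini REST API, so please check them against the current documentation before relying on them.

```python
import json

# Assumption: "24k" in the article means 24 * 1024 = 24,576 tokens.
MAX_THINKING_BUDGET = 24 * 1024


def build_request(prompt: str, thinking_budget: int) -> dict:
    """Build a generateContent-style JSON body with a clamped thinking budget.

    Hypothetical helper for illustration; it performs no network call.
    """
    # Clamp the budget into [0, MAX_THINKING_BUDGET].
    # A budget of 0 is intended to turn thinking off.
    budget = max(0, min(thinking_budget, MAX_THINKING_BUDGET))
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        # Field names assumed from the public Gemini REST API.
        "generationConfig": {"thinkingConfig": {"thinkingBudget": budget}},
    }


if __name__ == "__main__":
    # Small budget for a simple task: cheaper, less reasoning.
    print(json.dumps(build_request("Summarize this changelog.", 1024), indent=2))
```

A low budget (or 0) suits simple tasks where cost and latency matter most, while a higher budget gives the model more room to reason on harder prompts.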

Our thoughts

The new model is powerful and cost-effective. The new “thinking budget” allows users to balance costs and quality for specific use cases. 

Additionally, we have noticed that Google’s AI models have much lower hallucination rates than other models. You can check the hallucination rates of SOTA LLMs on the Vectara leaderboard on Hugging Face.

More information: 🔗 Google


Magic AI tool of the week

Today, it is essential to work in a structured and organized manner. There are many tools that you can use to boost your productivity. However, finding the right tool for your needs can be overwhelming.

One of the best tools we’ve ever used is Notion, especially with its powerful AI features. Notion combines the features of a note-taking app, document editor, project management tool, and AI assistance.

AI will help you finish your tasks faster and more efficiently. We promise this tool boosts your productivity from day one.

👉🏽 Try Notion for free!*

