Batch Mode in the Gemini API: Process more for less
Products
More
Solutions
Events
Learn
Community
Developer Program
Blog
Gemini models are now available in Batch ModeToday, we’re excited to introduce a batch mode in the Gemini API, a new asynchronous endpoint designed specifically for high-throughput, non-latency-critical workloads. The Gemini API Batch Mode allows you to submit large jobs, offload the scheduling and processing, and retrieve your results within 24 hours—all at a 50% discount compared to our synchronous APIs.Process more for less...
Read more at developers.googleblog.com