Release your creativity on a scale: Multimodal Revolution Azure AI Foundry | Blog Microsoft Azure

Imagine a platform where every developer can unlock the entire AI spectrum: text, pictures, sound and video. This Azure AI Foundry Moles Opendai Devday makes this vision real. With today’s launch OpenI GPT-Image-1-Mini, GPT-Realtime-Mini and GPT-Audio-Mini, plus the main security upgrades to GPT-5, now you have the highest set of tools for creating, experiment and measure of multimodal solutions.

Imagine a platform where every developer – whether you build for a startup or a global business – can unlock the entire AI spectrum: text, pictures, sound and video. This Azure AI Foundry Moles Opendai Devday makes this vision real. With today’s launch OpenI GPT-Image-1-Mini, GPT-Realtime-Mini and GPT-Audio-Mini, plus the main security upgrades to GPT-5, now you have the final set of tools for creating, experiment and scale of multimodal solutions and more accessible than ever. We are excited that we can share that the models that OPENAI announced are now developing in Azure AI Foundrywith the fact that most customers are able to start on October 7, 2025.

Today’s connection to the main innovations we announced last week notifications Start the frame of Agent Microsoft (Now in preview), Workflows in Foundry Agent Service in private preview, unified observability, Voice Live API, General availability of API and new responsible AI capabilities. Microsoft Agent Framework (Github) is a commercial, open source of SDK and Runtime designed to simplify orchestration of more agents systems. It unifies the semantic core foundations ready for business with multi-agent autogen and gives developers tools to build intelligent and scalable agent solutions with speed and confidence.

By expanding the Azure AI foundry with the latest Openai models and the development of our AI agent, we seize customers with unrivaled choice, flexibility and business skills, allowing developers to build intelligent agents that deal with complex business needs and increase innovations.

Meet new models: built for developers prepared for anything

GPT-IMAGE-1-MINI: Compact force for visual creativity

GPT-IMAGE-1-MINI has a purpose for organizations and developers who need fast and efficient image generation on a scale. Its compact architecture allows high -quality text and image and image creation on image while consuming fewer computing resources, allowing teams to deploy multimodal AI even in limited environments. Its robust architecture based on the Image-1 model optimizes the consistency and ease of adoption for organizations that already use multimodal AI in the Azure AI founders.

What is strange?

  • Flexible Image Generation: Deployment of high quality Text-to-Image and picture Function without budget violation.
  • Lightning-Quick Inference: Generate real -time images, fluently integrated into the existing Azure AI Foundry workflows.

Cases of use:

  • Generating educational materials for classrooms and online learning.
  • Designing stories and visual stories.
  • Creating game assets for rapid prototyping and development.
  • Acceleration of working procedures of the user interface design for applications and websites.

Table 1: Prices and deployment of GPT-IMAGE-1-MINI in Azure AI Foundry (per 1m tokens)*

GPT-Realtime-Mini and GPT-Audio-Mini: Effective and affordable voice solution

Two new mini models are designed for organizations and developers who need quickly, cost -effective multimodal AI without sacrificing quality. These models are lightweight and highly optimized and give voice interaction in real time and generating sound with minimal resource requirements. Their simplified architecture allows rapid inference and low latency, which makes them ideal for scenarios, where the speed and sensitivity of the voice chatbots, real-time translation and dynamic sound of sound content are critical. By consuming fewer computing resources, these models help businesses and developers’ teams reduce operating costs and at the same time scaling multimodal capabilities across a wide range of applications.

What makes them special?

  • Reality in real time: Power Chatbots, Assistants and Translation Tools with almost zero latency.
  • Light sources: Start advanced voice and sound models for minimal infrastructure.
  • Affordable scaling: Reduce your operating costs and expand the multimodal abilities.

Cases of use:

  • Voice chatbots for customer service and support.
  • Real -time translation for global communication.
  • Dynamic creating sound content for media and fun.
  • Interactive voice assistants for corporate and consumer applications.

GPT -Realtime -mini in Azure AI Foundry allows our customer to create voice solutions with lower latency, better adherence to instructions and cargo efficiency -the capacity of our customers appreciates, controls shorter handle time, smoother dialogues and faster time to value.

Andy O’Dower, VP Product, Twilio

Table 2: GPT-Realtime-Mini and GPT-Audio-Mini Determination and Deployment in Azure AI Foundry (per 1m tokens)*

Table with pricing information.

GPT-5-CAT-LABEST: Increasing the rod for safety and well-being

The latest GPT-5-CAT-LABEST update in Azure AI Foundry introduces a more robust set of security railings that is designed to better protect users during sensitive conversations. With increased detection and response ability, the GPT-5-Chat-Lakest is now equipped for more efficient recognition and control of dialogue that could lead to mental or emotional distress. These improvements reflect our constant AI responsible obligation, which ensures that any interaction is not only intelligent and helpful, but also safe and support for users at difficult moments.

Table 3: GPT-5-CAT-LABEST CRICING AND Deployment in Azure AI Foundry (on 1m tokens)*

Table with pricing information.

GPT-5-For: The peak of reasoning and analytics

The GPT-5-Pro ​​is the peak of advanced thinking and analytics in the Azure AI foundry ecosystem and provides intelligence to the level of research. When deployed via foundry, architecture in the GPT-5-PRO tournament style, it uses several ways of thinking to ensure maximum accuracy and reliability, which is ideal for comprehensive analysis, code generation and decision-making. With Azure AI Foundry, the organization unlocked the full potential of the GPT-5-PRO, managed smarter decisions and accelerated innovations across their most important business processes, safely and reliably.

Table 4: Prices and deployment of GPT-5-for*

Table with pricing information.

The edge of the developer: assembly, experiment and ship – Furter

On these new models, the Azure AI foundry not only keeps up – sets the pace. Developers can now move as text, use the generation of images and sound, editing and understanding. Result? Richer, smarter workflows that control innovations in every sector – from education and games to business automation.

Sneak Peek: Sora 2-Next on video level and sound generation

And there are more on the horizon. Sora 2 in Azure AI Foundry comes early and brings advanced video and sound generation to a single API. Imagine an animation with physics controlled, a synchronized dialog and portrait function available to developers via Azure AI Foundry. Stay tuned to another wave of absorbing generative experiences.

Are you ready to create another wave of absorbing multimodal experiences? Azure AI Foundry is your platform for every option.


*Prices are accurate since October 2025.

(Tagstotranslate) Models of large languages ​​(LLM)

Leave a Comment