Microsoft Launches Three New MAI AI Models for Speech, Voice and Image Generation in Foundry Platform

The three models are available through Microsoft Foundry, the company's platform for giving developers and enterprise customers access to a broad range of AI tools and services. MAI-Transcribe-1 converts spoken audio into text, while MAI-Voice-1 generates realistic synthetic speech. MAI-Image-2, the second iteration in Microsoft's image generation line under the MAI brand, produces visuals from text prompts and other inputs.

Global advertising and marketing conglomerate WPP has signed on to test the new tools, lending early enterprise validation to Microsoft's latest AI push. WPP's involvement underscores the potential commercial applications of the models, particularly in industries such as media, advertising and content production. The partnership suggests Microsoft is actively pursuing large-scale enterprise adoption as it positions Foundry as a competitive hub for AI development.

The launch comes as Microsoft continues to invest in AI infrastructure and capabilities across its product ecosystem. The company has relied heavily on its partnership with OpenAI, whose models power many of Microsoft's AI-driven consumer and enterprise offerings. The introduction of proprietary MAI models indicates a strategic effort to diversify Microsoft's AI portfolio and reduce dependence on any single external provider.

Microsoft Foundry is positioned as a one-stop platform for developers to access, customize and deploy AI models at scale. By adding its own MAI models to the Foundry catalog, Microsoft is expanding the range of first-party options available to businesses building AI-powered applications. Industry analysts have noted that the move aligns with a broader trend among major technology companies seeking greater control over the AI models underpinning their platforms and services.

Pricing, availability timelines and full technical specifications for the three new MAI models had not been fully disclosed at the time of publication. Microsoft has indicated that the models are accessible within Foundry, though broader rollout details are expected to follow. The company's expansion into multimodal AI (spanning audio, voice and visual content) positions it to compete more directly with rivals including Google, Amazon and a growing number of specialized AI providers.
