

Microsoft has introduced three new specialised AI models this month for speech-to-text transcription, voice generation, and image generation. The models (MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2) are designed to deliver faster performance, improved accuracy, and competitive pricing. According to the company, MAI-Transcribe-1 transcribes speech in 25 major languages and, in internal testing, achieves lower error rates than rival models.
MAI-Voice-1 generates natural, expressive speech with a consistent tone, while MAI-Image-2 improves image quality with better lighting, textures, and clarity. The models are currently available through Microsoft Foundry and MAI Playground, and are also being integrated into products such as Copilot, Bing, and PowerPoint. With these releases, Microsoft aims to strengthen its position in the AI space.





















