Microsoft Announces Three New AI Foundation Models

Microsoft's AI research lab has released three new foundational AI models. These models aim to generate text, voice, and images, showcasing the tech giant's continued push in multimodal AI.

Model Details

MAI-Transcribe-1: This model transcribes speech across 25 different languages into text. According to Microsoft, it is 2.5 times faster than Azure Fast offering.

MAI-Voice-1: A voice-generating model that allows users to create custom voices in seconds.

MAI-Image-2: A video-producing model capable of generating high-quality videos from input text or audio.

Relevance and Impact

These models mark a significant step forward for Microsoft's AI capabilities. By offering more cost-effective alternatives to rival models, such as those from Google and OpenAI, Microsoft aims to position itself at the forefront of multimodal AI research.

The release of these foundation models is likely to stimulate further development in the field, potentially leading to even more advanced AI applications.

🔗 来源：Microsoft takes on AI rivals with three new foundational models (AI 严选)

Microsoft Announces Three New AI Foundation Models

Model Details

Relevance and Impact

用户反馈

推荐意见

类别

我的帐户

导航

搜索

配置浏览器推送通知

Chrome (安卓)

Chrome (台式电脑)

Safari (iOS 16.4+)

Safari (macOS)

Edge (安卓)

Edge (台式电脑)

Firefox (安卓)

Firefox (台式电脑)