Text Modeling - Search News

18h

Google’s Gemini 3.1 Flash TTS model offers unparalleled control over AI voices

Google LLC’s DeepMind artificial intelligence unit today rolled out a new text-to-speech model called Gemini 3.1 Flash TTS.

Microsoft launches MAI-Image-2-Efficient, a cheaper and faster AI image model

How Microsoft shipped a production-optimized image model in under a month. The speed of this release deserves attention.

Fox Business

OpenAI releases text-to-video AI model Sora to certain ChatGPT users

OpenAI released its text-to-video artificial intelligence model, Sora, this week after the completion of its testing phase. The Microsoft-backed AI startup first teased the model in February and ...

TechCrunch

Google debuts a new Gemini-based text embedding model

Google on Friday added a new, experimental “embedding” model for text, Gemini Embedding, to its Gemini developer API. Embedding models translate text inputs like words and phrases into numerical ...

CNET on MSN

Microsoft's New AI Models Go Beyond Just Text

Microsoft's New AI Models Go Beyond Just Text ...

VentureBeat

Meta’s Transfusion model handles text and images in a single architecture

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Multi-modal models that can process both ...

Engadget

OpenAI’s new Sora model can generate minute-long videos from text prompts

OpenAI's text-to-videos tool Sora generates high-quality videos up to one minute in length. (OpenAI) OpenAI on Thursday announced Sora, a brand new model that generates high-definition videos up to ...

11h

Meet Happy Oyster: Alibaba launches new AI video model that turns text prompts into playable 3D worlds

Alibabahas launched the Happy Oyster AI model. The new model is capable of generating interactive 3D environments and videos ...

TechCrunch

Largest text-to-speech AI model yet shows ’emergent abilities’

Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally. The ...

Why Developers Are Dropping Cloud APIs for This Tiny 82M Speech Model

Kokoro 82M is an 82-million-parameter text-to-speech model that beats many TTS APIs while running locally on CPUs, including ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results