Model Text - Search News

16d

Mistral drops Voxtral Transcribe 2, an open-source speech model that runs on-device for pennies

Mistral AI has launched Voxtral Transcribe 2, a new on-device speech-to-text model family featuring real-time transcription, speaker diarization, and open-weights licensing—aimed at cheaper, ...

VentureBeat

Meta’s Transfusion model handles text and images in a single architecture

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Multi-modal models that can process both ...

Geeky Gadgets

How does a GPT AI model work and generate text responses?

Over the last few years Generative Pretrained Transformers or GPTs have become part of our everyday lives and are synonymous with services such as ChatGPT or custom GPTs. That can be now created by ...

Engadget

NVIDIA's new AI model Fugatto can create audio from text prompts

NVIDIA has debuted a new experimental generative AI model, which it describes as "a Swiss Army knife for sound." The model called Foundational Generative Audio Transformer Opus 1, or Fugatto, can take ...

InfoWorld

OpenAI previews Realtime API for speech-to-speech apps

Realtime API supports multi-model text and speech experiences including natural speech-to-speech conversations using preset voices already supported in the API. OpenAI has introduced a public beta of ...

InfoWorld

Microsoft’s Phi-4-multimodal AI model handles speech, text, and video

Microsoft has introduced a new AI model that, it says, can process speech, vision, and text locally on-device using less compute capacity than previous models. Innovation in generative artificial ...

13d

Sarvam rolls out new AI voice model, Bulbul V3, as part of 14-day launch blitz

Bulbul V3 is a text-to-speech AI model that looks to make the output audio sound more natural by rendering pauses, emphasis, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results