Meta’s “massively multilingual” AI model translates up to 100 languages, speech or text

Enlarge (credit: Getty Images)

On Tuesday, Meta announced SeamlessM4T, a multimodal AI model for speech and text translations. As a neural network that can process both text and audio, it can perform text-to-speech, speech-to-text, speech-to-speech, and text-to-text translations for "up

Meta’s open-source speech AI models support over 1,100 languages

Advancements in machine learning and speech recognition technology have made information more accessible to people, particularly those who rely on voice to access information. However, the lack of labelled data for numerous languages poses a significant challenge in developing high-quality …

文 » A