Meta introduces multilingual speech translation model for 100 languages

'SeamlessM4T' draws on findings from all of these projects to enable a multilingual and multimodal translation experience stemming from a single model.

Update: 2023-08-22 16:30 GMT

SAN FRANCISCO: Heating up the artificial intelligence (AI) race, Meta on Tuesday launched a new all-in-one, multilingual and multimodal AI translation and transcription model that supports up to 100 languages, depending on the task.

Called ‘SeamlessM4T,’ the single model can perform speech-to-text, speech-to-speech, text-to-speech, and text-to-text translations.

'SeamlessM4T' supports speech recognition for nearly 100 languages; speech-to-text translation for nearly 100 input and output languages; speech-to-speech translation for nearly 100 input languages and 36 output languages (including English); and text-to-text translation for nearly 100 languages.

It also supports text-to-speech translation for nearly 100 input languages and 35 output languages (including English).

“We’re also releasing the metadata of SeamlessAlign, the biggest open multimodal translation dataset to date, totalling 270,000 hours of mined speech and text alignments,” Meta said in a blog post.

Last year, Meta released No Language Left Behind (NLLB), a text-to-text machine translation model that supports 200 languages, and has since been integrated into Wikipedia as one of the translation providers.

“We also shared a demo of our Universal Speech Translator, which was the first direct speech-to-speech translation system for Hokkien, a language without a widely used writing system,” said the company.

“Earlier this year, we revealed Massively Multilingual Speech, which provides speech recognition, language identification and speech synthesis technology across more than 1,100 languages,” it added.

'SeamlessM4T' draws on findings from all of these projects to enable a multilingual and multimodal translation experience stemming from a single model, built across a wide range of spoken data sources with state-of-the-art results, Meta noted.
