Get Started Free

Google Meet Real-Time Speech Translation Guide + Alternative

Google Meet Real-Time Speech Translation Guide + Alternative

Create Subtitles, Voiceovers, and Transcripts in Minutes

Effortlessly generate subtitles, voiceovers, and transcripts in over 100 languages. Powered by advanced AI.

Book a Demo

Recently, I’ve been experimenting with Google Meet’s real-time speech translation feature and how it performs in multilingual meetings. While it’s one of the more interesting developments in AI-powered communication, I think it’s still fairly limited in terms of supported languages and customization options.

Those limitations can become noticeable fairly quickly if you or your team regularly work with participants who speak different languages or rely on platforms outside of Google Meet. In those cases, it can be useful to look at other tools that offer broader language support or more flexible setups.

In this blog, we'll:

  • Take a look at Google Meet's real-time speech translation feature (including its pros, cons, and how it works)
  • Explore Maestra's live speech translator (including its key features and how to use it step by step)
  • Compare Google Meet and Maestra side by side
  • Highlight the use cases where AI-powered live voice translation delivers the most value

Let's get started.

Google Meet Real-Time Speech Translation: Availability & Supported Languages

For years, Google Meet has offered live captions, allowing users to see a text transcription of what's being said. While helpful, it still required the participants to read and process the information. The real-time speech translation feature takes things significantly further by automatically translating the speaker's voice into the chosen target language in real time. [1]

Powered by Gemini AI, the system not only translates the words but also preserves the speaker's original tone, pitch, and vocal style. This creates a natural meeting translation experience; for instance, allowing a manager speaking in English to be head in Spanish by their Madrid-based colleagues without any need for a human interpreter.

The interface of Google Meet real-time speech translation feature.

Who can use AI speech translation in Google Meet?

Initially launched as a limited beta, the feature reached general availability in early 2026.

As of March 2026, Google Meet live speech translation is a premium feature available to the following subscribers:

  • Individual users: Google AI Pro and Google AI Ultra subscribers.
  • Business and education: Google Workspace customers on Business Standard and Plus, Enterprise Standard and Plus, Frontline Plus, and users with Gemini AI for Business or Education add-ons.

Only one participant (typically the host) needs an eligible subscription to enable the future for the entire meeting. Once enabled, all participants can choose to listen to the translated audio.

❓ Are you a Gemini user wondering can Gemini translate in real time? Our blog explains what it can and can't do.

Which languages does Google Meet support for real-time speech translation?

Google Meet live translation has also expanded beyond its initial English-Spanish pairing. The speech-to-speech feature now supports bidirectional real-time translation for a core group of major global languages.

Current supported language pairs include:

  • English↔Spanish
  • English↔French
  • English↔German
  • English↔Portuguese
  • English↔Italian

While the current system focuses on six languages, Google is working on adding more. This includes non-Western languages like Hindi, which is expected to be available in a future update. [2]

An illustration of a Google Meet video call interface featuring two diverse avatars.

How does Google Meet translation work?

Setting up Google Meet translation is a two-part process: a subscriber must first enable it for the meeting, and then each participant must configure their own listening and speaking preferences.

How to Enable AI Speech Translation in Google Meet (Hosts/Subscribers)

If you have an eligible subscription, you can turn on translation for everyone in the call:

  1. Join the meeting on a computer (mobile support for Google Meet real-time speech translation is expected to come later in 2026).
  2. Click Meeting tools>Speech translation at the bottom right.
  3. Click Enable translation for everyone.

Important: Only the subscriber/host has the authority to change the language combination for the entire meeting.

How to Configure AI Speech Translation in Google Meet (All Participants)

Once the feature is active, each person can customize their experience:

  1. Click the speech translation badge (a blue icon) in the top right corner of the screen.
  2. Select the Language that you speak and Language that you prefer to hear. (For the best experience, Google recommends setting both of these to your native language.)
  3. To allow your voice to be translated, you must click Allow translation>Translate me under Language that you speak. (To protect your privacy, Google will not translate your voice for others until you explicitly grant this permission.)
  4. Once configured, the AI will begin dubbing other speakers for you in real time. You will hear the translated version clearly while the original speaker’s voice plays quietly in the background. This helps you follow the conversation while still hearing the speaker's real tone and emotion.

For changing language combinations or more information, you can visit Google Meet's official speech translation guide.

Google Meet interface showing a video call with a "Speech translation" sidebar active for English and Spanish.

Speech translation is designed to make meetings more inclusive and effective.

Google

Benefits & Limitations of Google Meet Live Translation

AI speech translation in Google Meet can significantly improve communication in multilingual meetings. However, like any AI-powered system, it also comes with certain limitations depending on language support, meeting conditions, and subscription access.

Benefits of Google Meet Translation

  • Natural "dubbed" experience: Unlike traditional text captions that require you to look away from the speaker, this feature lets you maintain eye contact while hearing a voice that mimics the speaker’s original tone and vocal style.
  • Unified Workspace integration: There is no need for third-party bots or external plugins. The translation happens natively within the Meet interface, maintaining Google’s enterprise-grade security and encryption.
  • Cost-effective solution: Since the feature is built directly into eligible Google Workspace and Google AI plans, teams can add translation capabilities without getting separate interpretation services or tools.
  • Easy meeting-wide activation: A subscriber can enable translation for all participants with a single toggle. Once active, a blue badge appears for everyone, ensuring the entire team is aware that translation is available and ready to use.
3D Google Meet icon floating on a solid blue background.

Limitations of Google Meet Translation

  • Single language pair: Currently, only one translation pair (e.g., English↔Spanish) can be active in a meeting at a time. The system cannot yet translate into multiple different target languages simultaneously for different users.
  • Limited number of supported languages: As of March 2026, the feature is primarily focused on bidirectional translation between English and five major languages: Spanish, French, German, Portuguese, and Italian.
  • Processing delay: While designed for "near real-time" translation, there can be an inherent delay in the process. This pause, typically a few seconds, is necessary for the system to capture the audio, process it, translate it, and then synthesize the new speech.
  • Device restrictions: Google Meet live speech translation is currently optimized for desktop use. While a rollout for Android and iOS apps is underway, it is still a disadvantage for users who primarily join meetings from mobile devices.
  • No recording support: Translated audio is not currently captured in Google Meet recordings. If you record a meeting, the saved video file will only contain the original spoken audio, not the AI-dubbed translations.

Translate Speech in Real Time

Need more language options? Maestra's live speech translator supports over 125 languages, complete with flexible sessions that allow multiple source and target languages.
Live Translate Now
Related Article
10 Best Real-Time Translators for Meetings

10 Best Real-Time Translators for Meetings

A Guide to Maestra's Real-Time Speech Translator

Google Meet real-time speech translation is a great step forward, but from what I've seen, many individuals and teams still need something more flexible.

I started using Maestra’s speech translator when I needed something that could handle more languages and work across different platforms without being tied to a single meeting tool. In this part, I'll walk through the features that stand out and show how to start using the tool step by step.

Key Features of Maestra's AI Speech Translator

Below are the key features of Maestra's real-time online voice translator, which provides both live speech translation and translated captions to support multilingual communication:

  • 125+ supported languages: Unlike current limitations in Google Meet, Maestra supports 125+ languages for real-time speech translation, enabling inclusive communication across nearly any language group.
  • Multiple source and target languages: Users can select multiple source and target languages in the same session. This allows every speaker and listener to communicate in their preferred language without being limited to a single language pair.
  • Automatic language detection: The tool can automatically detect the language a speaker is using without requiring manual selection. This allows conversations to flow naturally even when participants switch languages during the session.
  • Speaker detection: The system can automatically identify and separate different speakers during a session. This helps participants follow the conversation more easily and clearly shows who is speaking in translated captions.
  • Contextual translation: Users can add and save context (background and domain information) to help the system better understand the topic of the conversation. This improves translation accuracy and provides greater terminology consistency throughout the session.
  • Automatic voice cloning: The live voice translator can preserve the speaker's original voice through AI-powered voice cloning. This allows the translated speech to retain the speaker’s tone, pitch, and vocal characteristics, creating a more natural listening experience.
Maestra interface showing English to French translation with an "AI Fact-Checker" overlay panel.
  • AI voices: Translated speech can also be delivered using natural-sounding AI voices. More than 800 voices are available in a variety of tones and accents to match the audience's preferences.
  • Customizable live captions: The tool also generates live translated captions that can be customized for readability and accessibility. Users can adjust elements like caption style, size, and display to make conversations easier to follow in real time.
  • Transcript saving and AI enhancements: Every live translation and transcript can be saved to your dashboard for later access and editing. From there, you can use built-in AI features to generate summaries, extract keywords, perform sentiment analysis, and gain deeper insights from the conversation.
  • Shareable sessions: Each translation session can be shared with participants through an invitation email. Attendees simply join the session from their own device and receive translated speech and captions in their selected language.
  • Integrations: Users can integrate the live translator with platforms like Zoom, Microsoft Teams, OBS, or vMix. This makes it possible to add real-time speech translation to meetings, webinars, broadcasts, and live streams without changing their existing setup.

How to Use Maestra's Live Speech Translator

Now let's walk through how to use Maestra's real-time speech translator:

  1. Go to the live translator and log in to your account. From the top, choose either Private or Shareablemode. (In this walkthrough, we’ll focus on the shareable option, but if you select private, you only need to follow the first four steps before starting your session.)
  2. Click Dubbing from the upper menu.
  3. Select speaker and translation languages. You can choose multiple languages for each, or enable auto-detect for the spoken language.
    How to use Maestra's live dubbing tool.
  4. At the bottom, select an AI voice for each target language, or choose to clone the speaker’s voice for a more natural and consistent output.
    A range of AI voices in Japanese for Maestra's real-time speech translator.
  5. Add your event details, including event name, date, time, and description.
  6. If needed, you can also add context to improve translation accuracy. You can include background information such as a topic summary, event details, or domain-specific terminology to help the AI better understand your session.
    How to add contextual information to Maestra's instant voice translator.
  7. Toggle on Save to Maestra app to access the session transcript later for documentation, editing, and repurposing.
  8. Toggle on Speaker detection to automatically identify and separate different speakers during the session.
    How to enable session saving and speaker detection in Maestra's AI speech translator.
  9. To invite speakers, enter their email addresses to send a join link for the session.
  10. When everything is set up, click the red plus button. A unique QR code and shareable join link will appear. Review your session information and click Start Session.
  11. Start speaking. A split-screen interface will appear, with the source transcript on the left and translated captions on the right. The translated audio will also play in real time as you speak.

    If the split screen view doesn't appear: Click the language button at the bottom right (showing the source language), then select the target language to open the split-screen view.

    The interface of Maestra's live translator showing a translation from English to Japanese.
  12. When the session ends, click the red microphone icon to stop the session. You can then save the transcript to your Maestra dashboard (if you haven't already enabled saving to app) or download the transcript as a TXT or DOCX file.

Further settings, accessed via the main page, allow you to customize and extend the functionality of the real-time translator:

  • Caption customization: You can adjust the appearance of live captions by changing elements like font style and size, color, alignment, and speaker labels to improve readability and match your preferences.
  • Integrations: You can connect the tool with platforms like Zoom, Microsoft Teams, vMix, and OBS, making it possible to use real-time speech translation in meetings, webinars, broadcasts, and live streams without changing your existing setup.
  • Custom glossary and word replacements: You can create a custom dictionary by adding preferred terms, brand names, or industry-specific vocabulary, ensuring more consistent and accurate translations throughout your sessions.
How to access Settings for integrations and customization in Maestra's online speech translator.

Real-Time Online Voice Translation: Google Meet vs. Maestra

Feature Google Meet Maestra
Language Support Supports bidirectional translation between English and Spanish, French, German, Portuguese, and Italian as of March 2026 125+ languages supported
Platform Support Built directly into and only works within Google Meet Can integrate with Zoom, Microsoft Teams, OBS, and vMix
Voice Cloning
Multilingual Session Support
Accessibility & Pricing Available to users with Google AI Pro/Ultra plans and eligible Google Workspace tiers Offers Premium, Business, and Business Plus plans, plus custom enterprise pricing
Free Trial

Top Use Cases for Real-Time Speech Translation

If you're wondering where real-time speech translation actually makes a difference, these are the situations where I've seen tools like Maestra's talking translator work best. The value shows up most when people from different language backgrounds need to communicate live without slowing down the conversation.

To significantly advance S2ST, we created a scalable data acquisition pipeline and developed an end-to-end model that provides direct, real-time language translation with just a two-second delay.

Karolis Misiunas & Artsiom Ablavatski
Google Engineers

Multilingual Meetings

In international meetings, language differences can easily slow the discussion down. Real-time translation helps teams stay focused on the conversation instead of constantly switching languages or relying on summaries afterward. Participants can follow the conversation in their preferred language through audio or captions while the meeting continues naturally.

Global Webinars and Virtual Events

When running webinars or online events for an international audience, making the content accessible to everyone can be difficult. Real-time translation helps presenters reach participants who may not speak the event’s primary language. This is especially useful for live webinar translation, where attendees can join through shareable links or email invitations and listen to translated audio directly from their own devices.

A man looking at a laptop screen that displays the word "WEBINAR."

Online Education and E-Learning

For educators and course creators working with international learners, language barriers can limit participation. Real-time translation makes it easier for students to follow lectures and discussions without waiting for translated materials later. In these cases, education translation in real time can help deliver content more clearly across different language groups.

Live Events and Streams

Live conferences and panel discussions often attract audiences from different regions. In these situations, a real-time translator can make the occasion more accessible without requiring human interpreters for every session, which can reduce overall costs. The same approach can also be used for live streams, allowing creators to expand their reach and connect with audiences who speak different languages.

Enterprise Training and Internal Comms

Large organizations often run training sessions or internal updates for employees across multiple countries. In these cases, real-time translation is often used to make communication more accessible. For companies operating internationally, training translation can help ensure consistency across teams.

Final Verdict

From what I’ve tested, Google Meet’s real-time speech translation works best in simple, controlled scenarios. It’s convenient, built-in, and easy to enable, but it doesn’t yet cover the needs of teams working across multiple languages or platforms. The single language pair limitation and restricted language support are the biggest constraints. If you need something more flexible, Maestra's live speech translator may be the better option.

Looking ahead, this is clearly just the beginning. As Google continues to develop its AI models, we can expect better performance, more languages, and fewer limitations. But even as it improves, there will still be a distinction between native meeting features and dedicated translation tools.

Frequently Asked Questions

Does Google Meet support real-time speech translation?

Yes, Google Meet supports real-time speech translation. It automatically dubs a speaker's voice into another language while preserving their original tone and pitch. As of early 2026, the feature is generally available for premium subscribers and supports bidirectional translation between English and five other languages.

How does real-time speech translation work in Google Meet?

Google Meet translation utilizes a specialized speech-to-speech model that transcribes, translates, and synthesizes audio in roughly two to three seconds. This AI-driven process is built directly into the Google Meet infrastructure. Because it is a native feature, it requires no external bots or plugins to handle the complex bidirectional translation.

Which languages are supported by Google Meet real-time speech translation?

As of March 2026, Google Meet real-time speech translation supports bidirectional translation between English and five major languages: Spanish, French, German, Portuguese, and Italian. Please note that while many other languages are supported for text-based captions, only this specific group currently supports the AI-dubbed voice translation.

Can Google Meet live translate captions?

Yes, Google Meet can live translate captions. Users can view translated captions as on-screen text or combine them with real-time AI speech translation for a more immersive multilingual experience. While speech translation currently supports a limited set of major languages, translated captions are available in a much wider range, covering around 70 languages.

How accurate is Google Meet live translation?

Google Meet live translation is generally accurate for clear speech and common vocabulary. However, because the system translates speech in real time, it can produce more errors than translations generated from recorded audio or written text. Accuracy may also decrease when there is background noise, fast speech, or complex technical terminology.

Is Google Meet live translation available on mobile devices?

Google Meet live speech translation is currently available only on computers. Participants who join meetings from mobile devices cannot use the feature yet. However, Google has confirmed that support for Android and iOS apps is planned in a future rollout.

Can Google Meet translate into multiple languages at the same time?

No. As of March 2026, Google Meet supports only one language pair at a time, meaning all participants must use the same translation combination during the meeting. If you need multiple source and target languages in the same session, tools like Maestra’s real-time online voice translator allow participants to select different languages simultaneously.

Is Google Meet translation better than Zoom AI translation?

It depends on what you need from the feature. Google Meet focuses on real-time speech-to-speech translation that preserves the speaker’s voice and tone. Zoom translation, on the other hand, mainly offers translated captions and interpretation features rather than full voice dubbing.

Serra Ardem

About Serra Ardem

Serra Ardem is a content writer and editor who explores the intersection of real-time language technologies, communication, and accessibility. She treats the digital landscape as a lab, researching how AI-powered translation and speech recognition shape the ways people connect across languages.

With over 10 years of experience in digital storytelling, Serra consistently experiments with new tools, helping readers turn complex tech into simple, practical solutions.