When You Need a Written Record of What People Said

Updated 10 June, 2021

Literally months after announcing Live Transcription for Teams meetings in message center notification MC220987 (August 26 2020, updated Nov 10), Microsoft published a blog post on March 23 to say that the functionality is now generally available. Making the transcript available completes the work to create a meeting recap to highlight important information shared in calls and adds to an array of recent improvements introduced to make Teams meetings run smoother.

Transcription is available for those with Office 365 E3 and, E5, Microsoft E3 and E5, and Microsoft 365 Business Standard and Business Premium plans. Currently, transcription works for personal meetings. It’s not available for channel meetings, Meet Now meetings, or 1:1 calls. Meeting transcription is only available in the Teams desktop client.

Update: According to message center notification MC260564 (June 7), Microsoft is making transcription available to these additional SKUs to make Teams meeting content more accessible to those challenged by hearing issues:

  • Office 365 E1
  • Office 365 A1
  • Office 365/Microsoft 365 A3
  • Office 365/Microsoft 365 A5
  • Microsoft 365 F1
  • Office 365/Microsoft 365 F3
  • Microsoft 365 Business Basic

Meeting transcription (automatic captions) is a feature of Stream. When Teams transitioned storage of meeting recordings to OneDrive for Business, the ability to generate transcriptions was lost. That gap is now closed, which is a relief because all new Teams meeting records are now stored in OneDrive unless an organization explicitly chooses to stay with Stream.

Change in Plan

One notable change since Microsoft first published MC220987 is that recording is no longer combined with transcription. Two separate options appear in the Teams meeting menu (Figure 1). You can start a recording without generating a transcript or start transcription without recording a meeting. Separate notices are used to advise meeting participants when recording and transcription are active.

Separate options for recording and transcribing Teams meetings
Figure 1: Separate options for recording and transcribing Teams meetings

Policy Control for Transcription

Before a meeting organizer or presenter can transcribe a meeting, the Teams meeting policy assigned to their account must have the AllowTranscription setting enabled (its companion setting is AllowCloudRecording to control the ability to record a meeting). The AllowTranscription setting can only be changed with PowerShell. To check the settings of the Global (default) policy, run the Get-CsTeamsMeetingPolicy cmdlet (included in the Teams PowerShell module):

Get-CsTeamsMeetingPolicy -Identity Global | ft identity, AllowTranscription, AllowCloudRecording

Identity                    AllowTranscription AllowCloudRecording
--------                    ------------------ -------------------
Global                      False              True

To update the policy, run the Set-CsTeamsMeetingPolicy cmdlet:

Set-CsTeamsMeetingPolicy -Identity Global -AllowTranscription $True

Capturing Speech During Meetings

Transcription is an AI-generated record of what is spoken during a meeting. Processing occurs in almost real time to display text during meetings (Figure 2) and to make a full transcript available soon after a meeting is over.

Transcribed text displayed during a Teams meeting
Figure 2: Transcribed text displayed during a Teams meeting

In their blog post, Microsoft says that “Delivering live transcription with high accuracy, minimal latency, and cost efficiency at enterprise scale has been one of the toughest challenges in the industry. Over the last two years we’ve made significant strides in solving this problem and have dramatically improved our models for accuracy using meeting context in real time and cutting edge AI.”

Microsoft also notes that an AI model is created for each meeting to take account of the meeting topic, participants, and attachments to improve the accuracy of text recognition, especially with jargon. Microsoft says that the models used for meeting transcription are removed after a meeting is over and not used to improve their AI.

There’s no doubt that capturing and transcribing conversations during a meeting is a mind-bendingly difficult computation task. The computational challenges will become even harder as Microsoft expands coverage to additional languages from its current limitation of U.S. English.

I’m not sure how Teams detects US English as it doesn’t seem to have any difficulty interpreting my Irish accent. Perhaps it’s fairer to say that transcription works when people speak a transatlantic U.S.-style English and won’t if someone starts to speak French, German, Italian, or another language.

A glance at the transcript shown in Figure 2 illustrates how difficult it can be to capture an accurate transcript. If the meeting is full of different voices, if participants don’t have good microphones and the audio feed is indistinct in any way, you’ll see oddities in the transcript (like “The seat back on the Moon” – I have no idea what was said there). Noise suppression in the Teams desktop client seems to help as I have noted better results when people use the desktop rather than the browser or mobile clients.

Using Meeting Transcripts

When a meeting finishes, the meeting transcript is available almost immediately. Open the meeting in the Teams calendar app and the transcript is available in the meeting recap or via the Recordings & Transcripts tab. Figure 3 shows an example of a transcribed meeting which happened in three parts, one for each segment. In this case, the meeting organizer is reviewing the transcript, so the delete option is available. Other participants can view and down transcripts, but they can’t delete a transcript.

Viewing a transcript of a Teams meeting
Figure 3: Viewing a transcript of a Teams meeting

Transcripts are downloadable in Video Text Track (VTT) or Microsoft Word (docx) format. VTT files are text-format files where individual contributions are noted with a detailed timestamp. For example:

00:03:02.870 --> 00:03:05.570
<v Sean Landy>So now we get its goal.</v>

The Word format is more readable because it is less structured and more valuable because the transcript is easier to edit. In both cases, the content is exactly what is displayed during the meeting. The availability of the downloaded files makes it possible to edit obvious errors and add missing content before releasing a formal record of a meeting.

Guest users don’t have access to the Teams calendar app, so they can’t access the transcript once a meeting is over.

Privacy

The default for transcription is to use speaker attribution. In other words, as people speak, their contributions are identified in the transcript using their name. Users can hide their identity for transcripts and live captions in the Captions and transcripts section of client settings (Figure 4).

Setting the privacy option for speaker attribution in Teams transcripts and live captions
Figure 4: Setting the privacy option for speaker attribution in Teams transcripts and live captions

When a user chooses not to identify themselves in transcripts, Teams inserts the generic “Speaker” attribution with a number, so the transcript contains text for “Speaker 1,” “Speaker 2,” and so on.

Like many other Teams settings, caching means that it can take a while before the option to allow or deny identification in transcripts becomes active.

Transcripts and Compliance

Microsoft’s blog says: “Teams live transcription files are stored in the meeting organizer’s Exchange Online account and only the organizer and tenant admin have permissions to delete it.” Let’s see where the transcript information is stored. After running a meeting to generate a transcript, I used PowerShell to detect recent updates to folders in my Exchange Online mailbox and then the MFCMAPI utility to examine the items in those folders. This revealed that Teams stores transcript items in the ApplicationDataRoot/93c8660e-1330-4e40-8fda-fd27f9eafe10/MeetingTranscriptCollection folder in the NonIPMRoot (hidden part) of the mailbox. Each item refers to a transcript. If we examine the properties stored for the transcript, we find some JSON content (Figure 5).

A Teams transcript item stored in an Exchange Online mailbox viewed through MFCMAPI
Figure 5: A Teams transcript item stored in an Exchange Online mailbox viewed through MFCMAPI

Pasting the JSON into a PowerShell variable, we can see what it holds:

$json | convertfrom-json

TranscriptJsonBlob@is.Large         : True
TranscriptJsonBlob@stream           : True
TranscriptJsonBlob@type             : Edm.String
TranscriptMetadataJsonBlob@is.Large : True
TranscriptMetadataJsonBlob@stream   : True
TranscriptMetadataJsonBlob@type     : Edm.String
TranscriptWordBag@is.Queryable      : False
TranscriptWordBag                   :
TranscriptWordBag@type              : Edm.String
ThreadId@is.Queryable               : True
ThreadId                            : 19:meeting_NWY0ZDMwZGMtZWVjYi00ZDFmLTlhYmMtNWYzOWU2NGVlY2Jh@thread.v2
ThreadId@type                       : Edm.String
MessageId@is.Queryable              : True
MessageId                           : 0
MessageId@type                      : Edm.String
#type.ThreadId                      : String
#type.MessageId                     : String

There’s no transcript text present. Instead, we have the thread link for the meeting to bring us to the Teams data store in Azure. Some extra probing located a second property called TranscriptJsonBlob. This is where the transcript text is stored.

If Exchange Online mailboxes stored transcript text, transcripts would be indexed and discoverable. And this is just what will happen when Microsoft delivers the update documented in message center MC260749 (June 8, 2021) to make it possible to search spoken text captured in transcripts.

Good New Feature

Overall, there’s lots to like about meeting transcription. Some people get better results than others, but experience will guide people to using better setups when transcription is important. Automated transcripts might be imperfect and include some interesting but incorrect interpretations of what people say during meetings. However, having Teams generate a transcript is a lot cheaper than having a professional transcriber listen to a meeting recording, and the output is a good start for a final record. Errors are easily fixed by editing the transcript and making that version available to attendees (and if the edited version is a Word document circulated by email or stored in OneDrive for Business or SharePoint Online, it will be discoverable).

About the Author

Tony Redmond

Tony Redmond has written thousands of articles about Microsoft technology since 1996. He is the lead author for the Office 365 for IT Pros eBook, the only book covering Office 365 that is updated monthly to keep pace with change in the cloud. Apart from contributing to Practical365.com, Tony also writes at Office365itpros.com to support the development of the eBook. He has been a Microsoft MVP since 2004.

Comments

  1. TJ

    Some of my VTTs have the Speaker name and some don’t. Do know why that is and how I can ensure that the speaker name always shows in the VTT. I need this for PhD research. Am using Microsoft Teams and even though the word version shows the speaker names, the VTT file does not.

Leave a Reply