SOLVED

Download Transcripts

Brass Contributor

Does anyone know if it is possible to download the entire transcript for a video that has been put on Stream? I wanted to take the transcript and use it to create a short summary for a blog. 

16 Replies
best response confirmed by SarahFabius (Brass Contributor)
Solution

Hi @SarahFabius,

Yes, download the caption file in the Stream admin portal. Screenshot shown here.

https://answers.microsoft.com/en-us/msoffice/forum/all/download-transcript-from-microsoft-stream/18f...

 

This will also help

 

https://techcommunity.microsoft.com/t5/microsoft-stream-ideas/allow-export-of-transcript/idi-p/20546...

 

As Mark Mroz states, download the caption file from Stream (as in the first link) and then run it through this web utility which pulls out just the transcript text from the file (removing the time codes, metadata, and blank lines).

 

https://aka.ms/StreamVTTCleaner


Hope that answers your question!

Best, Chris

@Christopher Hoard You are awesome. Thank you!

@Christopher Hoard 

 

This doesn't work for me - I don't see that link to download the captions.

I also don't see the Show Transcript option under View Settings. I swear I had these options last week.

 

@GmsDave I also cannot see the download link, but only for videos that other people shared with me. I can see the transcript for such a shared video, but there is no place to download it.

However, for my own content, I can choose "Update video details", and on that panel I can download transcripts for the videos I made.

@MarkTab A day or two later, those options showed up for me again, and I've had them ever since. I wonder if a service outage or upgrade had temporarily disabled those features for me.

NicoleReilly_0-1597833805946.jpeg

I've got a customer who is getting this message when trying to download the .VTT file.  Is it as simple as creating an association with Notepad or similar? Or is there another Windows setting that needs to change?

 

They want to download it so they can then run it through the web utility already mentioned on this thread.

@NicoleReilly I got this same error - have not seen a response to your raising this question. Would love to know what to do as I am currently not able to download the caption transcript...

@Joanne_Roig I discovered that the files had downloaded despite the error message (I found them in the Downloads folder on the local machine), so I uploaded those to the web converter and it worked.

Thank you @NicoleReilly. I just checked and did find the files. However, it still wants me to select an app with which to open the file. I selected Word, but with each sentence there is coded gibberish like the example I pasted below. Would you happen to know which app I can or should select (if not Word) that would give me just the pertinent transcript without all of the extraneous codes?

Thank you!

Joanne

 

EXAMPLE:

 

WEBVTT

NOTE duration:"01:24:45.5700000"

NOTE language:en-us

NOTE Confidence: 0.26191342

c386a959-c91c-458f-b0aa-f0b419c25034
00:00:00.100 --> 00:00:00.620
Sense.

NOTE Confidence: 0.8726744

85d8b921-32d3-4045-87de-8afb058981ce
00:00:02.100 --> 00:00:07.118
My laptop OK great so Emily.
Why do we start with you and

NOTE Confidence: 0.8726744

42d47e6d-bb49-435a-b3db-bffb288c9346
00:00:07.118 --> 00:00:11.750
kind of get get us kicked off
with the introduction to you

NOTE Confidence: 0.8726744

0d9bf88b-dbe3-4bae-bbf8-68d6a83c9227
00:00:11.750 --> 00:00:12.908
and your team.


If you run it through the web utility previously mentioned, it will pull out just the transcript text from the file (removing the time codes, metadata, and blank lines).

The web utility link is:

https://aka.ms/StreamVTTCleaner
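
If you would rather do the same cleanup locally, here is a minimal Python sketch of the idea. It is not the code behind the web utility; the function name `vtt_to_text` is made up for illustration, and it assumes the file is laid out like a standard WebVTT file, with blocks separated by blank lines and each caption's text following its `start --> end` timing line.

```python
import re


def vtt_to_text(vtt: str) -> str:
    """Return only the spoken text from a WebVTT caption file.

    Hypothetical helper: assumes blocks are separated by blank lines and
    that caption text follows the "start --> end" timing line.
    """
    text_lines = []
    # Normalise Windows line endings, then split the file into blocks.
    for block in re.split(r"\n\s*\n", vtt.replace("\r\n", "\n").strip()):
        lines = [ln.strip() for ln in block.split("\n") if ln.strip()]
        if not lines or lines[0].startswith(("WEBVTT", "NOTE")):
            continue  # file header or metadata block, no spoken text
        for i, ln in enumerate(lines):
            if "-->" in ln:
                # Everything after the timing line is caption text.
                text_lines.extend(lines[i + 1:])
                break
    return " ".join(text_lines)
```

To use it, read the downloaded .vtt file as text (for example with `open(path, encoding="utf-8").read()`), pass the contents to `vtt_to_text`, and write the result to a .txt file.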

Thank you, @NicoleReilly  I had no idea what a Web Utility was, so clicked on the link you sent - figured it all out and it worked well. Thanks again! - Joanne Roig

 

@Christopher Hoard Thanks Chris. Very useful. Now if only it put in newlines for sentences rather than the whole transcript on one line ;)..... although a quick find and replace on "." sorted it.
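
For anyone who wants to script that find and replace, a rough Python equivalent is sketched below. The function name `split_sentences` is hypothetical, and the rule is deliberately naive: it breaks after every ".", "!" or "?" that is followed by whitespace, so abbreviations such as "e.g." will also start a new line.

```python
import re


def split_sentences(text: str) -> str:
    """Put each sentence of a one-line transcript on its own line.

    Naive rule (an assumption, not a real sentence splitter): break on
    whitespace that follows '.', '!' or '?'.
    """
    return re.sub(r"(?<=[.!?])\s+", "\n", text.strip())
```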

Is there a potential ETA on when a simple "Download transcript" button will be enabled? The current workarounds suggested below are nice hacks and provide neat things like "confidence" of speech-to-text but it would be nice to have a simple button just for this simple text. I'd also like to separately use the audio clips to determine who spoke when so that "Download transcript" would produce an easy-to-read file like:

[12:20] Bob: good morning, everyone welcome to the meeting.
[12:21] Alice: today we will speak about a new on Teams
etc...

Related:
https://techcommunity.microsoft.com/t5/microsoft-stream-forum/how-do-i-download-the-transcript-and-d...
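
Until such a button exists, part of that layout can be approximated from the downloaded caption file. The Python sketch below is only an illustration: the function name `vtt_to_timestamped_lines` is made up, it assumes timing lines in the `hh:mm:ss.mmm --> hh:mm:ss.mmm` form seen in the caption files posted in this thread, and it cannot add speaker names because the caption file does not contain them.

```python
import re

# Matches the start time of a cue timing line such as
# "00:00:02.100 --> 00:00:07.118" and captures "00:00:02".
CUE_TIMING = re.compile(r"(\d{2}:\d{2}:\d{2})\.\d{3}\s+-->")


def vtt_to_timestamped_lines(vtt: str) -> list[str]:
    """Return one "[hh:mm:ss] text" line per caption in a WebVTT file."""
    out = []
    # Blocks are assumed to be separated by blank lines.
    for block in re.split(r"\n\s*\n", vtt.replace("\r\n", "\n").strip()):
        lines = [ln.strip() for ln in block.split("\n") if ln.strip()]
        for i, ln in enumerate(lines):
            match = CUE_TIMING.match(ln)
            if match:
                text = " ".join(lines[i + 1:])
                if text:
                    out.append(f"[{match.group(1)}] {text}")
                break  # header and NOTE blocks never reach this point
    return out
```

Joining the returned list with newlines gives a plain-text transcript with one timestamped caption per line.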

I also don't see the option to download the captions. The two options available when I click on the three dots next to Like are:
Linked Groups/Channels
Add to Group/Channel

What am I doing wrong?

@khanr Hello, Teams doesn't save live captions; see "Use live captions in a Teams meeting - Office Support (microsoft.com)".

 

But this applies to live transcripts: "View live transcription in a Teams meeting - Office Support (microsoft.com)".

 

If you need to edit transcripts, that can only be done while the video is still in Stream (and not yet in the new storage location, OneDrive/SharePoint).

 

 

Hi there, the only option I have when I double click on the three ... is Disable auto scroll. I tried to copy the transcript (Ctrl+A, Ctrl+C). Even though all the text is highlighted, it only copies a small portion of the hour-long transcript. I'd appreciate ANY suggestions.

 

Thanks,

 

Rob

 
