Forum Discussion

SarahFabius
Brass Contributor
Jan 10, 2020

Download Transcripts

Does anyone know if it is possible to download the entire transcript for a video that has been put on Stream? I wanted to take the transcript and use it to create a short summary for a blog. 

  • Hi SarahFabius,

    Yes, you can download the caption file in the Stream admin portal. A screenshot is shown at this link:

    https://answers.microsoft.com/en-us/msoffice/forum/all/download-transcript-from-microsoft-stream/18fb51f5-3b11-49c6-b18a-44bf6fd524ea

     

    This will also help

     

    https://techcommunity.microsoft.com/t5/microsoft-stream-ideas/allow-export-of-transcript/idi-p/205468

     

    As Mark Mroz states, download the caption file from Stream (as in the first link), then run it through this web utility, which pulls out just the transcript text from the file (removing the time codes, metadata, and blank lines).

     

    https://aka.ms/StreamVTTCleaner


    Hope that answers your question!

    Best, Chris
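
For anyone who would rather do the cleanup locally, here is a minimal sketch of the same idea: keep only the caption text from a .vtt file and drop the header, NOTE blocks, cue identifiers, and time codes. This is not the code behind the web utility above; the function name `vttToText` is made up for illustration, and it assumes the file follows the usual WebVTT layout (blocks separated by blank lines, with a timing line containing `-->`).

```javascript
// Sketch: extract plain transcript text from the contents of a WebVTT file.
// Assumes standard WebVTT layout; vttToText is a hypothetical helper name.
function vttToText(vtt) {
  // Normalize line endings, then split into blocks separated by blank lines.
  const blocks = vtt.replace(/\r\n?/g, "\n").split(/\n\s*\n/);
  const texts = [];
  for (const block of blocks) {
    const lines = block
      .split("\n")
      .map(line => line.trim())
      .filter(line => line !== "");
    // A cue block has a timing line such as "00:00:01.000 --> 00:00:03.000".
    const timingIndex = lines.findIndex(line => line.includes("-->"));
    if (timingIndex === -1) continue; // WEBVTT header, NOTE, STYLE, etc.
    // Everything after the timing line is caption text.
    const payload = lines.slice(timingIndex + 1).join(" ");
    if (payload) texts.push(payload);
  }
  return texts.join(" ");
}
```

In Node, the input string could come from `fs.readFileSync(path, "utf8")`; in a browser console, the file contents can be pasted in as a template string.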

    • GmsDave
      Copper Contributor

      ChrisHoardMVP 

       

      This doesn't work for me - I don't see that link to download the captions.

      Likewise, I don't see the Show Transcript option under View Settings. I swear I had these options last week.

       

      • MarkTab
        Microsoft

        GmsDave I also cannot see the download link, but only for videos that other people shared with me. I can see the transcript for a shared video, but there is no place to download it.

        However, on my own content, I can choose "Update video details", and that panel shows how to download transcripts for videos I made.

    • xazos1979
      Copper Contributor

      ChrisHoardMVP Thanks Chris. Very useful. Now if only it put in newlines for sentences rather than the whole transcript on one line 😉..... although a quick find and replace on "." sorted it.
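
The find-and-replace trick can also be scripted. A rough sketch, assuming the transcript is one long string and that a sentence ends with ".", "!" or "?" followed by whitespace (the function name `oneSentencePerLine` is made up for illustration):

```javascript
// Sketch: put each sentence of a one-line transcript on its own line.
// Assumes sentences end with ".", "!" or "?" followed by whitespace.
function oneSentencePerLine(text) {
  return text
    .replace(/([.!?])\s+/g, "$1\n") // swap the gap after end punctuation for a newline
    .trim();
}
```

This will also break after abbreviations such as "e.g. ", so the result may still need a quick look by eye. Numbers like 3.5 are left alone because no whitespace follows the dot.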

  • NicoleReilly
    Copper Contributor

    I've got a customer who is getting this message when trying to download the .VTT file.  Is it as simple as creating an association with Notepad or similar? Or is there another Windows setting that needs to change?

     

    They want to download it so they can then run it through the web utility already mentioned on this thread.

    • Joanne_Roig
      Copper Contributor

      NicoleReilly I got this same error - have not seen a response to your raising this question. Would love to know what to do as I am currently not able to download the caption transcript...

      • NicoleReilly
        Copper Contributor

        Joanne_Roig I discovered that the files had downloaded despite the error message (I found them in the Downloads folder on the local machine), so I uploaded those to the web converter and it worked.

  • Is there a potential ETA on when a simple "Download transcript" button will be enabled? The current workarounds suggested below are nice hacks and provide neat things like "confidence" of speech-to-text but it would be nice to have a simple button just for this simple text. I'd also like to separately use the audio clips to determine who spoke when so that "Download transcript" would produce an easy-to-read file like:

    [12:20] Bob: good morning, everyone welcome to the meeting.
    [12:21] Alice: today we will speak about a new on Teams
    etc...

    Related:
    https://techcommunity.microsoft.com/t5/microsoft-stream-forum/how-do-i-download-the-transcript-and-determine-who-spoke-when/m-p/2279254
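
Until such a button exists, part of that layout can be approximated from a downloaded caption file. The sketch below keeps the start time of each cue and prints it in front of the text. It assumes standard WebVTT timing lines (`hh:mm:ss.ttt --> ...` or `mm:ss.ttt --> ...`); the function name `vttToTimestampedLines` is made up for illustration. Speaker names are left out because nothing in this thread shows the caption file carrying them, and the times are offsets into the video rather than clock times.

```javascript
// Sketch: turn WebVTT contents into "[start time] text" lines.
// Assumes standard WebVTT timing lines; no speaker information is used.
function vttToTimestampedLines(vtt) {
  // Captures the start time without milliseconds, e.g. "00:12:20".
  const timing = /^((?:\d{2,}:)?\d{2}:\d{2})\.\d{3}\s+-->/;
  const out = [];
  for (const block of vtt.replace(/\r\n?/g, "\n").split(/\n\s*\n/)) {
    const lines = block
      .split("\n")
      .map(line => line.trim())
      .filter(line => line !== "");
    const i = lines.findIndex(line => timing.test(line));
    if (i === -1) continue; // not a cue block (header, NOTE, ...)
    const start = lines[i].match(timing)[1];
    const text = lines.slice(i + 1).join(" ");
    if (text) out.push(`[${start}] ${text}`);
  }
  return out.join("\n");
}
```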

  • robgib40
    Copper Contributor

    Hi there, the only option I have when I click on the three dots (...) is "Disable auto scroll". I tried to copy the transcript (Ctrl+A, Ctrl+C). Even though all the text is highlighted, it only copies a small portion of the hour-long transcript. I'd appreciate ANY suggestions.

     

    Thanks,

     

    Rob

     

    SarahFabius 

  • vullnetdyla
    Copper Contributor

    So I found a way to extract the transcript from the API call that Stream makes to get it from the server. I had to do this because the video I wanted the transcript for was not mine, and I didn't have the option to go into the settings and Download Captions.

    I found that <urlToYourStreamVideo>/streamContent is the API call that responds with the JSON containing all the transcript data. Once I found this, I copied the JSON into my browser's console and did this:

    let asd = <paste your content here> // step 1: paste the JSON response here
    let ftw = asd.entries.map(obj => obj.text) // step 2: keep only the text of each entry
    ftw.join("\n") // step 3: join the entries, one per line

     

    I then had to figure out how to select that massive transcript by dragging and dropping. You might have a better way to do this depending on the browser you use.

     

    Once I had it all copied, I could then use it to get a decent summary from Microsoft Copilot and ChatGPT.
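
The same steps can be wrapped in a small function. This is only a sketch: it relies on the response shape described above (a top-level `entries` array whose items have a `text` field), which is an observation from this post rather than a documented API, and the function name `transcriptFromStreamContent` is made up for illustration.

```javascript
// Sketch: pull the transcript text out of the streamContent JSON.
// Assumes the shape described in this post: { entries: [{ text: "..." }, ...] }.
function transcriptFromStreamContent(data) {
  // Accept either the raw JSON string or an already parsed object.
  const parsed = typeof data === "string" ? JSON.parse(data) : data;
  const entries = parsed && Array.isArray(parsed.entries) ? parsed.entries : [];
  return entries
    .map(entry => entry.text)
    .filter(text => typeof text === "string" && text.trim() !== "")
    .join("\n"); // one entry per line
}
```

In browser consoles that provide the `copy()` helper (Chrome DevTools has one), `copy(transcriptFromStreamContent(asd))` should put the whole transcript on the clipboard, which avoids having to select it by hand.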
