It was recently brought to my attention that the transcript files in Steam had changed and therefore the code I previously created for extracting the text from these files no longer works (You can read my original posting and code here). As such I had another look and updated the code so that it would work with the new format.
The issue was that the new format includes additional lines of data which I needed to strip out plus also supports double and single line groups of text. It didn’t take too long to write a new macro which would support this new format.
You can see the new Macro code below:
Sub Macro1()
Dim introw As Integer
Dim intcount As Integer
‘Delete first 10 rows
For intcount = 1 To 5
Rows(1).EntireRow.Delete
Next
introw = 1
Do While Cells(introw + 1, 1).Value <> “”
‘ delete the five rows preceeding text
For intcount = 1 To 5
Rows(introw).EntireRow.Delete
Next
‘ deal with blocks of 2 or 1 line of text
If Cells(introw + 3, 1).Value <> “” Then
introw = introw + 2
Else
introw = introw + 1
End If
Loop
End Sub
If using the above take care in the way that WordPress converts the minus ( – ) character in my code to a similar looking character in the above. As such you may get a syntax error if copying and pasting. If so just delete and replace the minus with the correct character in your code. If you have any other issues with the above please let me know.


Have played around with Microsoft Stream a little recently but only briefly, when I spotted someone having issues exporting the automatically created transcripts. The issue is the transcripts are formatted with time codes and the person wanted only the text of the transcript without all of the timecodes. Removing manually was a pain but thankfully the format of the transcript files appeared uniform in nature…….time to roll out the Macro code and some VBA