Skip to content

Commit 3dc3482

Browse files
authored
Update README.md
1 parent 13ece34 commit 3dc3482

File tree

1 file changed

+13
-1
lines changed

1 file changed

+13
-1
lines changed

Codes/README.md

Lines changed: 13 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,11 +22,23 @@
2222
# To Run the unimodal Vision Based models
2323

2424
6.Vision+lstm_foldWise.py
25-
7. 3DCNN_withFolds.py
25+
7.3DCNN_withFolds.py
2626

2727
# To Run the Multimodal Model
2828
2929
9. MultiModalFusionModelfoldWise.py
3030

3131
# To extract all the video frames.
3232
frameExtract.py
33+
34+
# Extraction of transcript
35+
36+
The 'all__video_vosk_audioMap.p' has to be generated using the Vosk speech recognition toolkit(https://alphacephei.com/vosk/). The format of the file is in JSON format like the below:
37+
38+
{
39+
"video_name1": "transcript1",
40+
"video_name2": "transcript2",
41+
...
42+
"video_name3": "transcript3"
43+
}
44+

0 commit comments

Comments
 (0)