@@ -53,25 +53,25 @@ Run the data preparation pipeline from the project root to create domain-specifi
 
 **Training Data:**
 ```bash
-python apps/whisper_fine_tuning/src/core/data_prep/main_data_prep.py \
-  --source_data_dir apps/whisper_fine_tuning/data/raw/audios/matis \
-  --output_data_dir apps/whisper_fine_tuning/data/raw/samples/training \
+python apps/speech_to_text/src/core/data_prep/main_data_prep.py \
+  --source_data_dir apps/speech_to_text/data/raw/audios/matis \
+  --output_data_dir apps/speech_to_text/data/raw/samples/training \
   --domain train
 ```
 
 **Testing Data:**
 ```bash
-python apps/whisper_fine_tuning/src/core/data_prep/main_data_prep.py \
-  --source_data_dir apps/whisper_fine_tuning/data/raw/audios/matis \
-  --output_data_dir apps/whisper_fine_tuning/data/raw/samples/testing \
+python apps/speech_to_text/src/core/data_prep/main_data_prep.py \
+  --source_data_dir apps/speech_to_text/data/raw/audios/matis \
+  --output_data_dir apps/speech_to_text/data/raw/samples/testing \
   --domain test
 ```
 
 **Evaluation Data:**
 ```bash
-python apps/whisper_fine_tuning/src/core/data_prep/main_data_prep.py \
-  --source_data_dir apps/whisper_fine_tuning/data/raw/audios/matis \
-  --output_data_dir apps/whisper_fine_tuning/data/raw/samples/evaluation \
+python apps/speech_to_text/src/core/data_prep/main_data_prep.py \
+  --source_data_dir apps/speech_to_text/data/raw/audios/matis \
+  --output_data_dir apps/speech_to_text/data/raw/samples/evaluation \
   --domain evaluation
 ```
 
@@ -81,7 +81,7 @@ python apps/whisper_fine_tuning/src/core/data_prep/main_data_prep.py \
 After preparing your datasets, run the complete pipeline locally:
 
 ```bash
-python apps/whisper_fine_tuning/src/core/data_prep/main_silver_data_prep.py --train_datasets apps/whisper_fine_tuning/data/raw/samples/training --eval_datasets apps/whisper_fine_tuning/data/raw/samples/evaluation --test_datasets apps/whisper_fine_tuning/data/raw/samples/testing
+python apps/speech_to_text/src/core/data_prep/main_silver_data_prep.py --train_datasets apps/speech_to_text/data/raw/samples/training --eval_datasets apps/speech_to_text/data/raw/samples/evaluation --test_datasets apps/speech_to_text/data/raw/samples/testing
 ```
 
 This command will use the prepared datasets for the complete Whisper fine-tuning workflow.
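
Note on the change above: the three data-preparation commands differ only in the `--domain` value and the output folder name, so the rename could be kept in one place with a small wrapper. The following is a minimal sketch, not part of this diff. The names `APP_DIR`, `build_prep_cmd`, and `DRY_RUN` are hypothetical, and the script assumes it is run from the project root with the paths shown in the README. By default it only prints the commands.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Assumption: the application directory after the rename in this diff.
APP_DIR="apps/speech_to_text"

# Print the data-prep command for one domain / output-folder pair.
# Pairs follow the README: train -> training, test -> testing,
# evaluation -> evaluation.
build_prep_cmd() {
  local domain="$1" out_name="$2"
  printf 'python %s/src/core/data_prep/main_data_prep.py --source_data_dir %s/data/raw/audios/matis --output_data_dir %s/data/raw/samples/%s --domain %s\n' \
    "$APP_DIR" "$APP_DIR" "$APP_DIR" "$out_name" "$domain"
}

for pair in train:training test:testing evaluation:evaluation; do
  # Split "domain:folder" with parameter expansion.
  cmd="$(build_prep_cmd "${pair%%:*}" "${pair##*:}")"
  echo "$cmd"
  # Hypothetical switch: set DRY_RUN=0 to actually execute each command.
  if [ "${DRY_RUN:-1}" = "0" ]; then
    $cmd  # unquoted on purpose so the string splits into arguments
  fi
done
```

Running it with `DRY_RUN=0` from the project root would execute the three commands in sequence. The dry-run default makes it easy to check the generated paths against the README first.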