You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-4Lines changed: 5 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,21 +19,22 @@ Researchers in the field may use and build upon the proposed approach for variab
19
19
20
20
### Project Structure
21
21
22
-
1.`data`: Contains scripts related to data handling. This includes downloading raw data and creating the training and test datasets:
22
+
1.`data`: Contains scripts related to data handling. This includes downloading raw data and creating the training and test datasets (RQ1, RQ2 and RQ3):
23
23
-`SelectedRepositories.csv`: CSV file containing the repositories that were selected based on the criteria mentioned in the manuscript.
24
24
-`repos_download.py`: a script used to download repositories as zip files, extract their content, and copy all `.java` files in the specified directory.
25
25
-`dataset_creation.py`: converts raw data generated by `repos_download.py` into CSV files for training and testing, respectively.
26
26
-`training`: contains the training dataset stored in CSV files that we generated.
27
27
-`testing`: contains the CSV file of the testing dataset that we generated.
28
+
-`survey_data`: contains the Excel sheets of the raw survey responses and the aggregated version from which we generate the results of RQ3. The original survey can be accessed through this [link](https://forms.office.com/pages/responsepage.aspx?id=mRm4YH8LLUGSo-F9iunj4HbSH6eNn6hEr16DyJ7J0iVUMjMxV01ZNjZCME81NFUzVzhVUVZESE4yNS4u).
28
29
29
-
2.`training`: Contains scripts used to train the model using the method described in the manuscript:
30
+
2.`training`: Contains scripts used to train the model using the method described in the manuscript (RQ1 and RQ2):
30
31
-`variable_predictor.py`: script used to train the model.
31
32
32
-
3.`evaluation`: Contains scripts used to train the model using the method described in the manuscript:
33
+
3.`evaluation`: Contains scripts used to train the model using the method described in the manuscript (RQ1 and RQ2):
33
34
-`model_eval.py`: script used to evaluate the model.
34
35
-`ChatGPT_identifiers.py`: script used to run experiments that involve GPT4 mentioned in the manuscript.
35
36
36
-
4.`survey_codes`: Contains the code snippets that were used in the survey described in the manuscript.
37
+
4.`survey_codes`: Contains the code snippets that were used in the survey described in the manuscript (RQ3).
37
38
38
39
5.`figures`: Contains figures that are displayed in the README.
0 commit comments