Skip to content

Commit 36b988d

Browse files
committed
add survey data
1 parent 8fca916 commit 36b988d

File tree

3 files changed

+5
-4
lines changed

3 files changed

+5
-4
lines changed

README.md

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -19,21 +19,22 @@ Researchers in the field may use and build upon the proposed approach for variab
1919

2020
### Project Structure
2121

22-
1. `data`: Contains scripts related to data handling. This includes downloading raw data and creating the training and test datasets:
22+
1. `data`: Contains scripts related to data handling. This includes downloading raw data and creating the training and test datasets (RQ1, RQ2 and RQ3):
2323
- `SelectedRepositories.csv`: CSV file containing the repositories that were selected based on the criteria mentioned in the manuscript.
2424
- `repos_download.py`: a script used to download repositories as zip files, extract their content, and copy all `.java` files in the specified directory.
2525
- `dataset_creation.py`: converts raw data generated by `repos_download.py` into CSV files for training and testing, respectively.
2626
- `training`: contains the training dataset stored in CSV files that we generated.
2727
- `testing`: contains the CSV file of the testing dataset that we generated.
28+
- `survey_data`: contains the Excel sheets of the raw survey responses and the aggregated version from which we generate the results of RQ3. The original survey can be accessed through this [link](https://forms.office.com/pages/responsepage.aspx?id=mRm4YH8LLUGSo-F9iunj4HbSH6eNn6hEr16DyJ7J0iVUMjMxV01ZNjZCME81NFUzVzhVUVZESE4yNS4u).
2829

29-
2. `training`: Contains scripts used to train the model using the method described in the manuscript:
30+
2. `training`: Contains scripts used to train the model using the method described in the manuscript (RQ1 and RQ2):
3031
- `variable_predictor.py`: script used to train the model.
3132

32-
3. `evaluation`: Contains scripts used to train the model using the method described in the manuscript:
33+
3. `evaluation`: Contains scripts used to train the model using the method described in the manuscript (RQ1 and RQ2):
3334
- `model_eval.py`: script used to evaluate the model.
3435
- `ChatGPT_identifiers.py`: script used to run experiments that involve GPT4 mentioned in the manuscript.
3536

36-
4. `survey_codes`: Contains the code snippets that were used in the survey described in the manuscript.
37+
4. `survey_codes`: Contains the code snippets that were used in the survey described in the manuscript (RQ3).
3738

3839
5. `figures`: Contains figures that are displayed in the README.
3940

205 KB
Binary file not shown.
27.9 KB
Binary file not shown.

0 commit comments

Comments
 (0)