Empirical Assessment of the Effort Needed to Attack Programs Protected with Obfuscation

This repository contains the questionnaires, their results, and our analysis on an experiment we did to empirically prove that software obfuscations slow down the reverse engineering of attackers.

We distributed the C source code of different applications obfuscated with Tigress (using control flow flattening and opaque predicates) to 152 students in the second year of the MSc in Computer Engineering at the Politecnico di Torino, one of the most renowned technical universities in Italy. In each of these applications we injected a simple but prominent bug and asked the students to fix during a hackathon of sort. We timed the students and asked them to fill several questionnaires describing their experience.

Our full discussion about the experiment procedure and its results analysis can be found in our paper Empirical Assessment of the Effort Needed to Attack Programs Protected with Obfuscation, downloadable from DOI when available.

Execution

The experiment was conducted in two consecutive steps: a remote stage, where the students were tasked to solve some programming tasks at home, and a live stage, where the students were gathered into a controlled environment ( classrooms) and tried to crack some obfuscated applications.

Remote stage

In this stage, the students were contacted primarily via email and asked to fill some Google Forms to assess their programming experience and knowledge of the C language.

In particular, we followed the next work-flow:

We asked the students if they wanted to participate using the questionary reported in 1-participation.md.
We gathered all the positive answers and asked the participating students to analyze the code check.c and fill the questionnaire 2-check.md. This exercise required the student to compile and debug an application to extract a hidden value contained into an obfuscated variable. The goal of this step was to assess the ability of the students to effectively analyze a process dynamically.
We also sent the students the questionnaire 3-preliminary.md to test their expertise and ability on the C programming language.

Live stage

During this step, the students were gathered into classrooms and tasked to do the following:

First, we asked the students to fill the questionnaire 4-live.md to prove their C programming language skills, and comparing these under stress results with the answers we got from the remote stage.
We asked the students to crack three differently obfuscated applications (arithmetic, number, and tictactoe) into three consecutive rounds with a timeout. The distributed application source codes are available at 5-task-1-code, 5-task-2-code, and 5-task-3-code. After a timeout, each student had to fill the questionnaire 5-task.md, which asked to describe how they cracked the application, and to upload the modified source code and a screenshot of the fixed application. To help the students in fill the form, we sent them also also 5-attack-steps.pdf, which contains a list of attack steps to be used in the questionnaire.

Responses

We used Google Form to both distribute and collect the questionnaire results. We provide here, in the folder responses, all the student answers in CSV format and also the submitted source code.

Questionnaire answers

To keep the privacy of the students intact, we have removed all the personal information (e.g., names, email addresses, university identification numbers) and replaced them with an anonymous identifier named id in the CSV tables.

In particular, these are the answers to the questionnaires:

We omit the answers to 1-participation.md for privacy reasons.
We do not include the responses to 2-check.md either since all the students correctly reported the hidden value.
The answers to 3-preliminary.md are available at 3-preliminary.csv
The responses to 4-live.md are instead downloadable from 4-live.csv.
5-task-1.csv, 5-task-2.csv, and 5-task-3.csv respectively contains the results of the questionnaire 5-task.md for the tasks 1, 2, and 3 executed in the classrooms.

Submitted source code

The folders 5-task-1-code, 5-task-2-code, and 5-task-3-code contains the submitted modified source code per each task.

The file names in each folder follow this schema <application>-<id>.c where <application> is the cracked application name (arithmetic, number, or tictactoe) and <id> is the anonymous identifier of the student.

Note that some students sent files that contains syntax errors, so not all of them are compilable.

Analysis

The analysis folder contains the files we used to perform our analysis.

Code metrics

The file code-metrics.csv contains several complexity metrics of the obfuscated applications distributed to the students. We used Frama-C to compute these values.

Cleaned responses

We took the raw student CSV responses, available at responses, and cleaned them up by removing some unnecessary columns. In particular:

analysis/3-preliminary.csv is the cleaned version of responses/3-preliminary.csv.
analysis/4-live.csv is the cleaned version of responses/4-live.csv.
tasks.csv contains the aggregated and cleaned data related to the three task questionnaires responses.

Responses analysis

The file analysis.rmd is our R MarkDown file used to compute the statistics explained in our paper.

Miscellaneous

This section contains some additional miscellaneous information.

Obfuscating `check.c`

The original vanilla code for the home test exercise check.c is available at check.c.

To generate the obfuscated version distributed to the students, we used the following procedure:

We obfuscated the vanilla code with Tigress by using the following options:

--Transform=EncodeLiterals --EncodeLiteralsKinds=string --EncodeLiteralsEncoderName=encoder --Functions=* --Transform=EncodeData --GlobalVariables='spyme' --Transform=EncodeArithmetic --Functions=* --Transform=Split --Functions=check
We complied checksum.c and executed it to compute the source code checksum of the obfuscated check.c.
We updated the original vanilla source code with the obtained checksum.
We executed Tigress again with the same metrics.

We decided to install a checksum computation in the application to avoid the students manipulating its source code.

The vanilla task applications

The original vanilla source code of our three task applications are:

arithmetic. and number.c were taken from the bsdgames in the Debian repositories, while tictactoe is an exercise from the book C: The Complete Reference, 4th Ed. (Paperback) by Herbert Schildt.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
analysis		analysis
execution		execution
originals		originals
responses		responses
LICENSE		LICENSE
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Empirical Assessment of the Effort Needed to Attack Programs Protected with Obfuscation

Execution

Remote stage

Live stage

Responses

Questionnaire answers

Submitted source code

Analysis

Code metrics

Cleaned responses

Responses analysis

Miscellaneous

Obfuscating `check.c`

The vanilla task applications

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Empirical Assessment of the Effort Needed to Attack Programs Protected with Obfuscation

Execution

Remote stage

Live stage

Responses

Questionnaire answers

Submitted source code

Analysis

Code metrics

Cleaned responses

Responses analysis

Miscellaneous

Obfuscating check.c

The vanilla task applications

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Obfuscating `check.c`

Packages