Project 1: Application SecuritySpring 2026

Part 1 due Thursday, January 22 at 11:59 p.m.
Part 2 due Thursday, February 5 at 11:59 p.m.

This project counts for 8% of your course grade. Late submissions will be penalized by 15% of the maximum attainable score. If you or your partner (if you’re working in a team) have a conflict due to travel, interviews, etc., please plan accordingly and turn in your project early.

This is optionally a group project; you may work in teams of two and submit one project per team. You may also work alone. Note that the exams will cover project material, so you and your partner should collaborate closely on each part.

You must remain with the same partner for all parts of a specific project. Switching partners or changing from individual to group (and vice versa) is not allowed.

The code and other answers you submit must be entirely your team’s own work, and you are bound by the Honor Code. You may discuss the conceptualization of the project and the meaning of the questions, but you may not look at any part of someone else’s solution or collaborate with anyone other than your partner. You may consult published references, provided that you appropriately cite them (e.g., with program comments).

Solutions must be submitted via the Autograder, following the submission details at the end of this spec.

Introduction

This project will introduce you to control-flow hijacking vulnerabilities in application software, including buffer overflows. We will provide a series of vulnerable programs and a virtual machine environment in which you will develop exploits.

Objectives

Be able to identify and avoid buffer overflow vulnerabilities in native code.
Understand the severity of buffer overflows and the necessity of standard defenses.
Gain familiarity with machine architecture and assembly language.
Understand the mechanics of buffer overflow exploitation.

Ethics

This project asks you to develop attacks and test them in a virtual machine you control. Attempting the same kinds of attacks against others’ systems without authorization is prohibited by law and university policies and may result in fines, expulsion, and jail time. You must not attack anyone else’s system without authorization! Per the course ethics policy, you are required to respect the privacy and property rights of others at all times, or else you will fail the course. See the “Course Policies” section on the course website.

Setup

Buffer-overflow exploitation depends on details of the target system. You must develop and test your attacks inside the Application Security Project VM, as it has been configured to disable certain security features that would complicate your work.

Follow the setup instructions on the Application Security Project VM Setup Guide page. It may take a few minutes for the VM to start, so you may not be able to ssh in immediately.
Create a new private repository from the Github template. Clone your repository from GitHub inside the VM. You must do this in a folder in the native Linux filesystem. It won’t work correctly if you use a shared folder located in the host OS.

Note, this repository is a different one from the one used in the lab. Do not try to use the same repository from Lab 1 for Project 1.

If you’re new to using Git/GitHub, check out this guide to help you get started.
Run ./build.sh. It will prompt you for GT email-ids and build the targets and the cookie file. Each group’s targets will be slightly different, so make sure your email-ids are correct! ./build.sh will only run once you or your team has signed up on the autograder. Please do not try to compile the code in any other way.
Run ./test.sh to test the (currently empty) solutions. The test script will report an error for each of the targets. You can ignore the errors as it is the first time you’re running it and your solutions are blank. ./test.sh is designed for preliminary checks and may not cover all possible test cases. Please submit your solutions to the autograder as you solve.

Guidelines

The target programs for this project are simple, short C programs with (mostly) clear security vulnerabilities. We have provided source code and a build script that compiles all the targets and creates your unique cookie file. Your solutions will be in the form of python scripts that output your exploits. They must work against the targets as compiled and executed within the provided VM.

No attack tools allowed!

Except where specifically noted, you may not use special-purpose tools meant for testing security or exploiting vulnerabilities. You must complete the project using only general purpose tools, such as gdb.

No external (non-standard) libraries allowed!

Your solutions can make use of standard Python 3 libraries and the provided shellcode.py, but they must be otherwise self-contained. Be sure to test that your solutions work correctly in an unmodified copy of the provided VM, without installing or updating any packages or changing any environment variables.

Resources

Primer Videos (Important!)

Before you begin the project, ensure that you watch the following videos. They contain many useful tips to help you along the project.

You can find the videos’ slides here.

You may also find the Wikipedia page on x86 Calling Convention quite useful.

Control hijacking tutorials

Review the slides from the control-hijacking lectures and complete the lab. Read “Smashing the Stack for Fun and Profit,” available here.

GDB

You will make use of the GDB debugger for dynamic analysis within the VM. Useful commands that you may not know are disassemble, info reg, x, and stepi. See the GDB help for details, and don’t be afraid to experiment. This quick reference may also be useful: https://users.ece.utexas.edu/~adnan/gdb-refcard.pdf

x86_64 assembly

These are many good references for Intel’s assembly language, but note that our project targets use the 64-bit x86_64 ISA (sometimes abbreviated to x64), not the older 32-bit x86 ISA. The stack is organized differently in x86_64 and x86. If you are reading any online documentation, ensure that it is based on the x86_64 architecture, not x86.

Also note that there are 2 different syntaxes for this assembly language, known as Intel syntax and AT&T syntax. They’re just 2 ways of expressing the same code. In this class we’re always using Intel syntax, but keep in mind that online resources might be using AT&T syntax. You can tell which is which because AT&T syntax uses percent signs (%) everywhere and Intel syntax doesn’t.

Part 1

The targets described below must be submitted as part of the first milestone.

target0: Overwriting a variable on the stack 7 pts

This program takes input from a file and prints a message. Your job is to provide input that causes the program to output: “Hi username! Your grade is A+.” (You can use any name for the username). To accomplish this, your input will need to overwrite another variable stored on the stack.

The read_input(char *destination, char *filename) function can be found in read_input.c. It opens the file given to it, then copies the entire contents of the file into memory starting at the address where destination points.

Here’s one approach you might take:

Examine target0.c. Where is the buffer overflow?
Set up the target in GDB:
```
$ gdb ./target0
```
Disassemble _main (not to be confused with main). What is its starting address?
Set a breakpoint at the beginning of _main and run the program.
Using GDB from within the VM, set a breakpoint at the beginning of _main and run the program.
```
(gdb) break _main
(gdb) run <(python3 sol0.py)
```

You may notice that target0.c seems to expect a file as an input. So why aren’t we passing one in? Well, <(python3 sol0.py) is a pipe, and it actually creates a temporary file with the output of python3 sol0.py and sends the name of this file to the target. Keep in mind this is how we pass input for all of the targets in this assignment.

Draw a picture of the stack. How are name[] and grade[] stored relative to each other?
How could a value read into name[] affect the value contained in grade[]? Test your hypothesis by running ./target0 on the command line with different inputs.

What to submit

Create a Python 3 program named sol0.py that prints a line to be passed as input to the target. Test your program with the command line:

$ ./target0 <(python3 sol0.py)

The above is a little shell trick. It runs python3 sol0.py, then saves the output to a “temporary file”. It then uses the name of that temporary file as a command-line argument for ./target0.

Hint: In Python 3, you should work with bytes rather than Unicode strings. To construct a byte literal, use this syntax: b"\xnn", where nn is a 2-digit hex value. To repeat a byte n times, you can do: b"\xnn"*n. To output a sequence of bytes, use:

import sys
sys.stdout.buffer.write(b"\x61\x62\x63")

Don’t use print(), because it automatically encodes whatever is being printed with the default encoding of the console. We don’t want our payload to be encoded, so we use sys.stdout.buffer.write().

target1: Overwriting the return address 7 pts

This program takes input from a file and prints a message. Your job is to provide input that makes it output: “Your grade is A+.” Your input will need to overwrite the return address so that the function vulnerable() transfers control to print_good_grade() when it returns.

Examine target1.c. Where is the buffer overflow?
Examine the function print_good_grade. What is its starting address?
Using GDB from within the VM, set a breakpoint at the beginning of vulnerable and run the program.
```
(gdb) break vulnerable
(gdb) run <(python3 sol1.py)
```
Disassemble vulnerable and draw the stack. Where is input[] stored relative to rbp? How long would an input have to be to overwrite this value and the return address?

When looking at the stack contents via gdb, you will notice that there are many null bytes added between rbp and where the first variable (input[]) is stored. This is generally done by the compiler to ensure that rsp is a multiple of 16 bytes.

Examine the rsp and rbp registers:
```
(gdb) info reg
```
What are the current values of the saved frame pointer and return address from the stack frame? You can examine two giant words (8 bytes each) of memory at rbp using:
```
(gdb) x/2gx $rbp
```
What should these values be in order to redirect control to the desired function?

What to submit

Create a Python 3 program named sol1.py that prints a line to be passed as input to the target. Test your program with the command line:

$ ./target1 <(python3 sol1.py)

When debugging your program, it may be helpful to view a hex dump of the output. Try this:

$ python3 sol1.py | hd

Remember that x86_64 is little endian. Use Python’s to_bytes method to output 64-bit little-endian values like so:

import sys
sys.stdout.buffer.write(0x0123456789abcedf.to_bytes(8, "little"))

target2: Redirecting control to shellcode 7 pts

Targets 2 through 7 are owned by the root user and have the suid bit set. Your goal is to cause them to launch a shell, which will therefore have root privileges. Unless otherwise noted, you should use the shellcode we have provided in shellcode.py. Successfully placing this shellcode in memory and setting the instruction pointer to the beginning of the shellcode (e.g., by returning or jumping to it) will open a shell.

Examine target2.c. Where is the buffer overflow?

Create a Python 3 program named sol2.py that outputs the provided shellcode:

from shellcode import shellcode
import sys
sys.stdout.buffer.write(shellcode)

Disassemble vulnerable. Where does buf begin relative to rbp? What is the offset from the start of the shellcode to the saved return address?
Set up the target in GDB:
```
$ gdb ./target2
```
Set a breakpoint in vulnerable and start the target.
Identify the address after the call to read_input and set a breakpoint there:
```
(gdb) break *<address>
```
Run the program. It will stop it reaches that breakpoint.
```
(gdb) run <(python3 sol2.py)
```
Examine the bytes of memory where you think the shellcode is to confirm your calculation:
```
(gdb) x/32bx 0x<address>
```
Disassemble the shellcode:
```
(gdb) disas/r 0x<address>,+32
```
How does it work?
Modify your solution to overwrite the return address and cause it to jump to the beginning of the shellcode.

What to submit

Create a Python 3 program named sol2.py that prints a line to be used as the command-line argument to the target. Test your program with the command line:

$ ./target2 <(python3 sol2.py)

If you are successful, you will see a root shell prompt (#). Running whoami will output “root”. Running exit will return to your normal shell.

If your program segfaults, you can examine the state at the time of the crash using GDB with the core dump: gdb ./target2 core. To enable creating core dumps, run ulimit -c unlimited. The file core won’t be created if a file with the same name already exists. Also, since the target runs as root, you will need to run it using sudo ./target2 in order for the core dump to be created.

target3: Overwriting the return address indirectly 16 pts

In this target, the buffer overflow is restricted and cannot directly overwrite the return address. You’ll need to find another way. Your input should cause the provided shellcode to execute and open a root shell.

The read_input_with_limit(char *destination, char *filename, size_t limit) function can be found in read_input.c. It opens the file given to it, then copies the contents of the file into memory starting at the address where destination points. However, it will only copy the first limit bytes, even if the file is longer than that.

What to submit

Create a Python 3 program named sol3.py that prints a line to be used as the command-line argument to the target. Test your program with the command line:

$ ./target3 <(python3 sol3.py)

Part 2

The targets described below must be submitted as part of the second milestone.

target4: Beyond strings 16 pts

This target takes as its input the name of a 64-bit count followed by that many 32-bit integers (all little endian).

Hint: First figure out how an attacker can cause a buffer overflow in this program. Note that the read_elements function breaks the for-loop once the end of the file is reached, so the 64-bit count does not need to be truthful. Is there anything that might happen if count is really large?

What to submit

Create a Python 3 program named sol4.py that outputs input in the format described above to be read by the target. Test your program with the command line:

$ ./target4 <(python3 sol4.py)

target5: Bypassing DEP 16 pts

This program resembles target2, but it has been compiled with data execution prevention (DEP) enabled. DEP means that the processor will refuse to execute instructions stored on the stack. You can overflow the stack and modify values like the return address, but you can’t jump to any shellcode you inject. You need to find another way to run the command /bin/sh and open a root shell.

Hint: You do not need to use shellcode.py for this target.

What to submit

Create a Python 3 program named sol5.py that prints a line to be used as the command-line argument to the target. Test your program with the command line:

$ ./target5 <(python3 sol5.py)

For this target, it’s acceptable if the program segfaults after the root shell is closed.

Warning: Do not try to create a solution that depends on you manually setting environment variables. You cannot assume that the autograder will run your solution with the same environment variables that you have set.

target6: Variable stack position 16 pts

When we constructed the previous targets, we ensured that the stack would be in the same position every time the vulnerable function was called, but this is often not the case in real targets. In fact, a defense called ASLR (address-space layout randomization) makes buffer overflows harder to exploit by changing the starting location of the stack and other memory areas on each execution. This target resembles target2, but the stack position is randomly offset by 0–256 bytes each time it runs. You need to construct an input that always opens a root shell despite this randomization.

What to submit

Create a Python 3 program named sol6.py that prints a line to be used as the command-line argument to the target. Test your program with the command line:

$ ./target6 <(python3 sol6.py)

Warning: If you see any output before the root shell is opened, you have not done this target correctly and your solution will not be accepted by the autograder.

target7: Return-oriented programming 15 pts

This target is identical to target2, but it is compiled with DEP enabled. Implement a ROP-based attack to bypass DEP and open a root shell.

It will be helpful to use a tool such as ROPgadget. The ROPgadget command is already installed on the provided VM. View its usage by running ROPgadget -h. The --binary, and --multibr flags will be particularly helpful. (You are also free to use the --ropchain flag, but it often produces relatively unintelligible results. If you’re having trouble interpreting and extending its output, we’d recommend against using it. We aren’t able to vouch for the tool’s decisions when using --ropchain.)

Though there are a number of ways you could implement a ROP exploit, for this target you should use the setuid syscall to become root, followed by the execve syscall to run the /bin/sh binary. This is equivalent to:
```
setuid(0);
execve("/bin/sh", 0, 0);
```
syscall is the assembly instruction for making a syscall. In order specify which kind of syscall you want to make (and what arguments you want to make it with), you have to set certain registers to specific values. Consult this table for a reference.
You may want to look at the assembly in shellcode.py for some inspiration. You probably won’t be able to find gadgets that work exactly the same way as this shellcode, but it should be a good starting point.
We recommend that you start by getting the execve call to work on its own, without setuid. When you do this correctly, it will open a shell, but you won’t be root. Then modify your solution to make it call setuid first, and you’ll get a root shell.

What to submit

Create a Python 3 program named sol7.py that prints a line to be used as the command-line argument to the target. Test your program with the command line:

$ ./target7 <(python3 sol7.py)

Troubleshooting

Help, I’m getting a segfault!

A segfault most likely means that you’re either jumping execution to or dereferencing an address that is incorrect. This means you’re on the right track because you’ve overwritten something! If you are stuck as to where to start looking, check the addresses that your exploit has, and make sure they are both correct and in the correct place.

I get a root shell when I run `sudo ./test.sh`. Am I done?

No! You should only run test.sh without sudo. If you run it under sudo, then your shells will always be spawned as the root user, whether you have accomplished the task of opening a root shell or not. If you previously ran test.sh with sudo, you might get permission errors from Git when you run the test script without sudo. To fix this, run the following in the root directory of your local repository:

$ sudo chown -R cs3235:cs3235 .git

My solution works in GDB but not from the command line.

The most likely explanation is that you’re referencing data from argv[]. Since argv[] comes from outside of _main‘s stack frame, its position can vary depending on the size of the environment and arguments, which can be slightly different when running under gdb. The best solution is to find the data you need in the stack frame of the vulnerable function, rather than from argv[].

The files I want to submit are on my VM. How do I upload them to the autograder?

If the files that you want to submit are in the VM and you’re not sure how to upload it to the autograder, follow these steps.

Submission Details

Visit the project on the autograder for either part 1 or 2, and optionally create a team.

Project 1 (Part 1) Autograder

Project 1 (Part 2) Autograder
Ensure the following items are completed and submitted by the deadline (cookie is generated by ./build.sh, please do not change it):

Part 1 (Due Thursday, January 22, 11:59 PM)

`cookie`	The cookie generated by ./build.sh
`sol0.py`	Overwriting a variable on the stack 7 pts
`sol1.py`	Overwriting the return address 7 pts
`sol2.py`	Redirecting control to shellcode 7 pts
`sol3.py`	Overwriting the return address indirectly 16 pts

Part 2 (Due Thursday, February 5, 11:59 PM)

`cookie`	The cookie generated by ./build.sh
`sol4.py`	Beyond strings 16 pts
`sol5.py`	Bypassing DEP 16 pts
`sol6.py`	Variable stack position 16 pts
`sol7.py`	Return-oriented programming 15 pts

You can submit solutions up to 20 times per day. If this limit is exceeded, you will have to wait till 12:00 AM the next day to submit again.

Project 1: Application SecuritySpring 2026

Introduction

Objectives

Ethics

Setup

Guidelines

No attack tools allowed!

No external (non-standard) libraries allowed!

Resources

Primer Videos (Important!)

Control hijacking tutorials

GDB

x86_64 assembly

Part 1

target0: Overwriting a variable on the stack 7 pts

What to submit

target1: Overwriting the return address 7 pts

What to submit

target2: Redirecting control to shellcode 7 pts

What to submit

target3: Overwriting the return address indirectly 16 pts

What to submit

Part 2

target4: Beyond strings 16 pts

What to submit

target5: Bypassing DEP 16 pts

What to submit

target6: Variable stack position 16 pts

What to submit

target7: Return-oriented programming 15 pts

What to submit

Troubleshooting

Help, I’m getting a segfault!

I get a root shell when I run sudo ./test.sh. Am I done?

My solution works in GDB but not from the command line.

The files I want to submit are on my VM. How do I upload them to the autograder?

Submission Details

Part 1 (Due Thursday, January 22, 11:59 PM)

Part 2 (Due Thursday, February 5, 11:59 PM)

I get a root shell when I run `sudo ./test.sh`. Am I done?