README.md 8.8 KB

Fuzzing Dulwich

Fuzzing Status

This directory contains files related to Dulwich's suite of fuzz tests that are executed daily on automated infrastructure provided by OSS-Fuzz. This document aims to provide necessary information for working with fuzzing in Dulwich.

The latest details regarding OSS-Fuzz test status, including build logs and coverage reports, is available on the Open Source Fuzzing Introspection website.

How to Contribute

There are many ways to contribute to Dulwich's fuzzing efforts! Contributions are welcomed through issues, discussions, or pull requests on this repository.

Areas that are particularly appreciated include:

  • Tackling the existing backlog of open issues. While fuzzing is an effective way to identify bugs, that information isn't useful unless they are fixed. If you are not sure where to start, the issues tab is a great place to get ideas!
  • Improvements to this (or other) documentation make it easier for new contributors to get involved, so even small improvements can have a large impact over time. If you see something that could be made easier by a documentation update of any size, please consider suggesting it!

For everything else, such as expanding test coverage, optimizing test performance, or enhancing error detection capabilities, jump into the "Getting Started" section below.

Getting Started with Fuzzing Dulwich

[!TIP] New to fuzzing or unfamiliar with OSS-Fuzz?

These resources are an excellent place to start:

Setting Up Your Local Environment

Before contributing to fuzzing efforts, ensure Python and Docker are installed on your machine. Docker is required for running fuzzers in containers provided by OSS-Fuzz. Install Docker following the official guide if you do not already have it.

Understanding Existing Fuzz Targets

Review the fuzz-targets/ directory to familiarize yourself with how existing tests are implemented. See the Files & Directories Overview for more details on the directory structure.

Contributing to Fuzz Tests

Start by reviewing the Atheris documentation and the section on Running Fuzzers Locally to begin writing or improving fuzz tests.

Files & Directories Overview

The fuzzing/ directory is organized into three key areas:

Fuzz Targets (fuzz-targets/)

Contains Python files for each fuzz test.

Things to Know:

  • Each fuzz test targets a specific part of Dulwich's functionality.
  • Test files adhere to the naming convention: fuzz_<API Under Test>.py, where <API Under Test> indicates the functionality targeted by the test.
  • Any functionality that involves performing operations on input data is a possible candidate for fuzz testing, but features that involve processing untrusted user input or parsing operations are typically going to be the most interesting.
  • The goal of these tests is to identify previously unknown or unexpected error cases caused by a given input. For that reason, fuzz tests should gracefully handle anticipated exception cases with a try/except block to avoid false positives that halt the fuzzing engine.

Dictionaries (dictionaries/)

Provides hints to the fuzzing engine about inputs that might trigger unique code paths. Each fuzz target may have a corresponding .dict file. For information about dictionary syntax, refer to the LibFuzzer documentation on the subject.

Things to Know:

  • OSS-Fuzz loads dictionary files per fuzz target if one exists with the same name, all others are ignored.
  • Most entries in the dictionary files found here are escaped byte values that were recommended by the fuzzing engine after previous runs.
  • A default set of dictionary entries are created for all fuzz targets as part of the build process, regardless of an existing file here.
  • Development or updates to dictionaries should reflect the varied formats and edge cases relevant to the functionalities under test.
  • Example dictionaries (some of which are used to build the default dictionaries mentioned above) can be found here:

OSS-Fuzz Scripts (oss-fuzz-scripts/)

Includes scripts for building and integrating fuzz targets with OSS-Fuzz:

  • container-environment-bootstrap.sh - Sets up the execution environment. It is responsible for fetching default dictionary entries and ensuring all required build dependencies are installed and up-to-date.
  • build.sh - Executed within the Docker container, this script builds fuzz targets with necessary instrumentation and prepares seed corpora and dictionaries for use.

Where to learn more:

Running Fuzzers Locally

This approach uses Docker images provided by OSS-Fuzz for building and running fuzz tests locally. It offers comprehensive features but requires a local clone of the OSS-Fuzz repository and sufficient disk space for Docker containers.

Build the Execution Environment

Clone the OSS-Fuzz repository and prepare the Docker environment:

git clone --depth 1 https://github.com/google/oss-fuzz.git oss-fuzz
cd oss-fuzz
python infra/helper.py build_image dulwich
python infra/helper.py build_fuzzers --sanitizer address dulwich

[!TIP] The build_fuzzers command above accepts a local file path pointing to your Dulwich repository clone as the last argument. This makes it easy to build fuzz targets you are developing locally in this repository without changing anything in the OSS-Fuzz repo! For example, if you have cloned this repository (or a fork of it) into: ~/code/dulwich Then running this command would build new or modified fuzz targets using the ~/code/dulwich/fuzzing/fuzz-targets directory:

> python infra/helper.py build_fuzzers --sanitizer address dulwich ~/code/dulwich
> ```

Verify the build of your fuzzers with the optional `check_build` command:

shell python infra/helper.py check_build dulwich


### Run a Fuzz Target

Setting an environment variable for the fuzz target argument of the execution command makes it easier to quickly select
a different target between runs:

shell

specify the fuzz target without the .py extension:

export FUZZ_TARGET=fuzz_configfile


Execute the desired fuzz target:

shell python infra/helper.py run_fuzzer dulwich $FUZZ_TARGET -- -max_total_time=60 -print_final_stats=1 ```

[!TIP] In the example above, the "-- -max_total_time=60 -print_final_stats=1" portion of the command is optional but quite useful.

Every argument provided after "--" in the above command is passed to the fuzzing engine directly. In this case:

  • -max_total_time=60 tells the LibFuzzer to stop execution after 60 seconds have elapsed.
  • -print_final_stats=1 tells the LibFuzzer to print a summary of useful metrics about the target run upon completion.

But almost any LibFuzzer option listed in the documentation should work as well.

Next Steps

For detailed instructions on advanced features like reproducing OSS-Fuzz issues or using the Fuzz Introspector, refer to the official OSS-Fuzz documentation.