---
layout: default
title: Ideal integration
parent: Advanced topics
nav_order: 1
permalink: /advanced-topics/ideal-integration
---

# Ideal integration with OSS-Fuzz 
OSS projects have different build and test systems, so we cannot expect them
to have a unified way of implementing and maintaining fuzz targets and
integrating them with OSS-Fuzz. However, we will still try to give
recommendations on the preferred ways.

Here are several features (starting from the easiest) that will make automated
fuzzing simple and efficient, and will let you catch regressions early in
the development cycle.

- TOC
{:toc}
---

## TL;DR
Every [fuzz target](http://libfuzzer.info/#fuzz-target):
* Is [maintained by code owners](#fuzz-target) in their RCS (Git, SVN, etc).
* Is [built with the rest of the tests](#build-support) - no bit rot! 
* Has a [seed corpus](#seed-corpus) with good [code coverage](#coverage).
* Has a [dictionary](#dictionary), if applicable.
* Is [continuously tested on the seed corpus](#regression-testing) with
  [ASan/UBSan/MSan](https://github.com/google/sanitizers).
* Is [fast and has no OOMs](#performance).

## Fuzz Target
The code of the [fuzz target(s)](http://libfuzzer.info/#fuzz-target) should be
part of the project's source code repository. 
All fuzz targets should be easily discoverable (e.g. reside in the same
directory, or follow the same naming pattern, etc). 

This makes it easy to maintain the fuzzers and minimizes breakages that can
arise as source code changes over time.

Make sure to fuzz the target locally for a short period of time to ensure that
it does not crash, hang, or run out of memory instantly.
You can read more about what makes a good fuzz target
[here](https://github.com/google/fuzzing/blob/master/docs/good-fuzz-target.md).

The interface between the [fuzz target](http://libfuzzer.info/#fuzz-target)
and the fuzzing engines is C, so you may use C or C++ to implement the fuzz target.
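
As an illustration of that C interface, here is a minimal sketch of a fuzz
target. `my_api_parse` is a hypothetical stand-in for a project function and is
defined inline only so the sketch compiles on its own; a real target would call
the project's own API instead.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* HYPOTHETICAL function under test: returns 1 if the NUL-terminated text
 * looks like "key=value" with a non-empty key, 0 otherwise. */
static int my_api_parse(const char *text) {
  const char *eq = strchr(text, '=');
  return eq != NULL && eq != text;
}

/* Entry point that the fuzzing engine calls once per generated input. */
int LLVMFuzzerTestOneInput(const uint8_t *data, size_t size) {
  /* The input is not NUL-terminated, so copy it before handing it to an
   * API that expects a C string. */
  char *text = malloc(size + 1);
  if (text == NULL) return 0;
  if (size > 0) memcpy(text, data, size);
  text[size] = '\0';

  (void)my_api_parse(text);

  free(text);
  return 0;
}
```

The target keeps no state between calls and frees everything it allocates, so
each input is processed independently of the previous one.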

Examples: 
[boringssl](https://github.com/google/boringssl/tree/master/fuzz),
[SQLite](https://www.sqlite.org/src/artifact/ad79e867fb504338),
[s2n](https://github.com/awslabs/s2n/tree/master/tests/fuzz),
[openssl](https://github.com/openssl/openssl/tree/master/fuzz),
[FreeType](http://git.savannah.gnu.org/cgit/freetype/freetype2.git/tree/src/tools/ftfuzzer),
[re2](https://github.com/google/re2/tree/master/re2/fuzzing),
[harfbuzz](https://github.com/behdad/harfbuzz/tree/master/test/fuzzing),
[pcre2](https://vcs.pcre.org/pcre2/code/trunk/src/pcre2_fuzzsupport.c?view=markup),
[ffmpeg](https://github.com/FFmpeg/FFmpeg/blob/master/tools/target_dec_fuzzer.c).

## Build support
A plethora of different build systems exist in the open-source world.
And the less OSS-Fuzz knows about them, the better it can scale.

An ideal build integration for OSS-Fuzz would look like this:
* For every fuzz target `foo` in the project, there is a build rule that
builds `foo_fuzzer`, a binary that contains the fuzzing entry point
(`LLVMFuzzerTestOneInput`) and all the code it depends on, and that uses the
`main()` function from `$LIB_FUZZING_ENGINE`
(an env var [provided]({{ site.baseurl }}/getting-started/new-project-guide/) by the OSS-Fuzz environment).
* The build system supports changing the compiler and passing extra compiler
flags so that the build command for a `foo_fuzzer` looks similar to this:

```bash
# Assume the following env vars are set:
# CC, CXX, CFLAGS, CXXFLAGS, LIB_FUZZING_ENGINE
$ make_or_whatever_other_command foo_fuzzer
```

This keeps the OSS-Fuzz-specific configuration minimal and thus makes the
integration more robust.

There is no point in hardcoding the exact compiler flags in the build system
because they a) may change and b) are different depending on the fuzzing engine
and the sanitizer being used.

## Seed Corpus
The *corpus* is a set of inputs for the fuzz target (stored as individual files). 
When starting the fuzzing process, one should have a "seed corpus", 
i.e. a set of inputs to "seed" the mutations.
The quality of the seed corpus has a huge impact on fuzzing efficiency as it
allows the fuzzer to discover new code paths more easily.

The ideal corpus is a minimal set of inputs that provides maximal code coverage. 

For better OSS-Fuzz integration,
the seed corpus should be available in revision control (either in the same
repository as the source code or in a separate one). It should be regularly
extended with inputs that trigger (or used to trigger) bugs and/or touch new
parts of the code.

Examples: 
[boringssl](https://github.com/google/boringssl/tree/master/fuzz),
[openssl](https://github.com/openssl/openssl/tree/master/fuzz),
[nss](https://github.com/mozilla/nss-fuzzing-corpus) (corpus in a separate repo).

## Dictionary
For some input types, a simple dictionary of tokens used by the input language
can have a dramatic positive effect on fuzzing efficiency. 
For example, when fuzzing an XML parser, a dictionary of XML tokens will help.
AFL has a [collection](https://github.com/rc0r/afl-fuzz/tree/master/dictionaries)
of such dictionaries for some of the popular data formats.
Ideally, a dictionary should be maintained alongside the fuzz target.
The syntax is described [here](http://libfuzzer.info/#dictionaries).
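
As a rough illustration of that syntax, a small dictionary for an XML parser
might look like the fragment below. The entry names (`cdata_open`, etc.) are
arbitrary labels chosen for this sketch, and the token selection is only an
example, not a recommended set.

```
# Lines starting with '#' are comments.
# Each entry is a quoted token, optionally preceded by a name.
cdata_open="<![CDATA["
cdata_close="]]>"
attlist="<!ATTLIST"
"<?xml"
# Bytes can be written with \xNN escapes (here, a UTF-8 byte order mark).
"\xEF\xBB\xBF"
```

With libFuzzer, such a file is passed to the fuzz target binary with the
`-dict=` flag.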

## Coverage
For a fuzz target to be useful, it must have good coverage in the code that it
is testing. You can view the coverage for your fuzz targets by looking at the
[fuzzer stats]({{ site.baseurl }}/further-reading/clusterfuzz#fuzzer-stats)
dashboard on ClusterFuzz, as well as
[coverage reports]({{ site.baseurl }}/further-reading/clusterfuzz#coverage-reports).

To generate an aggregated code coverage report for your project, please see
[code coverage]({{ site.baseurl }}/advanced-topics/code-coverage)
documentation page.

Coverage can often be improved by adding dictionaries, more inputs for seed
corpora, and fixing timeouts/out-of-memory bugs in your targets.

## Regression Testing
The fuzz targets should be regularly tested (not necessarily fuzzed!) as a part
of the project's regression testing process.
One way to do so is to link the fuzz target with a simple standalone driver
(e.g. [this one](https://github.com/llvm-mirror/compiler-rt/tree/master/lib/fuzzer/standalone))
that runs the provided inputs, and to run this driver on the
[seed corpus](#seed-corpus) described above. It is recommended to use
[sanitizers](https://github.com/google/sanitizers) during regression testing.

Examples: [SQLite](https://www.sqlite.org/src/artifact/d9f1a6f43e7bab45),
[openssl](https://github.com/openssl/openssl/blob/master/fuzz/test-corpus.c)
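
A standalone driver of this kind can be quite small. The sketch below is not
the driver linked above; it is a simplified illustration under a few
assumptions: the stand-in `LLVMFuzzerTestOneInput` exists only so the file
builds on its own, and both `run_corpus_file` and the `REGRESSION_DRIVER_MAIN`
macro are hypothetical names introduced here.

```c
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

/* Stand-in fuzz target so this sketch builds on its own. In a real project,
 * delete this definition and link the driver with the actual fuzz target. */
int LLVMFuzzerTestOneInput(const uint8_t *data, size_t size) {
  (void)data;
  (void)size;
  return 0;
}

/* Reads one corpus file into memory and runs the fuzz target on it once.
 * Returns 0 on success, -1 if the file could not be read. */
int run_corpus_file(const char *path) {
  FILE *f = fopen(path, "rb");
  if (f == NULL) return -1;
  if (fseek(f, 0, SEEK_END) != 0) { fclose(f); return -1; }
  long len = ftell(f);
  if (len < 0 || fseek(f, 0, SEEK_SET) != 0) { fclose(f); return -1; }

  uint8_t *buf = malloc(len > 0 ? (size_t)len : 1);
  if (buf == NULL) { fclose(f); return -1; }
  size_t n = fread(buf, 1, (size_t)len, f);
  fclose(f);
  if (n != (size_t)len) { free(buf); return -1; }

  LLVMFuzzerTestOneInput(buf, n);
  free(buf);
  return 0;
}

/* Build with -DREGRESSION_DRIVER_MAIN (hypothetical macro) to get a
 * command-line tool that runs every file named in its arguments.
 * Compiling with e.g. -fsanitize=address,undefined makes memory errors
 * abort the run and thereby fail the test. */
#ifdef REGRESSION_DRIVER_MAIN
int main(int argc, char **argv) {
  int failures = 0;
  for (int i = 1; i < argc; i++) {
    if (run_corpus_file(argv[i]) != 0) {
      fprintf(stderr, "cannot read %s\n", argv[i]);
      failures++;
    }
  }
  return failures == 0 ? 0 : 1;
}
#endif
```

Guarding `main()` behind a macro is a design choice of this sketch: it lets a
project either build the driver as its own test binary or call
`run_corpus_file` from an existing test harness.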

## Performance
Fuzz targets should also be performant, as high memory usage and/or slow
execution speed can slow down the growth of coverage and the discovery of new
bugs. ClusterFuzz provides a
[performance analyzer]({{ site.baseurl }}/further-reading/clusterfuzz/#performance-analyzer)
for each fuzz target that shows problems that are impacting the performance of
the fuzz target.

## Example

You may look at a simple
[example](https://github.com/google/oss-fuzz/tree/master/projects/example/my-api-repo)
that covers most of the items above. 

## Not a project member?

If you are a member of the project you want to fuzz, most of the steps above are simple.
However, in some cases, someone outside the project team may want to fuzz the code
while the project maintainers are not interested in helping.

In such cases, we can host the fuzz targets, dictionaries, etc in OSS-Fuzz's 
repository and mention them in the Dockerfile.
Examples: [libxml2](https://github.com/google/oss-fuzz/tree/master/projects/libxml2),
[c-ares](https://github.com/google/oss-fuzz/tree/master/projects/c-ares), [expat](https://github.com/google/oss-fuzz/tree/master/projects/expat).
This is far from ideal because the fuzz targets will not be continuously tested 
and hence may quickly bitrot.

If you are not a project maintainer, we may not be able to CC you on security
bugs found by OSS-Fuzz.