research-article

Understanding and Improving Coverage Tracking with AFL++ (Registered Report)

Authors:

Stefan BrunthalerAuthors Info & Claims

FUZZING 2024: Proceedings of the 3rd ACM International Fuzzing Workshop

Pages 80 - 89

https://doi.org/10.1145/3678722.3685537

Published: 13 September 2024 Publication History

Get Access

Abstract

Coverage-based fuzzers track which program parts they visit when executing a specific input as a proxy measure to (1) guide the fuzzing process, and (2) explore the PUT's state space. One way to record coverage progress is to enumerate basic block pairs (e.g., edges in the control-flow graph) and use them to index into a hash table that holds counters. The counter is incremented every time a fuzzer's input exercises the corresponding edge. Traditionally the coverage map has been a compact bitmap that fits the L2 CPU cache to reduce runtime overhead and boost fuzzing throughput. In such a design where space is traded for speed, two sources of imprecision can arise: (1) collisions, and (2) arithmetic inaccuracies.

Collisions refer to the situation when two different basic block pairs hash to the same entry. Imprecision arises since one pair is now counted together, but the fuzzer cannot tell one apart from the other.

Arithmetic inaccuracies refer to errors in the counting strategy. For example, a monotonically incrementing counter inside the hash table can overflow. This indicates a situation where high-frequency control-flow exceeds the predefined, expected maximum counter size (e.g., in loops). Due to execution frequencies obeying exponential power laws, such overflows will affect a small number of hash table entries. Another arithmetic inaccuracy results from range-based counters that capture only predefined frequency intervals (e.g., logarithmic counters).

In 2018, CollAFL examined how collisions impact precision, and presented a new hashing scheme to reduce the number of collisions. CollAFL did not address the problem of arithmetic inaccuracies. Furthermore, CollAFL considered only a single-core virtual machine, a limited set of benchmark programs, and did not explore hardware-specific effects (e.g., cache utilization for concurrent fuzzing processes).

This registered report aims at providing new insights of how collisions and arithmetic inaccuracies affect coverage tracking for fuzzing. We propose experiments for multiple hardware architectures with different cache topologies, and a more diverse set of benchmark programs. Leveraging the evaluation data, our aim is to determine precise architecture-aware settings for AFL++. Furthermore, we plan to demonstrate an adaptive optimization strategy that optimizes the coverage map to collisions and counting strategies for a specific combination of the CPU architecture and PUT.

References

[1]

Alif Ahmed, Jason D. Hiser, Anh Nguyen-Tuong, Jack W. Davidson, and Kevin Skadron. 2021. BigMap: Future-proofing Fuzzers with Efficient Large Maps. In 2021 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN). 531–542. issn:2158-3927 https://doi.org/10.1109/DSN48987.2021.00062

Abstract

References

Index Terms

Recommendations

Fine-grained Coverage-based Fuzzing

Feature-Sensitive Coverage for Conformance Testing of Programming Language Implementations

JQF: coverage-guided property-based testing in Java

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Upcoming Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Get Access

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations