AI Model Security Testing
Background
Different from fuzz testing for traditional programs, AI model security testing requires criteria tailored to neural networks. MindArmour provides the fuzz_testing module for deep neural networks: based on the characteristics of neural networks, it introduces the concept of neuron coverage [1] to guide the fuzzing. Fuzzing is steered to generate samples in the direction of increasing neuron coverage, so that more neurons are activated by the inputs and neuron values are distributed over a wider range. This tests the DNN more thoroughly and explores the outputs and erroneous behaviors of different types of models.
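To make the guiding metric concrete, here is a minimal NumPy sketch of the basic neuron coverage idea from [1]: a neuron counts as covered once some input activates it above a threshold. The function name and the 0.5 threshold are illustrative assumptions, not MindArmour's implementation.

```python
import numpy as np

def neuron_coverage(activations, threshold=0.5):
    """Fraction of neurons activated above `threshold` by at least one input.

    activations: (num_samples, num_neurons) array of neuron outputs recorded
    while running the test inputs through the network. The 0.5 threshold is
    an illustrative choice, not a fixed constant from MindArmour.
    """
    activated = (activations > threshold).any(axis=0)  # per-neuron flag
    return activated.mean()

# Example: 4 inputs, 3 neurons; neurons 0 and 2 fire above the threshold.
acts = np.array([[0.9, 0.1, 0.2],
                 [0.3, 0.4, 0.8],
                 [0.2, 0.2, 0.1],
                 [0.6, 0.3, 0.4]])
print(neuron_coverage(acts))  # 2 of 3 neurons covered -> ~0.667
```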
Fuzz Testing Design
The following figure shows the security testing design of the AI model.

At the user interface layer, users provide the original dataset (DataSet), the model to be tested (Model), and the fuzzer parameters (Fuzzer configuration). After fuzzing the model with the data, the Fuzzer module returns a security report (Security Report).
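In code, this interface boils down to: build a model, describe the mutation configuration, assemble seeds from the dataset, and read back the report. The sketch below follows the shape of MindArmour's fuzz_testing API, but the exact class and method signatures have changed across releases, so treat every call here as an assumption to verify against your installed version; net, train_images, test_images, and test_labels stand for a trained network and prepared data (not shown).

```python
from mindspore import Model
# Assumption: class name and signatures follow an older MindArmour release
# of the fuzz_testing module; verify against your installed version.
from mindarmour.fuzz_testing import Fuzzer

model = Model(net)  # net: a trained mindspore.nn.Cell (training not shown)

# Fuzzer configuration: mutation methods and their parameter ranges.
mutate_config = [
    {'method': 'Blur',     'params': {'auto_param': [True]}},
    {'method': 'Contrast', 'params': {'auto_param': [True]}},
    {'method': 'FGSM',     'params': {'eps': [0.3], 'alpha': [0.1]}},
]

# Original dataset: initial seeds are (sample, label) pairs from the test set.
initial_seeds = [[img, label]
                 for img, label in zip(test_images[:100], test_labels[:100])]

# train_images profiles neuron ranges; the neuron and segment counts passed
# here (10, 1000) are assumptions about the constructor's arguments.
fuzzer = Fuzzer(model, train_images, 10, 1000)
samples, gt, preds, strategies, report = fuzzer.fuzzing(
    mutate_config, initial_seeds, eval_metrics='auto')
print(report)  # security report: accuracy, coverage, attack success rate
```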
The fuzz testing architecture consists of three modules:
Natural Threat/Adversarial Example Generator:
Randomly selects a mutation method to mutate the seed data and generate multiple variants. The supported mutation policies include the following (a sketch of two such mutations follows this list):
Image affine transformation methods: Translate, Rotate, Scale, and Shear.
Methods based on image pixel value changes: Contrast, Brightness, Blur, and Noise.
Methods for generating adversarial examples based on white-box and black-box attacks: FGSM, PGD, and MDIIM.
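For intuition, here is a minimal NumPy sketch of one pixel-value mutation (brightness) and one affine mutation (translation); the function names are illustrative and do not mirror the image_transform.py API.

```python
import numpy as np

def mutate_brightness(img, delta=0.2):
    """Pixel-value mutation: shift brightness, keeping pixels in [0, 1]."""
    return np.clip(img + delta, 0.0, 1.0)

def mutate_translate(img, dx=2, dy=0):
    """Affine mutation: shift the image dx pixels right and dy pixels down,
    filling the vacated border with zeros."""
    out = np.zeros_like(img)
    h, w = img.shape[:2]
    out[max(dy, 0):h + min(dy, 0), max(dx, 0):w + min(dx, 0)] = \
        img[max(-dy, 0):h + min(-dy, 0), max(-dx, 0):w + min(-dx, 0)]
    return out

seed = np.random.rand(28, 28)  # e.g. a normalized grayscale image
variants = [mutate_brightness(seed), mutate_translate(seed)]
```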
Fuzzer Module:
Performs fuzz testing on the mutated data and observes the change in neuron coverage. If a generated sample increases the neuron coverage, it is added to the seed queue for the next round of data mutation. Currently, the following neuron coverage metrics are supported: KMNC, NBC, and SNAC [2].
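As an example of these metrics, here is a minimal NumPy sketch of KMNC (k-multisection neuron coverage) following the definition in [2]: each neuron's activation range, profiled on training data, is split into k sections, and KMNC measures the fraction of sections hit by the test inputs. NBC and SNAC instead count activations that fall outside the profiled range. This illustrates the definition and is not the model_coverage_metrics.py implementation.

```python
import numpy as np

def kmnc(train_acts, test_acts, k=100):
    """K-multisection neuron coverage, sketched from the definition in [2].

    train_acts, test_acts: (num_samples, num_neurons) activation arrays.
    Each neuron's range [low, high], profiled on the training data, is split
    into k equal sections; KMNC is the fraction of all neuron-sections hit
    by at least one test activation.
    """
    low = train_acts.min(axis=0)
    high = train_acts.max(axis=0)
    covered = np.zeros((train_acts.shape[1], k), dtype=bool)
    for n in range(train_acts.shape[1]):
        acts = test_acts[:, n]
        # Only in-range activations count toward KMNC; values outside the
        # profiled range belong to the corner-case metrics NBC and SNAC.
        acts = acts[(acts >= low[n]) & (acts <= high[n])]
        sections = (acts - low[n]) / (high[n] - low[n] + 1e-12) * k
        covered[n, np.minimum(sections, k - 1).astype(int)] = True
    return covered.mean()
```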
Evaluation:
Evaluates the fuzz testing effect, the quality of the generated data, and the strength of the mutation methods. Five metrics in three categories are supported: a general evaluation metric (accuracy), neuron coverage metrics (kmnc, nbc, and snac), and an adversarial attack evaluation metric (attack_success_rate).
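Two of these metrics are simple enough to sketch directly. The input names below (gt_labels, predictions, adv_mask) are assumptions about how the fuzzing outputs might be arranged, not MindArmour's actual report format.

```python
import numpy as np

def evaluate_fuzz_results(gt_labels, predictions, adv_mask):
    """Sketch of the accuracy and attack_success_rate metrics.

    gt_labels:   ground-truth labels of the fuzzed samples
    predictions: model predictions on those samples
    adv_mask:    True where a sample was produced by an adversarial attack
    """
    correct = predictions == gt_labels
    accuracy = correct.mean()  # general evaluation metric
    # Share of adversarial variants that the model misclassifies.
    attack_success_rate = (~correct[adv_mask]).mean() if adv_mask.any() else 0.0
    return {'accuracy': accuracy, 'attack_success_rate': attack_success_rate}

# Example with assumed arrays: 3 of 4 correct overall, 1 of 2 attacks succeed.
print(evaluate_fuzz_results(np.array([0, 1, 2, 3]),
                            np.array([0, 1, 2, 0]),
                            np.array([False, True, False, True])))
```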
Fuzz Testing Process
The fuzz testing process is as follows (a sketch of the loop follows the list):

1. Select seed A from the seed queue according to the seed selection policy.
2. Randomly select a mutation policy to mutate seed A and generate multiple variants A1, A2, ...
3. Use the target model to predict the variants. If the semantics of a variant are consistent with those of the seed, the variant enters the Fuzzed Tests set.
4. If the prediction is correct, analyze the variant with the neuron coverage metric.
5. If a variant increases the coverage, place it in the seed queue for the next round of mutation.
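Putting the five steps together, here is a minimal sketch of the coverage-guided loop. The callbacks mutate, predict, coverage_gain, and is_valid are assumed interfaces standing in for the modules above, not MindArmour's actual functions.

```python
from collections import deque

def fuzz_loop(initial_seeds, mutate, predict, coverage_gain, is_valid,
              max_rounds=1000):
    """Coverage-guided fuzzing loop following the five steps above.

    mutate(x)           -> iterable of variants of x
    predict(x)          -> predicted label for x
    coverage_gain(x)    -> True iff x increases neuron coverage (updates state)
    is_valid(seed, var) -> True iff the variant keeps the seed's semantics
    """
    seeds = deque(initial_seeds)            # FIFO as a simple selection policy
    fuzzed_tests = []
    for _ in range(max_rounds):
        if not seeds:
            break
        sample, label = seeds.popleft()     # 1. select seed A
        for variant in mutate(sample):      # 2. mutate into A1, A2, ...
            if not is_valid(sample, variant):
                continue                    # 3. semantics must match the seed
            fuzzed_tests.append((variant, label))
            if predict(variant) == label:   # 4. correct: run coverage analysis
                if coverage_gain(variant):  # 5. coverage grew: re-queue variant
                    seeds.append((variant, label))
    return fuzzed_tests
```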
Through multiple rounds of mutation, you obtain a series of variant data in the Fuzzed Tests set. Further analysis of this data yields security reports from multiple perspectives, which you can use to analyze defects of the neural network model in depth and then harden the model to improve its generalization and robustness.
Code Implementation
fuzzing.py: overall fuzz testing process.
model_coverage_metrics.py: neuron coverage rate metrics, including KMNC, NBC, and SNAC.
image_transform.py: image mutation methods, including methods based on image pixel value changes and affine transformation methods.
adversarial attacks: methods for generating adversarial examples based on white-box and black-box attacks.
References
[1] Pei K, Cao Y, Yang J, et al. DeepXplore: Automated whitebox testing of deep learning systems[C]//Proceedings of the 26th Symposium on Operating Systems Principles. ACM, 2017: 1-18.
[2] Ma L, Juefei-Xu F, Zhang F, et al. DeepGauge: Multi-granularity testing criteria for deep learning systems[C]//Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. ACM, 2018: 120-131.