DyAdvDefender: An instance-based online machine learning model for perturbation-trial-based black-box adversarial defense

Published in Journal of Information Sciences, 2022

Recommended citation: Li, Miles Q., Benjamin CM Fung, and Philippe Charland. "DyAdvDefender: An instance-based online machine learning model for perturbation-trial-based black-box adversarial defense." Information Sciences (2022). https://www.sciencedirect.com/science/article/pii/S0020025522003747?casa_token=p5N50hWOf0oAAAAA:OoG3up9I8-W8kW1zutzK3zuzOZL1kpWspm_7h0YJZC_aowNcFvN97aUNwcWJvMX61QngMi4aNjy4

Recent research indicates that machine learning models are vulnerable to adversarial samples that are slightly perturbed versions of natural samples. Adversarial samples can be crafted in white-box or black-box scenario. In the black-box scenario adversaries possess no knowledge of the detailed architecture and parameters of the model they attack, and they seek information by querying the model with multiple, slightly perturbed samples to achieve their attack purpose. The difficulty in recognizing adversarial samples arises from the fact that the perturbations are often imperceptible, yet effective in misleading machine learning models. Existing defense methods are static, and they cannot dynamically evolve to adapt to adversarial attacks, which unnecessarily disadvantages them. In this paper we propose a novel dynamic defense method called DyAdvDefender, and we show that a dynamic defense method can effectively utilize previous experience to defend against black-box attacks. Extensive experimental results suggest that DyAdvDefender outperforms existing static methods in terms of defense effectiveness while keeping the original classification accuracy with only limited extra time consumption.

Download paper here

Recommended citation: Li, Miles Q., Benjamin CM Fung, and Philippe Charland. “DyAdvDefender: An instance-based online machine learning model for perturbation-trial-based black-box adversarial defense.” Information Sciences (2022).