Privacy Preserving Defense For Black Box Classifiers Against On-Line Adversarial Attacks | PERPUSTAKAAN UNIVERSITAS KATOLIK PARAHYANGAN

Text

Privacy Preserving Defense For Black Box Classifiers Against On-Line Adversarial Attacks

Theagarajan, Rajkumar - Personal Name; Bhanu, Bir - Personal Name

Deep learning models have been shown to be vulnerable to adversarial attacks. Adversarial attacks are imperceptible perturbations added to an image such that the deep learning model misclassifies the image with high confidence. Existing adversarial defenses validate their performance using only classification accuracy. However, classification accuracy by itself is not a reliable metric for determining whether the resulting image is “adversarial-free”. This is a foundational problem for online image recognition applications, where the ground truth of the incoming image is not known, so we can neither compute the accuracy of the classifier nor validate whether the image is “adversarial-free”. This paper proposes a novel privacy-preserving framework for defending black-box classifiers from adversarial attacks using an ensemble of iterative adversarial image purifiers whose performance is continuously validated in a loop using Bayesian uncertainties. The proposed approach can convert a single-step black-box adversarial defense into an iterative defense, and it introduces three novel privacy-preserving Knowledge Distillation (KD) approaches that use prior meta-information from various datasets to mimic the performance of the black-box classifier. Additionally, this paper proves the existence of an optimal distribution for the purified images that can reach a theoretical lower bound, beyond which the image can no longer be purified. Experimental results on six public benchmark datasets, namely 1) Fashion-MNIST, 2) CIFAR-10, 3) GTSRB, 4) MIO-TCD, 5) Tiny-ImageNet, and 6) MS-Celeb, show that the proposed approach can consistently detect adversarial examples and purify or reject them against a variety of adversarial attacks.
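The purify-then-validate loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the use of predictive entropy as the uncertainty measure, the fixed threshold, and every name below (`predictive_entropy`, `purify_or_reject`, `toy_purify`, `toy_sample_probs`) are assumptions made purely for illustration. The toy purifier and toy probability sampler stand in for a real image purifier and for stochastic forward passes of a surrogate model.

```python
import math
from typing import Callable, List, Sequence, Tuple

Image = List[float]


def predictive_entropy(prob_samples: Sequence[Sequence[float]]) -> float:
    """Entropy (in nats) of the mean of several class-probability vectors."""
    n_classes = len(prob_samples[0])
    mean = [
        sum(p[c] for p in prob_samples) / len(prob_samples)
        for c in range(n_classes)
    ]
    return -sum(q * math.log(q) for q in mean if q > 0.0)


def purify_or_reject(
    image: Image,
    purify: Callable[[Image], Image],
    sample_probs: Callable[[Image], Sequence[Sequence[float]]],
    threshold: float,
    max_iters: int = 5,
) -> Tuple[Image, str, int]:
    """Purify repeatedly until the uncertainty drops below the threshold.

    Returns (image, status, purification_steps), where status is
    "accepted" or "rejected". The threshold is an assumed hyperparameter.
    """
    current = list(image)
    for steps in range(max_iters + 1):
        if predictive_entropy(sample_probs(current)) <= threshold:
            return current, "accepted", steps
        if steps == max_iters:
            break
        current = purify(current)
    return current, "rejected", max_iters


# Toy stand-ins (hypothetical): the "image" is just a residual perturbation.
def toy_purify(image: Image) -> Image:
    """Halve the residual perturbation on each pass."""
    return [0.5 * x for x in image]


def toy_sample_probs(image: Image) -> List[List[float]]:
    """Two identical two-class 'stochastic' predictions; larger residuals
    push the prediction toward the maximally uncertain 50/50 split."""
    noise = min(1.0, sum(abs(x) for x in image))
    p = 1.0 - 0.5 * noise
    return [[p, 1.0 - p], [p, 1.0 - p]]


if __name__ == "__main__":
    _, status, steps = purify_or_reject(
        [0.4, 0.4], toy_purify, toy_sample_probs, threshold=0.3
    )
    print(status, steps)
```

In this sketch, an input whose uncertainty never falls below the threshold within `max_iters` purification passes is rejected rather than classified, which mirrors the "purify or reject" behavior described above without requiring ground-truth labels.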

Availability

Barcode		Collection Type	Call Number	Location	Status
art145980	null	Article		Gdg9-Lt3	Available but not for loan - No Loan

Detailed Information

Series Title: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE; Vol. 44, No. 12, Part 2, December 2022
Call No.: -
Publisher: -
Physical Description: p. 9503-9520
Language: English
ISBN/ISSN: -
Classification: NONE
Content Type: -
Media Type: -
Carrier Type: -
Edition: -
Subject: ADVERSARIAL DEFENSE
KNOWLEDGE DISTILLATION
BAYESIAN UNCERTAINTIES
BLACK BOX DEFENSE
ENSEMBLE OF DEFENSES
IMAGE PURIFIERS
PRIVACY PRESERVING DEFENSE
Specific Detail Info: 10.1109/TPAMI.2021.3125931
Statement of Responsibility: Rajkumar Theagarajan, Bir Bhanu

Other/Related Versions

No other versions available

File Attachments

No Data
