Attack to Fool and Explain Deep Networks | PERPUSTAKAAN UNIVERSITAS KATOLIK PARAHYANGAN

Pencarian berdasarkan :

Pencarian terakhir:

Text

Attack to Fool and Explain Deep Networks

Akhtar, Naveed - Nama Orang; Bennamoun, Mohammed - Nama Orang; Mian, Ajmal - Nama Orang; Jalwana, Mohammad A. A. K. - Nama Orang;

Deep visual models are susceptible to adversarial perturbations to inputs. Although these signals are carefully crafted, they still appear noise-like patterns to humans. This observation has led to the argument that deep visual representation is misaligned with human perception. We counter-argue by providing evidence of human-meaningful patterns in adversarial perturbations. We first propose an attack that fools a network to confuse a whole category of objects (source class) with a target label. Our attack also limits the unintended fooling by samples from non-sources classes, thereby circumscribing human-defined semantic notions for network fooling. We show that the proposed attack not only leads to the emergence of regular geometric patterns in the perturbations, but also reveals insightful information about the decision boundaries of deep models. Exploring this phenomenon further, we alter the ‘adversarial’ objective of our attack to use it as a tool to ‘explain’ deep visual representation. We show that by careful channeling and projection of the perturbations computed by our method, we can visualize a model's understanding of human-defined semantic notions. Finally, we exploit the explanability properties of our perturbations to perform image generation, inpainting and interactive image manipulation by attacking adversarialy robust ‘classifiers’. In all, our major contribution is a novel pragmatic adversarial attack that is subsequently transformed into a tool to interpret the visual models. The article also makes secondary contributions in terms of establishing the utility of our attack beyond the adversarial objective with multiple interesting applications.

Ketersediaan

Barcode		Tipe Koleksi	Nomor Panggil	Lokasi	Status
art145168	null	Artikel		Gdg9-Lt3	Tersedia namun tidak untuk dipinjamkan - No Loan

Informasi Detail

Judul Seri: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE; Vol.44 No.10 Part 1 October 2022
No. Panggil: -
Penerbit: : .,
Deskripsi Fisik: p. 5980-5995
Bahasa: English
ISBN/ISSN: -
Klasifikasi: NONE
Tipe Isi: -
Tipe Media: -
Tipe Pembawa: -
Edisi: -
Subjek: ADVERSARIAL EXAMPLES
PERTURBATIONS
EXPLAINABLE AI
TARGETED ATTACK
MODEL INTERPRETATION
Info Detail Spesifik: DOI: 10.1109/TPAMI.2021.3083769
Pernyataan Tanggungjawab: Naveed Akhtar, Mohammad A. A. K. Jalwana, Mohammed Bennamoun, Ajmal Mian

Versi lain/terkait

Tidak tersedia versi lain

Lampiran Berkas

Tidak Ada Data

Komentar

Anda harus masuk sebelum memberikan komentar