Local Gradients Smoothing: Defense against localized adversarial attacks

## Publication date
2018-07-03

## 1. Overview
Proposes Local Gradients Smoothing (LGS), a method that defends DNNs against a range of localized attacks such as LaVAN and the adversarial patch.

Sample (below):

<img width="473" alt="screen shot 2018-07-21 at 17 22 31" src="https://user-images.githubusercontent.com/10243885/43033763-b1fbb2d2-8d0a-11e8-8e09-c911d4fcf204.png">


## 2. Novelty / Differences


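## 3. Method
Attacks such as LaVAN and the adversarial patch embed high-frequency noise at a specific location, which shows up as a very strong response in the image gradient domain. LGS therefore suppresses these high-frequency regions without affecting the low-frequency image regions needed for classification, which greatly reduces the effect of the noise.

The image gradient is computed with the following equation.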

<img width="285" alt="screen shot 2018-07-21 at 17 06 58" src="https://user-images.githubusercontent.com/10243885/43033679-a2d9c16a-8d08-11e8-82d7-27ab60c74b2d.png">

Next, the gradient magnitude is normalized over the whole image.

<img width="319" alt="screen shot 2018-07-21 at 17 07 25" src="https://user-images.githubusercontent.com/10243885/43033686-bef88624-8d08-11e8-9441-df13a7d8374e.png">
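The two steps above can be sketched in NumPy as follows. This is a minimal sketch, assuming a 2-D grayscale image, forward differences as the discrete gradient, and min-max scaling as the normalization; the equations appear only as images here, so the exact discretization is an assumption. The function name `normalized_gradient_magnitude` is hypothetical.

```python
import numpy as np


def normalized_gradient_magnitude(image: np.ndarray) -> np.ndarray:
    """Return the gradient magnitude of a 2-D image, scaled to [0, 1]."""
    x = image.astype(np.float64)
    # Forward differences along rows and columns. The last row/column is
    # repeated so that the output keeps the input shape (assumed padding).
    d_rows = np.diff(x, axis=0, append=x[-1:, :])
    d_cols = np.diff(x, axis=1, append=x[:, -1:])
    magnitude = np.sqrt(d_rows ** 2 + d_cols ** 2)
    # Min-max normalization over the whole image.
    lo, hi = magnitude.min(), magnitude.max()
    if hi == lo:  # flat image: there is no gradient to normalize
        return np.zeros_like(magnitude)
    return (magnitude - lo) / (hi - lo)
```

A flat image yields an all-zero map, while any image with an edge yields a map whose strongest response is exactly 1.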

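Suppressing the image in proportion to this normalized gradient map smooths the high-frequency details of the input image.
The noise suppression is performed with the following equation.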

<img width="219" alt="screen shot 2018-07-21 at 17 07 29" src="https://user-images.githubusercontent.com/10243885/43033710-491f5864-8d09-11e8-9ebc-9e53bc3cc8de.png">
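As a rough illustration, the suppression step might look like the sketch below. It assumes the multiplicative form `T(x) = x * (1 - λ * g(x))`, with `λ * g(x)` clipped to `[0, 1]`; the equation is shown only as an image here, so this form is an assumption. `suppress` and `lam` (the smoothing factor λ) are hypothetical names, and `g` is a precomputed normalized gradient map.

```python
import numpy as np


def suppress(image: np.ndarray, g: np.ndarray, lam: float) -> np.ndarray:
    """Scale each pixel by (1 - lam * g), with lam * g clipped to [0, 1]."""
    weight = np.clip(lam * g, 0.0, 1.0)
    if image.ndim == 3:
        # Broadcast the 2-D map over the channel axis of an H x W x C image.
        weight = weight[..., np.newaxis]
    return image * (1.0 - weight)
```

Pixels where `g` is 0 pass through unchanged, and pixels where `lam * g` reaches 1 are set to 0.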

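λ is the LGS smoothing factor, with `0≦λ*g(x)≦1`.
Applying this operation globally degrades the accuracy of the classifier itself, so a block-wise approach is used instead.
The gradient magnitude map is split into K blocks of the same size τ. The blocks are then filtered with a threshold γ to estimate the activated regions most likely to contain adversarial noise, as expressed by the following equation.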

<img width="381" alt="screen shot 2018-07-21 at 17 07 34" src="https://user-images.githubusercontent.com/10243885/43033756-8a5fc77c-8d0a-11e8-81eb-f51de880b6d1.png">
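The block-wise filtering described above could be sketched as follows. This is an illustrative sketch only: it assumes non-overlapping square blocks and a per-block mean compared against γ, and it keeps a block unchanged when the mean exceeds γ and zeroes it otherwise. `blockwise_filter`, `block` (the block size τ), and `gamma` are hypothetical names.

```python
import numpy as np


def blockwise_filter(g: np.ndarray, block: int, gamma: float) -> np.ndarray:
    """Zero out every block of g whose mean activation is not above gamma."""
    out = np.zeros_like(g)
    height, width = g.shape
    for top in range(0, height, block):
        for left in range(0, width, block):
            window = g[top:top + block, left:left + block]
            # Keep only blocks that are strongly activated on average
            # (assumed criterion for "likely adversarial noise").
            if window.mean() > gamma:
                out[top:top + block, left:left + block] = window
    return out
```

Only the surviving blocks then contribute to the suppression, so isolated weak edges elsewhere in the image are left untouched.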



## 4. Results

<img width="560" alt="screen shot 2018-07-21 at 17 25 36" src="https://user-images.githubusercontent.com/10243885/43033777-1a9acb3e-8d0b-11e8-8d97-11752e3a32b4.png">

The accuracy comparison with a 95×95 patch applied is as follows:

<img width="336" alt="screen shot 2018-07-21 at 17 27 26" src="https://user-images.githubusercontent.com/10243885/43033788-5bf7b538-8d0b-11e8-9015-f8fd3831d245.png">

The computational cost comparison is as follows:

<img width="540" alt="screen shot 2018-07-21 at 17 28 18" src="https://user-images.githubusercontent.com/10243885/43033792-7ab97498-8d0b-11e8-9577-b0b991de4734.png">

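## 5. Discussion
A classifier arguably needs the following decision stages:

1. Detection: without constraints, an attacker can bypass essentially any defense system, but such inputs lie far from normal data and are therefore easier to detect as adversarial, so a detection system is put in place.
2. Transformation: to counter adversarial inputs, transform the input data with a method such as the LGS proposed here.
3. Classification: classify with a model whose robustness has been increased by adversarial training.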
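The three stages above can be wired together as in the sketch below. It is a hypothetical illustration, not something taken from the paper: `guarded_predict` and its parameters are made-up names, and returning `None` for inputs flagged by the detector is an assumption about how a rejection would be handled.

```python
from typing import Callable, Optional, TypeVar

X = TypeVar("X")
Y = TypeVar("Y")


def guarded_predict(
    x: X,
    detect: Callable[[X], bool],
    transform: Callable[[X], X],
    classify: Callable[[X], Y],
) -> Optional[Y]:
    """Run detection, transformation and classification in that order."""
    if detect(x):  # stage 1: reject inputs flagged as adversarial
        return None
    cleaned = transform(x)  # stage 2: e.g. an LGS-style input transformation
    return classify(cleaned)  # stage 3: ideally an adversarially trained model
```

Passing the stages in as callables keeps the detector, the input transformation, and the classifier independently replaceable.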

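## 6. Comments
Products that use DNNs are now fairly widespread, so this left me feeling that people building such products should properly implement the three stages described in the discussion.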

## Paper information / Links
https://arxiv.org/pdf/1807.01216.pdf

