Text Processing Like Humans Do:
Visually Attacking and Shielding NLP Systems
Steffen Eger, Gözde Gül Şahin, Andreas Rücklé, Ji-Ung Lee, Claudia Schulz,
Mohsen Mesgar, Krishnkant Swarnkar, Edwin Simpson, Iryna Gurevych
Ubiquitous Knowledge Processing Lab (UKP-TUDA)
Research Training Group AIPHES
Department of Computer Science, Technische Universität Darmstadt
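Self-introduction
長澤 駿太 (Tetsu)
- Hosei University, Faculty of Science and Engineering, Department of Applied Informatics, B4
  Member of the Intelligent Information Processing Laboratory (Iyatomi Lab)
- I write a blog (paper reading notes and the like)
  - https://tetsu316.hatenablog.com
- I also enjoy competitions such as Kaggle
  - I just enjoy them; I am not particularly strong
Research area
- Natural language processing that takes writing systems into account
@tetsu316naga
iron316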
Bibliographic information
Steffen Eger, Gözde Gül Şahin, Andreas Rücklé, Ji-Ung Lee,
Claudia Schulz, Mohsen Mesgar, Krishnkant Swarnkar,
Edwin Simpson, Iryna Gurevych
Text Processing Like Humans Do:
Visually Attacking and Shielding NLP Systems
In Proceedings of the 2019 Conference of the North
American Chapter of the Association for Computational
Linguistics: Human Language Technologies
arXiv: https://arxiv.org/abs/1903.11508
TL;DR
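Proposes Visual Perturber (VIPER), an attack based on visual perturbations
- Confirms that the scores of existing SoTA methods drop
- Confirms a gap between humans and NLP systems under VIPER
Proposes defenses against VIPER
- Three methods are proposed:
  - Image-based embedding
  - Adversarial training (AT)
  - A rule-based countermeasure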
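Background
Visual information is important to humans when recognizing characters
Humans can still read text in which characters have been replaced by visually similar ones
NLP systems, such as machine learning models,
are weak against such replacements
Visual perturbation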
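Proposed method (VIPER)
Proposes VIPER, which automatically generates sentences containing visual perturbations
- Each character is replaced with probability p (one Bernoulli trial per character)
- Replacement candidates for a character are obtained in one of the following ways
  - ICES (image-based character embedding space)
    - Character images are converted to vectors (24*24 -> 576)
    - The K nearest neighbors (K=20) are used
  - DCES (description-based character embedding space)
    - Uses the character descriptions assigned in Unicode 11.0.0
  - ECES (easy character embedding space)
    - Manually annotated
    - All covered characters belong to a-zA-Z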
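The perturbation procedure above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `dces_neighbors` and `viper`, the code point limit, and the rule "a neighbor is any character whose Unicode name extends the base name with WITH ..." are assumptions made here to approximate the DCES idea. Python's `unicodedata` module also uses whatever Unicode version ships with the interpreter, which need not be 11.0.0.

```python
import random
import unicodedata


def dces_neighbors(ch, max_codepoint=0x250):
    """Assumed DCES-like rule: characters whose Unicode name extends that of `ch`.

    For 'a' (LATIN SMALL LETTER A) this yields characters such as
    LATIN SMALL LETTER A WITH GRAVE. Only code points below
    `max_codepoint` are scanned to keep the sketch small.
    """
    try:
        base = unicodedata.name(ch)
    except ValueError:  # unnamed character, e.g. a control character
        return []
    prefix = base + " WITH "
    return [
        chr(cp)
        for cp in range(max_codepoint)
        if unicodedata.name(chr(cp), "").startswith(prefix)
    ]


def viper(text, p, neighbors=dces_neighbors, seed=None):
    """Replace each character independently with probability p."""
    rng = random.Random(seed)
    out = []
    for ch in text:
        candidates = neighbors(ch)
        # Characters without any candidate (spaces, digits, ...) stay as they are.
        if candidates and rng.random() < p:
            out.append(rng.choice(candidates))
        else:
            out.append(ch)
    return "".join(out)


print(viper("visually attacking nlp systems", p=0.4, seed=0))
```

Passing a different `neighbors` function (for example a lookup into an image-based or hand-made table) gives the other two variants without changing the sampling loop.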
Proposed method (VIPER)
Word representations based on ELMo [Peters+ 2018]
- SELMo (standard ELMo)
  - Word representations are built from the character level
  - A CNN is applied to the character-level embeddings
- VELMo (visually informed variant of ELMo)
  - Same architecture as SELMo
  - Uses ICES for the character representations
[Peters+ 2018] Matthew E. Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, Luke Zettlemoyer, "Deep
contextualized word representations", in NAACL 2018
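ICES, which supplies both VIPER's replacement candidates and VELMo's character representations, flattens a 24*24 glyph image into a 576-dimensional vector and looks up nearest neighbors. A minimal sketch follows. The glyphs are hypothetical stand-ins drawn as vertical strokes rather than rendered fonts, and the use of cosine similarity is an assumption of this sketch.

```python
import math

SIZE = 24  # side length of a glyph bitmap, giving 24 * 24 = 576 dimensions


def flatten(bitmap):
    """Turn a SIZE x SIZE grid of pixel values into one 576-dimensional vector."""
    return [float(v) for row in bitmap for v in row]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest_neighbors(char, vectors, k=20):
    """Return up to k characters whose vectors are most similar to that of `char`."""
    query = vectors[char]
    scored = [(cosine(query, vec), c) for c, vec in vectors.items() if c != char]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [c for _, c in scored[:k]]


def toy_glyph(columns):
    """Hypothetical glyph: vertical strokes in the given pixel columns."""
    return [[1 if x in columns else 0 for x in range(SIZE)] for _ in range(SIZE)]


vectors = {
    "l": flatten(toy_glyph({11, 12})),
    "I": flatten(toy_glyph({11, 12, 13})),
    "H": flatten(toy_glyph({4, 5, 18, 19})),
}

print(nearest_neighbors("l", vectors, k=1))
```

With real rendered glyphs in place of `toy_glyph`, the same lookup would produce the K=20 candidate list per character.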
Experiment (humans)
How well can humans recover text perturbed by VIPER?
- Six annotators, all native or near-native English speakers, took part
- Parameter settings
  - clean: VIPER(0,_), i.e., no perturbation
  - VIPER(p,ICES) for p = 0.2,0.4,0.6,0.8
  - VIPER(p,DCES) for p = 0.2,0.4,0.6,0.8
  - VIPER(p,ECES) for p = 0.4,0.8
- At most 20 sentences per condition
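Results (humans)
- The error rises as the replacement probability increases
- The error is higher for ICES because characters such as "i", "l", and "I" are easily confused
- Even in the worst case, at least 90% can be recovered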
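Experiment (NLP system)
VIPER is evaluated on four tasks
- G2P (grapheme-to-phoneme) -> character level
  - Predicts the phoneme sequence for a word
- POS tagging -> word level
  - Assigns a part of speech to each word
- Chunking -> word level
  - Shallow parsing (labels chunks such as noun phrases and verb phrases)
- TC (toxic comment classification) -> sentence level
  - Binary classification: toxic or non-toxic comment
  - Uses a Kaggle dataset
  - The sentence vector is the average of the word vectors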
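The sentence representation used for TC, an average of word vectors, can be sketched as below. The function name `average_vector` and the toy two-dimensional vectors are illustrative assumptions. In the experiments the word vectors come from ELMo, which is not reproduced here.

```python
def average_vector(word_vectors):
    """Element-wise mean of a non-empty list of equal-length word vectors."""
    if not word_vectors:
        raise ValueError("need at least one word vector")
    dim = len(word_vectors[0])
    count = len(word_vectors)
    return [sum(vec[i] for vec in word_vectors) / count for i in range(dim)]


# Toy vectors for a two-word sentence (hypothetical values).
sentence = [[1.0, 2.0], [3.0, 4.0]]
print(average_vector(sentence))  # [2.0, 3.0]
```

Because every word contributes equally to the mean, a perturbation that distorts the vector of a single word shifts the whole sentence vector.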
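Evaluation and results (NLP system)
Performance is evaluated as the ratio of the score after applying VIPER to the score before
The SOTA model for each task is used
Accuracy drops as p increases
On the G2P task, only about 20% of the SoTA accuracy is reached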
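The before/after ratio used for evaluation can be written as a one-line helper. The name `relative_score` and the numbers in the usage line are hypothetical and chosen only to show the arithmetic. They are not results from the paper.

```python
def relative_score(perturbed_score, clean_score):
    """Score on VIPER-perturbed input divided by the score on clean input."""
    if clean_score == 0:
        raise ValueError("clean score must be non-zero")
    return perturbed_score / clean_score


# Hypothetical scores: 45.0 on perturbed input, 90.0 on clean input.
print(relative_score(45.0, 90.0))  # 0.5
```

A value of 1.0 means the attack had no effect, and values near 0 mean the model has almost completely failed on perturbed input.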
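Proposed methods and experiments (defense)
Three defenses against VIPER are proposed
- AT (adversarial training)
- CE (character embedding)
  - ICES is used for G2P, VELMo for the other tasks
- RBR (rule-based recovery)
  - Rule-based character recovery using ECES
Evaluation
- Models are trained on training data perturbed with VIPER(0.2, DCES)
Final evaluation: the ratio before and after applying the defense, and the ratio before and after applying the attack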
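Rule-based recovery can be sketched as an inverse lookup over an ECES-style table. The table below is a small hypothetical excerpt written for this example, not the annotated table from the paper, and the names `ECES`, `RECOVERY`, and `rule_based_recovery` are assumptions.

```python
# Hypothetical excerpt of an ECES-style table: base letter -> its visual neighbor.
ECES = {"a": "â", "c": "ĉ", "e": "ê", "i": "î", "o": "ô", "s": "ŝ", "u": "û"}

# Invert the table so that each perturbed character maps back to its base letter.
RECOVERY = {neighbor: base for base, neighbor in ECES.items()}


def rule_based_recovery(text, table=RECOVERY):
    """Replace every character found in the table by its base letter."""
    return "".join(table.get(ch, ch) for ch in text)


print(rule_based_recovery("tôxîĉ"))  # toxic
```

This version leaves characters outside the table untouched. A stricter rule that forces every non a-zA-Z character onto a Latin letter would recover more attacks but would also damage text written in other scripts.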
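Results (defense)
- AT
  - Scores improve on every task except G2P
  - AT did not learn appropriate representations for visually similar characters
- CE
  - No improvement on the word-level tasks
  - Perhaps unnecessary for the ELMo architecture?
Results (defense)
- AT + CE
  - Scores improve on all tasks
  - The two methods seem to complement each other?
- RBR
  - Scores improve overall
  - RBR is a much harder (more aggressive) method than the others
  - It may turn words from other languages into meaningless strings?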
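Summary of results
- When using RBR, it is better to pass the text through machine translation first
- AT+CE suppresses domain shift better than CE alone
  - With CE alone, differences among l, i, I, etc. may not be absorbed, so the word is treated as a different word?
  - The character-level CNN is weak at this
- Even with all approaches combined, scores remain well below the clean scores
  - This shows how strong the VIPER attack is against NLP systems
  - This is a major difference between humans and NLP systems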
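Discussion (error analysis)
The TC task has six kinds of flags for toxic comments
- The sum of the flags is defined as TL (toxic level)
The change in TL is examined under VIPER and under each defense approach
Adding a defense approach reduces the change in TL
Perturbing particular words lowers TL
- Perturbing words such as "he" has no effect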
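The TL measure can be sketched as a sum over six binary flags. The flag names below are the label columns of the Kaggle Toxic Comment Classification Challenge data. The function names and the example flag values are hypothetical.

```python
FLAGS = ("toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate")


def toxic_level(flags):
    """TL: the number of the six flags that are set for one comment."""
    return sum(int(flags[name]) for name in FLAGS)


def tl_change(before, after):
    """Change in TL between the clean and the perturbed version of a comment."""
    return toxic_level(after) - toxic_level(before)


# Hypothetical flags for one comment before and after perturbation.
clean = {"toxic": 1, "severe_toxic": 0, "obscene": 1, "threat": 0, "insult": 1, "identity_hate": 0}
perturbed = {"toxic": 1, "severe_toxic": 0, "obscene": 0, "threat": 0, "insult": 0, "identity_hate": 0}
print(toxic_level(clean), tl_change(clean, perturbed))  # 3 -2
```

A negative change means the perturbed comment looks less toxic than the clean one, which is the effect the attack aims for.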
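Conclusion
Proposes VIPER, an attack method based on visual perturbations
- Confirms an accuracy drop for existing SoTA methods
- Humans are robust to VIPER, while NLP systems are weak
Proposes three defenses against VIPER
- Combining them is more effective
Closing the gap between humans and NLP systems will be important for building future systems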

Text processing like humans do : visually attacking and shielding nlp systems[paper survey]