最近の重要な論文の紹介 - テキストとの対応付けによる映像の理解に関連して（ステアラボ人工知能シンポジウム2017）

•

2 gefällt mir•1,977 views

STAIR Lab, Chiba Institute of Technology

講演者: 中島悠太先生（大阪大学）

Technologie

2
 
Deep Semantic Feature  
Sentence Sentence
Embedding
Video
Embedding
Web Images
Embedding Space
Video
“A baby is playing a guitar.”
Image Search
 
Deep Semantic Feature

•
- Xu et al., “Show, attend and tell: Neural image caption generation
with visual attention,” in Proc. ICML 2015.
•
- Grave, Wayne, et al., “Hybrid computing using a neural network with
dynamic external memory,” Nature, vol. 2538, pp.471—476, 2016.
• Adversarial Examples
- Goodfellow, et al., “Exmpaining and harnessing adversarial
examples,” in Proc. ICLR 2015.
3

• Xu, Ba, Kiros, Cho, Courville, Salakhutdinov, Zemel, and Bengio 
“Show, attend and tell: Neural image caption generation with visual attention” 
Proc. ICML 2015

•
•
•
• Visual Question Answering
•
7
Q: what are racing down the 
track with their jockies?
A: horses

•
-
-
•
8Image from: [Nakashima et al. 2012]

• Grave, Wayne, et al.  
“Hybrid computing using a neural network with dynamic external memory”  
Nature, vol. 2538, pp.471—476, 2016

Differentiable neural computer (DCN)
10
Image from: [Grave et al. 2016]

•
-  
•  
DCN
11
:
:  
Controller
Image from: [Grave et al. 2016]

• RNN
/
- 3D-CNN
- Mean/Max pooling
•
-
-
12

Adversarial examples
• Goodfellow, Shlens, and Szegedy 
“Exmpaining and harnessing adversarial examples” 
Proc. ICLR 2015.

Adversarial examples?
•
DNN
14
Images from: [Goodfellow et al. 2015]

• Microsoft Research Video Description Corpus
• > 2000 Video and descriptions
• TVD: a reproducible and multiply aligned TV series dataset
• Big Bang Theory Games of Thrones
• MSR VTT
• > 1M video and description pairs
• MPII Movie Description Dataset
• > 100K clip and description pairs
• YouTube 8M
•
• SumMe
• TVSum
• UG Video Dataset
17

Weitere ähnliche Inhalte

Andere mochten auch

多腕バンディット問題: 定式化と応用 (第13回ステアラボ人工知能セミナー)STAIR Lab, Chiba Institute of Technology

自然言語処理分野の最前線（ステアラボ人工知能シンポジウム2017）STAIR Lab, Chiba Institute of Technology

Higher-order Factorization Machines（第5回ステアラボ人工知能セミナー）STAIR Lab, Chiba Institute of Technology

知識グラフの埋め込みとその応用 (第10回ステアラボ人工知能セミナー)STAIR Lab, Chiba Institute of Technology

高次元空間におけるハブの出現 (第11回ステアラボ人工知能セミナー)STAIR Lab, Chiba Institute of Technology

群衆の知を引き出すための機械学習（第4回ステアラボ人工知能セミナー）STAIR Lab, Chiba Institute of Technology

JSAI Cup2017報告会STAIR Lab, Chiba Institute of Technology

第1回ステアラボ人工知能セミナー（オープニング）STAIR Lab, Chiba Institute of Technology

時系列ビッグデータの特徴自動抽出とリアルタイム将来予測（第9回ステアラボ人工知能セミナー）STAIR Lab, Chiba Institute of Technology

情報抽出入門〜非構造化データを構造化させる技術〜Yuya Unno

深層学習による自然言語処理の研究動向STAIR Lab, Chiba Institute of Technology

深層学習時代の自然言語処理Yuya Unno

Andere mochten auch (12)

多腕バンディット問題: 定式化と応用 (第13回ステアラボ人工知能セミナー)

自然言語処理分野の最前線（ステアラボ人工知能シンポジウム2017）

Higher-order Factorization Machines（第5回ステアラボ人工知能セミナー）

知識グラフの埋め込みとその応用 (第10回ステアラボ人工知能セミナー)

高次元空間におけるハブの出現 (第11回ステアラボ人工知能セミナー)

群衆の知を引き出すための機械学習（第4回ステアラボ人工知能セミナー）

JSAI Cup2017報告会

第1回ステアラボ人工知能セミナー（オープニング）

時系列ビッグデータの特徴自動抽出とリアルタイム将来予測（第9回ステアラボ人工知能セミナー）

情報抽出入門〜非構造化データを構造化させる技術〜

深層学習による自然言語処理の研究動向

深層学習時代の自然言語処理

Mehr von STAIR Lab, Chiba Institute of Technology

リアクティブプログラミングにおける時変値永続化の試み (第2回ステアラボソフトウェア技術セミナー)STAIR Lab, Chiba Institute of Technology

制約解消によるプログラム検証・合成 (第1回ステアラボソフトウェア技術セミナー)STAIR Lab, Chiba Institute of Technology

グラフ構造データに対する深層学習〜創薬・材料科学への応用とその問題点〜 (第26回ステアラボ人工知能セミナー)STAIR Lab, Chiba Institute of Technology

企業化する大学と、公益化する企業。そして、人工知能の社会実装に向けて。(ステアラボ人工知能シンポジウム)STAIR Lab, Chiba Institute of Technology

メテオサーチチャレンジ報告 (2位解法)STAIR Lab, Chiba Institute of Technology

画像キャプションと動作認識の最前線〜データセットに注目して〜（第17回ステアラボ人工知能セミナー）STAIR Lab, Chiba Institute of Technology

文法および流暢性を考慮した頑健なテキスト誤り訂正 (第15回ステアラボ人工知能セミナー)STAIR Lab, Chiba Institute of Technology

Mehr von STAIR Lab, Chiba Institute of Technology (7)

リアクティブプログラミングにおける時変値永続化の試み (第2回ステアラボソフトウェア技術セミナー)

制約解消によるプログラム検証・合成 (第1回ステアラボソフトウェア技術セミナー)

グラフ構造データに対する深層学習〜創薬・材料科学への応用とその問題点〜 (第26回ステアラボ人工知能セミナー)

企業化する大学と、公益化する企業。そして、人工知能の社会実装に向けて。(ステアラボ人工知能シンポジウム)

メテオサーチチャレンジ報告 (2位解法)

画像キャプションと動作認識の最前線〜データセットに注目して〜（第17回ステアラボ人工知能セミナー）

文法および流暢性を考慮した頑健なテキスト誤り訂正 (第15回ステアラボ人工知能セミナー)

Kürzlich hochgeladen

DBX First Quarter 2024 Investor PresentationDropbox

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10

Corporate and higher education May webinar.pptxRustici Software

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays

Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays

Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya

Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz

Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh

Architecting Cloud Native ApplicationsWSO2

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood

Why Teams call analytics are critical to your entire businesspanagenda

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

A Beginners Guide to Building a RAG App Using Open Source MilvusZilliz

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

Kürzlich hochgeladen (20)

DBX First Quarter 2024 Investor Presentation

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...

Corporate and higher education May webinar.pptx

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...

Axa Assurance Maroc - Insurer Innovation Award 2024

Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe

Artificial Intelligence Chap.5 : Uncertainty

Boost Fertility New Invention Ups Success Rates.pdf

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...

Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model

Architecting Cloud Native Applications

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...

Why Teams call analytics are critical to your entire business

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...

AWS Community Day CPH - Three problems of Terraform

A Beginners Guide to Building a RAG App Using Open Source Milvus

TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments

最近の重要な論文の紹介 - テキストとの対応付けによる映像の理解に関連して（ステアラボ人工知能シンポジウム2017）

1. — 2017/3/12

2. 2   Deep Semantic Feature   Sentence Sentence Embedding Video Embedding Web Images Embedding Space Video “A baby is playing a guitar.” Image Search   Deep Semantic Feature

3. • - Xu et al., “Show, attend and tell: Neural image caption generation with visual attention,” in Proc. ICML 2015. • - Grave, Wayne, et al., “Hybrid computing using a neural network with dynamic external memory,” Nature, vol. 2538, pp.471—476, 2016. • Adversarial Examples - Goodfellow, et al., “Exmpaining and harnessing adversarial examples,” in Proc. ICLR 2015. 3

4. • Xu, Ba, Kiros, Cho, Courville, Salakhutdinov, Zemel, and Bengio  “Show, attend and tell: Neural image caption generation with visual attention”  Proc. ICML 2015

5. • • (?) 5 Images from: [Xu et al. 2015]

6. 6

7. • • • • Visual Question Answering • 7 Q: what are racing down the  track with their jockies? A: horses

8. • - - • 8Image from: [Nakashima et al. 2012]

9. • Grave, Wayne, et al.   “Hybrid computing using a neural network with dynamic external memory”   Nature, vol. 2538, pp.471—476, 2016

10. Differentiable neural computer (DCN) 10 Image from: [Grave et al. 2016]

11. • -   •   DCN 11 : :   Controller Image from: [Grave et al. 2016]

12. • RNN / - 3D-CNN - Mean/Max pooling • - - 12

13. Adversarial examples • Goodfellow, Shlens, and Szegedy  “Exmpaining and harnessing adversarial examples”  Proc. ICLR 2015.

14. Adversarial examples? • DNN 14 Images from: [Goodfellow et al. 2015]

15. • DNN • • 15

16. • • • 16

17. • Microsoft Research Video Description Corpus • > 2000 Video and descriptions • TVD: a reproducible and multiply aligned TV series dataset • Big Bang Theory Games of Thrones • MSR VTT • > 1M video and description pairs • MPII Movie Description Dataset • > 100K clip and description pairs • YouTube 8M • • SumMe • TVSum • UG Video Dataset 17

最近の重要な論文の紹介 - テキストとの対応付けによる映像の理解に関連して（ステアラボ人工知能シンポジウム2017）

Empfohlen

Empfohlen

Weitere ähnliche Inhalte

Andere mochten auch

Andere mochten auch (12)

Mehr von STAIR Lab, Chiba Institute of Technology

Mehr von STAIR Lab, Chiba Institute of Technology (7)

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

最近の重要な論文の紹介 - テキストとの対応付けによる映像の理解に関連して（ステアラボ人工知能シンポジウム2017）