Dense-captioning events in videos


Qianyu Feng

seminar share

Technology

Highlight
• Task: dense-captioning events
• Dataset: ActivityNet Captions
• Events range across multiple time scales and can even overlap.
• The proposal module extends action proposal generation to multi-scale event detection and processes each video in a single forward pass, detecting events as they occur.
• Events in a given video are usually related to one another.
• The captioning module uses the context from all the events found by the proposal module to generate each sentence.
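
The point that events span multiple time scales and can overlap can be made concrete with a temporal intersection-over-union (IoU) between two event segments. The following is a minimal sketch, not taken from the slides: the `(start, end)` tuple representation in seconds and the function name `temporal_iou` are assumptions made for illustration.

```python
from typing import Tuple

# Hypothetical representation: an event is a (start, end) pair in seconds.
Segment = Tuple[float, float]


def temporal_iou(a: Segment, b: Segment) -> float:
    """Return the temporal IoU of two segments (0.0 if they do not overlap)."""
    # Length of the shared interval, clamped at zero for disjoint segments.
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    # Union = sum of both lengths minus the part counted twice.
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union > 0 else 0.0


# A long event and a short event nested inside it still overlap,
# but their IoU is low because their time scales differ.
long_event = (0.0, 60.0)
short_event = (10.0, 20.0)
overlap = temporal_iou(long_event, short_event)
```

A low IoU between nested events like these is one reason a detector may need proposals at several temporal scales rather than a single fixed window length.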

DenseCap: Fully Convolutional Localization Networks for Dense Captioning

Method
• V. Escorcia, F. C. Heilbron, J. C. Niebles, and B. Ghanem. DAPs: Deep action proposals for action understanding. ECCV, 2016.
• J. Johnson, A. Karpathy, and L. Fei-Fei. DenseCap: Fully convolutional localization networks for dense captioning.
• A. Alahi, K. Goel, V. Ramanathan, A. Robicquet, L. Fei-Fei, and S. Savarese. Social LSTM: Human trajectory prediction in crowded spaces.
• Object-centric in images; action-centric in videos.

Discussion: Jointly Localizing and Describing Events for Dense Video Captioning

Discussion: Joint Event Detection and Description in Continuous Video Streams

Performance
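Editor's notes

1. Given a video, generate a feature sequence. In the experiments, frames are grouped into units of 16 and fed to C3D to extract features.
2. Proposal module. The proposal module is a slight modification of DAPs: it outputs K proposals at every time step. It uses an LSTM that takes the C3D feature sequence above as input and samples that sequence at different strides, strides = {1, 2, 4, 8}. The generated proposals can overlap in time. Each time an event is detected, the current hidden state is used as the representation of that video segment.
3. Captioning module. It uses the context of neighboring events to generate each event caption, again with an LSTM. All events are divided into two buckets relative to the current event: past events and future events. Concurrent events are assigned to the past or future bucket according to their end times.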
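
The three steps in the notes can be sketched in plain Python. This is an illustrative sketch under stated assumptions, not the authors' implementation: the helper names (`chunk_frames`, `strided_views`, `split_context`), the `(start, end)` event representation, dropping trailing frames that do not fill a 16-frame unit, and sending end-time ties to the future bucket are all assumptions. C3D feature extraction and the LSTMs themselves are omitted.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical representation: an event is a (start, end) pair in seconds.
Event = Tuple[float, float]


def chunk_frames(num_frames: int, clip_len: int = 16) -> List[Tuple[int, int]]:
    """Step 1: split frame indices into non-overlapping units of clip_len frames.

    Returns half-open (first, last + 1) index ranges. Trailing frames that do
    not fill a whole unit are dropped (an assumption, not stated in the notes).
    """
    return [(s, s + clip_len) for s in range(0, num_frames - clip_len + 1, clip_len)]


def strided_views(features: Sequence, strides: Sequence[int] = (1, 2, 4, 8)) -> Dict[int, list]:
    """Step 2: subsample the feature sequence at each stride.

    Each view would be fed to the proposal LSTM; larger strides cover longer
    time spans with the same number of steps.
    """
    return {s: list(features[::s]) for s in strides}


def split_context(events: List[Event], current: int) -> Tuple[List[int], List[int]]:
    """Step 3: bucket all other events into past and future by end time.

    An event that ends before the current event ends goes to the past bucket;
    otherwise it goes to the future bucket (ties go to future, an assumption).
    """
    past, future = [], []
    current_end = events[current][1]
    for i, (_, end) in enumerate(events):
        if i == current:
            continue
        (past if end < current_end else future).append(i)
    return past, future


clips = chunk_frames(40)                       # two full 16-frame units
views = strided_views(list(range(10)))         # toy "features" 0..9
events = [(0.0, 10.0), (5.0, 20.0), (8.0, 12.0), (25.0, 30.0)]
past, future = split_context(events, current=1)
```

In this toy example, event 2 overlaps the current event in time but ends earlier, so it lands in the past bucket, which matches the end-time rule the notes give for concurrent events.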