SlideShare ist ein Scribd-Unternehmen logo
1 von 131
Interactive
Video Search
Klaus Schoeffmann, PhD
Klagenfurt University
Institute of Information Technology
Klagenfurt, Austria
Frank Hopfgartner, PhD
University of Glasgow
School of Humanities
Glasgow, UK
Outline
• Search in video content: motivation and challenges
• Video retrieval and its challenges
• What is interactive video search and how can it help?
• Video browsing
• Video navigation
• Break
• Video content visualization
• Ad-hoc similarity search / video exploration
• Sketch-based search in video
• Evaluation of interactive video search tools
• Visual lifelogging
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 2
Motivation
3Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Videos Everywhere
• Ubiquitous use of videos nowadays
Entertainment and commercials
Social gaming (screencasts)
Personal videos (family, kids, …)
Sports documentation and analysis (e.g., GoPro)
Product usage instructions (e.g., furniture)
Surveillance (buildings, places, street, …)
Lifelogging
Health care and medical science (endoscopic procedures)
• Enormous amount of data!
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 4
Video as the Ultimate Media?
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 5
[Mary Meeker, Liang Wu, Internet Trends, D11 Conference, May, 2013]
As of 2014, every
minute 300 hours of
video are uploaded
to YouTube!
Video Cameras
• Increasingly powerful
These days you can record 4K content with your mobile!
Video sensors use auto-focus, object tracking, color
correction, and image stabilization
Storage space not a big problem
 Current smartphones have 128 GB of memory
 NAS devices cheaply available
Network bandwidth also dramatically increased over years
 Video streaming on the go is simple and common
 LTE connections provide 30 Mbit/s and even much more!
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 6
7Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
[Mary Meeker, Liang Wu, Internet Trends, D11 Conference, May, 2013]
Challenges
• Video data
are a continuous media: the content depends on time!
often contain several media types: image, text, audio
cannot be simply stored and indexed in a data base,
requires own indexing and search methods!
require huge amount of storage space without compression!
may contain a lot of important information,
which is, however, often very subjective!
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 8
Challenges
• Subjectivity of data
How many people use it?
The more the easier!?
• Different levels
 Internet scale (YouTube)
 Country/region
 Company/organization/group
 Individual
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 9
Available
meta-data
How to query for a video clip?
How to efficiently retrieve results?
How to effectively present content to the user?
“Poor Man’s Video Search Tool”
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 10
VCR in the 1970s provided a similar functionality!
Video Data Still Tedious to Use
• Even with video retrieval tools it is still challenging
to find desired video content
Especially if it is not a publicly available (and popular)
Many problems with querying, in particular for novice users!
• The ultimate goal is to make use of and search in
video as effective as for text
Quickly find relevant content
Compare to interactivity of a text book
 Index, ToC, list of figures/tables, etc.
 Change, extend, copy, bookmark, highlight, etc.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 11
Traditional
Video Retrieval
12Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
“Query and browse results”
Search Example (TRECVID KIS 2010)
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 13
Find the “video of President Bush standing near sea vessels with
Coast Guard members talking about his pride of the Coast Guard,
immigration, and security issues”.
Video from IACC
public data set!
TRECVID:
see later!
Video Clip Hidden in Huge Collection
14
Internet Archive
with Creative
Commons (IACC)
data set, as used
for TRECVID:
146,788 shots
(~9,000 videos)
Page 1 2 3 …. 38 39 40
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
1st Trial at YouTube
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 15
2nd Trial at YouTube
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 16
[ Heesch, D., Howarth, P., Magalhaes, J., May, A., Pickering, M., Yavlinsky, A., & Rüger, S. (2004, November). Video retrieval using search and browsing. In TREC Video Retrieval Evaluation Online Proceedings. ]
17
Content-
based
Feature
Example
Image
Text
Ranked list
of shots
In IACC about
5800 pages. 
Temporal
Context
Traditional Video Retrieval Tool
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
More Interactive Retrieval Tool
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 18
[A. Moumtzidou et al., “VERGE: A Multimodal Interactive Video Search Engine”, Proc. of the 21st International Conference on MultiMedia Modeling (MMM 2015), Sydney, 2015]
kNN Similarity search
based on VLAD vectors
Concept detection with SVM and
five local descriptors (SIFT, SURF,
ORB, ...) and PCA
Hierarchical
keyframe clustering
 Interaction
details later
Challenges for
Video Retrieval
19Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Traditional Video Retrieval Approach
“Query-and-Browse-Resoluts” Paradigm
Works well if (and only if)
users can properly express their needs.
content features can sufficiently describe visual content.
computer vision can accurately detect semantics.
20
Content-
based
Search
Ranked Results
Unfortunately, in practice these assumptions do not hold.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Challenges
Content-based features
 How to understand semantics from pixels? Semantic Gap
Both images show
bears in front
of a landscape.
21Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Database affinity of concept classifiers
Low performance in broad domain
P(k) Precision at level k (after k results)
rel(k) defines if kth retrieved document is relevant
Performance
Gap
22
Challenges
TRECVID 2013 Semantic Indexing (SIN-500):
median “inferred average precision” (infAP) < 0.13
In other words:
88% of the results
are not correct!
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
 Query-by-concept
 Which concept to use? Choose from a long list of results…
 Query-by-example
 Typically no perfect example available.
 Query-by-sketch
 Users are no artists 
 Query-by-text
 How to describe a desired image by text?
Usability Gap
23
A picture tells a 1000 words.
by marfis75
How to describe a video clip by text???
Challenges
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Example: Query by Motion Sketch
• Matching based on trajectory descriptor
• Challenge: may differ a lot among different users
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 24
[Ghosal, Koustav, and Anoop Namboodiri. "A Sketch-Based Approach To Video Retrieval Using Qualitative Features." Proceedings of the 2014 Indian
Conference on Computer Vision Graphics and Image Processing. ACM, 2014.]
Well-Known Issues
of “Query-and-Browse-Results” Paradigm
Users cannot formulate or have no query
  provide exploratory search features!
 For example: browsing, filtering, similarity search
Users expect good results (on first page!)
  Use relevance feedback / active learning instead of long lists!
Shots have a temporal context
Videos are dynamic
 Static thumbnails are not informative
 Esp. true for long shots and self-similar content
  skims and visual summaries (“smart playback”)
  sophisticated navigation & content structure visualization
Grid interfaces are not always the best choice
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 25
Usability Gap
 See later
Uniform Sampled Frames
from a Video with High Self-Similarity
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 26
Needs Special Keyframe Extraction
and Object Detection
[Klaus Schoeffmann, Manfred Del Fabro, Tibor Szkaliczki, Laszlo Böszörmenyi, and Jörg Keckstein, “Keyframe Extraction in Endoscopic Video“, in Multimedia Tools and Applications, Springer, August, 2014]
[Manfred J. Primus, Klaus Schoeffmann, and Laszlo Böszörmenyi, “Instrument Classification in Laparoscopic Videos“, in Proceedings of the International Workshop on Content-Based Multimedia Indexing (CBMI 2015),
Prague, Czech Republic, IEEE, 2015, pp. 1-6]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 27
And Special Browsing Tools /
Visualization
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 28
Agglomerative clustering based on visual similarity and temporal information
[Jakub Lokoc, Klaus Schoeffmann, and Manfred Del Fabro, “Dynamic Hierarchical Visualization of Keyframes in Endoscopic Video“, in Proceedings of the 21st International Conference on
MultiMedia Modelling 2015 (MMM 2015), Sydney, Australia, Lecture Notes on Computer Science (LNCS), Vol. 8936, Springer International Publishing, 2015, pp. 291-294]
Where Is the User in Multimedia Retrieval?
29
[Marcel Worring et al., „Where Is the User in Multimedia Retrieval?“, IEEE Multimedia, Vol. 19, No. 4, Oct.-Dec. 2012, pp. 6-10 ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Interactive vs. “Traditional” Retrieval
30Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Interactive Video Search
And how it can help…
31Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Performance
Gap
Interactive Video Search
32
• Mostly interactive search
• Human computation
• Simple-to-use
• Inflexible and tedious for archives
• Low performance (?)
• Mostly automatic search
• Retrieval engine
• Complicated to use
• Flexible and easier (?) for archives
• Limited performance too!
Usability Gap
Novices Experts
 Combines HCI with CV and MIR
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Interactive Video Search
Traditional video retrieval + interactive inspection/exploration/navigation
and rich content visualization in order to satisfy an information need
33
Focuses on search and exploration in
(i) single videos as well as (ii) video collections
Directed Search
Find a specific shot or segment in a video
Find a specific video in an archive
Undirected Search
Searching to discover information
E.g., browse through a video in order to
Learn how the content looks like
See if it is interesting
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Not supported by
trad. video retrieval
Interactive Video Search
• Tries to strongly integrate user into search process
Search – Inspect – Think – Repeat
 Exploratory search (“will know it when I see it”)
 Instead of „query-and-browse-results“
User controls search process
 Inspects and interacts
 Most meaningful tool for current need, e.g.
• Content Browsing/Navigation
• Content Visualization and Summarization
• Ad-hoc Querying (e.g., by sketch, filtering, ad-hoc example)
34Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
User Studies with Significance Tests!
• Many interfaces proposed without proper evaluation
• Interface A better than interface B?
 comparative user study needed!
 Perform search tasks in exactly the
same setting (data, environment, etc.)
 Logging of interaction behavior
and task solve time
 Questionnaire about subjective workloads
 Statistical analysis with proper tests
(e.g., t-test, ANOVA, Wilcoxon signed-rank, etc.)
• User simulations?
• Evaluation competitions
 Same data set
 Comparative evaluation
 TRECVID, MediaEval, Video Browser Showdown (see later)
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 35
IVS Tools:
Video Browsing
36Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
37
Video Browsing
[ F. Arman, R. Depommier, A. Hsu, and M-Y. Chiu, Content-based Browsing of Video Sequences, in Proc. of ACM International Conference on Multimedia, 1994, pp. 97-103 ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
The ThumbBrowser
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 38
[Marco Hudelist, Klaus Schoeffmann, Laszlo Böszörmenyi. “Mobile Video Browsing with the ThumbBrowser”, Proc. of the International Conference on Multimedia, 2013, pp. 405-406 ]
Video Browser for the Digital Native
39
[Adams, Brett, Stewart Greenhill, and Svetha Venkatesh. "Towards a video browser for the digital native." Multimedia and Expo Workshops (ICMEW), 2012 IEEE International
Conference on. IEEE, 2012.]
“Temporal Semantic Compression” based on tempo function and shot popularity (insight)
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Video Browser for the Digital Native
• User study with 8 participants
 Test configuration elements by two tasks
(after presentation + 5 minutes training)
 (i) Browse a familiar movie to find scenes you remember
 (ii) Browse an unfamiliar movie to get a feel for its story or structure
 Questionnaire with
Likert-scale ratings
40
[Adams, Brett, Stewart Greenhill, and Svetha Venkatesh. "Towards a video browser for the digital native." Multimedia and Expo Workshops (ICMEW), 2012 IEEE International
Conference on. IEEE, 2012.]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Thread-Based Browsing of Retrieval Results
• Thread: linked seq. of shots in a specified order
 Query results, visual similarity, semantic similarity, textual similarity
time, …
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 41
[De Rooij, Ork, Cees GM Snoek, and Marcel Worring. "Balancing thread based navigation for targeted video search." Proceedings of the 2008 international conference on Content-based image
and video retrieval (CIVR). ACM, 2008.]
Thread-Based Browsing of Retrieval Results
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 42
[Ork de
Rooij et al.]
IVS Tools:
Video Navigation
43Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Improving Navigation
44
e.g., on YouTube
default window:
640 pixels = frames
(25 seconds)
Common seeker-bar limits
navigation granularity
[Huerst et al., ICME 2007]
ZoomSlider
Improvements (selected):
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Navigation with a Seeker-Bar
Idea of the ZoomSlider
45
Wolfgang Hürst, Georg Götz, and Martina Welte, “Interactive video browsing on mobile devices”, in Proceedings of the 15th International Conference on Multimedia (MULTIMEDIA '07). ACM, pp. 247-256, 2007
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
46Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Improving Navigation
47
e.g., on YouTube
default window:
640 pixels = frames
(25 seconds)
Common seeker-bar limits
navigation granularity
[Dragicevic et al., CHI 2008]
Direct
Manipulation
[Huerst et al., ICME 2007]
ZoomSlider
Improvements (selected):
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Relative Flow Dragging
Background Stabilization
48
Pierre Dragicevic, Gonzalo Ramos, Jacobo Bibliowitcz, Derek Nowrouzezahrai, Ravin Balakrishnan, and Karan Singh. “Video browsing by direct manipulation”, in Proceedings of the SIGCHI Conference on Human Factors
in Computing Systems (CHI '08). ACM, pp. 237-246, 2008
Video browsing by direct manipulation / relative flow dragging
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Relative Flow Dragging
• Evaluation with a user study
 16 participants (18-44 years old)
 Direct comparison to seeker-bar navigation
 Navigation tasks, 2 videos (ladybug, cars)
 “Find the position where the ladybug passes over marker X”
 “Find the moment when car X starts moving”
 Flow dragging significantly faster (RM-ANOVA)
by at least 250% (also significantly less errors)
49
Pierre Dragicevic, Gonzalo Ramos, Jacobo Bibliowitcz, Derek Nowrouzezahrai, Ravin Balakrishnan, and Karan Singh. “Video browsing by direct manipulation”, in Proceedings of the SIGCHI Conference on Human Factors
in Computing Systems (CHI '08). ACM, pp. 237-246, 2008
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
How Do Users Search in
Video With a Common
Video Player?
50Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
How do Users Search with Video Players
(Navigate with Seeker-Bars)?
• User study with more than 30 participants
• Known Item Search Tasks
51
[Claudiu Cobarzan and Klaus Schoeffmann, “How do Users Search with Basic HTML5 Video Players?“, in Proceedings of The 20th International Conference on MultiMedia Modeling (MMM2014), 2014]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
How do Users Search with Video Players
(Navigate with Seeker-Bars)?
52
[Claudiu Cobarzan and Klaus Schoeffmann, “How do Users Search with Basic HTML5 Video Players?“, in Proceedings of The 20th International Conference on MultiMedia Modeling (MMM2014), 2014]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
How do Users Search with Video Players
(Navigate with Seeker-Bars)?
53
[Claudiu Cobarzan and Klaus Schoeffmann, “How do Users Search with Basic HTML5 Video Players?“, in Proceedings of The 20th International Conference on MultiMedia Modeling (MMM2014), 2014]
Vast amount of content was checked with normal playback!?
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
How do Users Search with Video Players
(Navigate with Seeker-Bars)?
Timeinvideo(ms)
54
[Claudiu Cobarzan and Klaus Schoeffmann, “How do Users Search with Basic HTML5 Video Players?“, in Proceedings of The 20th International Conference on MultiMedia Modeling (MMM2014), 2014]
User start with rough navigation and look more carefully after narrowing down the search area!
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Target segment
Improving
Video Navigation on
Touch Devices
55Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Session continues at 3:20 pm
56Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Keyframe Navigation Tree
• Consider findings of study on navigation behavior
• Basic idea inspired by frame stripes (MO images)
• Goal
 very compact visualization
 not as fine as frames but not as coarse as keyframes of shots
 provide different granularity levels for navigation
 previous work has shown that
users typically navigate in a coarse-to-fine grained manner
57
[ Schoeffmann, K., Taschwer, M., & Boeszoermenyi, L. (2010, February). The video explorer: a tool for navigation and searching within a single video based
on fast content analysis. In Proceedings of the first annual ACM SIGMM conference on Multimedia systems (pp. 247-258). ACM. ]
[Mueller-Seelich, H., Tan, E.: Visualizing the semantic structure of film and video (2000) ]
compact overview but abstract & very high level of detail (frame-based!)
[Xiaoxiao Luo, Qing Xu, Mateu Sbert, Klaus Schoeffmann, “F-Divergences Driven Video Key Frame Extraction“, in Proc. of the IEEE Int. Conference on Multimedia & Expo (ICME 2014), Chengdu, China, 2014, pp. 6]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
• Keyframe selection based on sub-shots (JSD with color histograms)
Cover all important scenes even for long shots (e.g. pans)
Excerpts with three levels of detail: L1: narrow, L2: wide, L3: full keyframe
Used as seeker-bar with synchronized interaction for all levels
Simple touch-based interaction (tap and wipe gestures)
58
L1 (30 shots)
L2 (~ 12 shots)
L3 (~ 3 shots)
Keyframe Navigation Tree
[Hudelist, Marco A., Klaus Schoeffmann, and Qing Xu. "Improving interactive known-item search in video with the keyframe navigation tree." MultiMedia Modeling. Springer International Publishing, 2015.]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
59Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Keyframe Navigation Tree
• Video Player vs. KNT Browser (iPad, 4th generation, 9.7-inch)
• User study with 20 participants (15m/5f)
 Age: 18-40 (mean 28.15, s.d. 6.08)
• Known-item search tasks
 Given 20 seconds long target clip
 Find correct clip in 1-h long video as fast as possible
 200 search tasks in total
 Each participant performed random selection of 10 tasks (5/5)
 latin-square principle to avoid familiarization effects
 Time-out after 3 minutes (“unanswered”)
 Wrong results marked as “erroneous”
• Time measurement, logging, questionnaire (Likert-scale ratings)
60Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Keyframe Navigation Tree
Search Time & Performance
• Task solve time, erroneous trials, unanswered trials…
61
28.11
44.18
KNT Browser statistically significantly faster
acc. to dependent paired-samples t-test (t(19) = -3.937 p < 0.005)
7 trials
10 trials
5 trials
22 trials
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
[Hudelist, Marco A., Klaus Schoeffmann, and Qing Xu. "Improving interactive known-item search in video with the keyframe navigation tree." MultiMedia Modeling. Springer International Publishing, 2015.]
Keyframe Navigation Tree
Subjective Rating
• NASA Task-Load-Index (TLX) questionnaires
62
KNT browser significantly
better in all 7 categories!
Acc. to Wilcoxon signed-
rank tests
(for details see paper)
KNT browser is preferred
search tool for 85% of
tested users (17/20)
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
[Hudelist, Marco A., Klaus Schoeffmann, and Qing Xu. "Improving interactive known-item search in video with the keyframe navigation tree." MultiMedia Modeling. Springer International Publishing, 2015.]
IVS Tools:
Content Visualization
63Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Grid Interfaces Aren‘t Enough!
• Many video retrieval systems use a Grid interface!?
Moreover, a grid interface does not allow
for fast human visual search (see later)!
64
A ranked list of results does not convey
the temporal content structure!
• To which video does a shot belong to?
• What is the sequence of shots?
• How long is a shot / scene?
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Table of Video Content
(TOVC)
[Goeau et al., ICME 2007]
65
Squeeze / Fisheye
Rapid Visual Serial
Presentation (RSVP)
Improving Visualization
aka “Video Surrogates”
[Wildemuth et al., 2003]
[Wittenburg et al., 2005]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
66
VideoTree
[Jansen et al., CBMI 2008]
However, outperformed by
simple “grid of keyframes”
in terms of search time.
Similar concept proposed later
[Girgensohn et al., ICMR 2011]
• Split-based clustering algorithm with
color correlograms.
• Tree not directly shown to the user
(only one level).
Improving Visualization
aka “Video Surrogates”
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
3D Ring Interface
• Utilization of screen real estate
 Large set of images
 Minor occlusion, slight distortion
• Intuitive interaction
 Rotate and zoom
• Content-based sorting
• “Pop-out images” (in the back)
• Further advantages
 Immediately continue on miss,
scaling
67
Klaus Schoeffmann, David Ahlström, and Marco Andrea Hudelist, “3-D Interfaces to Improve the Performance of Visual Known-Item Search“,
in IEEE Transactions on Multimedia, Vol. 16, No. 7, November, 2014, pp. 1942-1951.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
3D Ring Interface - Perspectives
Preferred Design acc. to user study
25% Vertical 66% Horizontal 8.3% Frontal
68
Klaus Schoeffmann, David Ahlström, and Marco Andrea Hudelist, “3-D Interfaces to Improve the Performance of Visual Known-Item Search“,
in IEEE Transactions on Multimedia, Vol. 16, No. 7, November, 2014, pp. 1942-1951.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
3D interface significantly faster than grid by 12.7%
User Study: Grid vs. Ring (both sorted)
150 images, 12 participants, 1440 trials
69
Klaus Schoeffmann, David Ahlström, and Marco Andrea Hudelist, “3-D Interfaces to Improve the Performance of Visual Known-Item Search“,
in IEEE Transactions on Multimedia, Vol. 16, No. 7, November, 2014, pp. 1942-1951.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Extension: Multiple Rings with Vertical Scrolling
70
Klaus Schoeffmann. 2014. The Stack-of-Rings Interface for Large-Scale Image Browsing on Mobile Touch Devices. In Proc. of the ACM Int. Conference on Multimedia (MM '14). ACM, New York, NY, USA, 1097-1100.
Significantly faster search (by about 48%) than common image browser on iPad!
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
IVS Tools:
Ad-Hoc Similarity Search
71Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
The Video Explorer
72
[ Schoeffmann, K., Taschwer, M., & Boeszoermenyi, L. (2010, February). The video explorer: a tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the first annual
ACM SIGMM conference on Multimedia systems (pp. 247-258). ACM. ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Interactive Navigation Summaries
Allows a user to quickly identify
similar/repeating scenes
73
[ Schoeffmann, K., & Boeszoermenyi, L. (2009, June). Video browsing using interactive navigation summaries. In Content-Based Multimedia Indexing, 2009. CBMI'09. Seventh Int.Workshop on (pp. 243-248). IEEE. ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Motion Layout: Direction + Intensity
Motion Vector (µ) classification into
K=12 equidistant motion directions
Mapping to Hue channel
74
[ Schoeffmann, K., Lux, M., Taschwer, M., & Boeszoermenyi, L. (2009, June). Visualization of video motion in context of video browsing. In Multimedia and Expo, 2009. ICME 2009. IEEE Int. Conf. on (pp. 658-661). IEEE. ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
75
[ Schoeffmann, K., Lux, M., Taschwer, M., & Boeszoermenyi, L. (2009, June). Visualization of video motion in context of video browsing. In Multimedia and Expo, 2009. ICME 2009. IEEE Int. Conf. on (pp. 658-661). IEEE. ]
Similarity Search (SOI) with Motion Layout
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
• SOI Search
 Motion-based search by example sequence
 Using Motion Direction histogram Db
 User-selected sequence
 Find most similar sequences
 Compute distance to any possible seq. of same length
 Match if below spec. threshold
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 76
Motion Layout (Db)
Match 1 Match 2 Match 3
frame 1 frame n
Similarity Search (SOI) with Motion Layout
Region-of-Interest (ROI) Search
 User selects spatial region-of-interest
 On search
 Compute Euclidian distance of frame F
to every other frame f (acc. to selected region)
 Based on color layout descriptor
…
frame F
frame 1 frame k frame n
User-selected
region (I)
…
d(F,1)=350 d(F,k)=8 d(F,n)=400
77
[ Schoeffmann, K., Taschwer, M., & Boeszoermenyi, L. (2010, February). The video explorer: a tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the first annual
ACM SIGMM conference on Multimedia systems (pp. 247-258). ACM. ]
Similarity Search (ROI) with Color Layout
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
78
[ Schoeffmann, K., Taschwer, M., & Boeszoermenyi, L. (2010, February). The video explorer: a tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the first annual
ACM SIGMM conference on Multimedia systems (pp. 247-258). ACM. ]
Similarity Search (ROI) with Color Layout
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
IVS Tools:
Sketch-Based Search
79Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
• Color sketches mapped to
feature signatures
• Matched to those of
keyframes
80
1. Sampling keypoints
2. Description through location (x,y),
CIE Lab, contrast and entropy of
surrounding pixels
3. K-means clustering
Feature Signatures
[ Kruliš, M., Lokoč, J. and Skopal, T. (2013). Efficient Extraction of Feature Signatures Using Multi-GPU Architecture. Springer Berlin Heidelberg, LNCS 7733, pp.446-456. ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Feature Signature-Based Video Browser
81
Color Sketch
(Signature)
Player
Winner of Video Browser Showdown 2014 + 2015
Download demo at: http://siret.ms.mff.cuni.cz/lokoc/vbs.zip
2nd Color Sketch
(optional)
[ Lokoč, J., Blažek, A., & Skopal, T. (2014, January). Signature-Based Video Browser. In MultiMedia Modeling (pp. 415-418). Springer International Publishing. ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Compact visualization
Simple color-position sketch
Negative
example
Matched key-frames
Time to
2nd sketch
2nd optional
sketch
Interactive-navigation summaryOn demand neighborhood expansion
[Slide: Adam Blazek et al.
(siret research group, Czech Republic)]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 82
Compact Visualization in Detail
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 83
[Courtesy of Jakub Lokoc et al.]
Another Example of a Color-Based Browser
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 84
Browsing
Area
Map
Segment
Inspection
Color SketchCategories
[Kai Uwe Barthel, Nico Hezel, Radek Mackowiak. Navigating a graph of scenes for exploring large video collections, in Proc. of 22nd International Conference on MultiMedia Modeling (MMM 2016), Lecture Notes in
Computer Science (LNCS), Vol. tbd, Springer International Publishing, 2016, pp. 1-7]
Evaluation of
IVS Tools
85Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
TRECVID
http://trecvid.nist.gov/
• International video retrieval competition evaluation
 Annually performed by NIST (Gaithersburg, Maryland, USA)
 Funded by NIST and other US government agencies
 Benchmark for researchers using same data
 Origin in TREC (Text REtrieval Conference, since 1992)
• Founded in 2003, by
 Alan Smeaton (Dublin City University)
 Wessel Kraaij (TNO-ICT, Delft)
• International advisory Committee
 Alex Hauptmann (CMU)
 Michael Lew (Leiden Institute of Advanced Computer Science)
 Georges Quenot (LIG, Grenoble)
 John Smith (IBM Research)
 …
• Local organisation
 Paul Over (NIST)
86Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
TRECVID Known-item Search
TRECVID KIS (2010-2012)
models the situation in which
“someone knows of a video, has seen it before, believes it is
contained in a collection, but doesn‘t know where to look”
Automatic Search
 Text-description about the video
 Return ranked list of 100 videos (out of 9000)
Interactive Search
 Pre-processing based on text query
 Searcher browses through result list (e.g., keyframes of shots)
• Interactively find target video as fast as possible
• Within 5 minutes
87Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
TRECVID Known-item Search
The Performance of State-of-The-Art Video Retrieval Tools
Known items not found by any team:
Interactive Automatic out of
2010 5 / 24 21% 69 / 300 22% 15 teams
2011 6 / 25 24% 142 / 391 36% 9 teams
2012 2 / 24 17% 108 / 361 29% 9 teams
From: [Alan Smeaton, Paul Over, “Known-Item Search @ TRECVID 2012”, NIST, 2012]
88Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
MediaEval 2015
• Search and Anchoring in Video Archives
“Search for Multimedia Content”
 Multi-model textual and visual descriptions of content of interest
“Automatic Anchor Selection”
 Predict key elements of videos as anchor points for hyperlinking
Professional (BBC) and non-professional content (users)
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 89
http://www.multimediaeval.org
Video Browser Showdown (VBS)
• Annual performance evaluation competition
 Live evaluation of search performance
 Special session at Int. Conference on MultiMedia Modeling (MMM)
 Demonstrates and evaluates state-of-the-art interactive video search tools
 Idea influenced by VideOlympics (Snoek et al., IEEE Multimedia 2008)
• Focus
 Known-item Search tasks
 Target clips are presented on site
 Teams search in shared data set
 Highly interactive search
 Should push research on interfaces
and interaction/navigation
 Experts and Novices
 Easy-to-use tools and methods
90
Teams connected to a server that issues tasks and evaluates submitted results
http://videobrowsershowdown.org/
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Video Browser Showdown (VBS)
• Scoring through VBS Server
• Score (s) [0-100] for task i and team k is based on
Solve time (t)
Penalty (p) based on
number of submissions (m)
91
Maximum solve time (Tmax)
typically 3-5 minutes
[Schoeffmann, K., Ahlström, D., Bailer, W., Cobârzan, C., Hopfgartner, F., McGuinness, K., ... & Weiss, W. (2013). The Video Browser Showdown: a live evaluation of interactive video search tools. International Journal
of Multimedia Information Retrieval, 1-15. ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Video Browser Showdown 2015
• Search in mid-sized video collections (2016: 200 hours)
 Originally only single video search
• Two different kind of tasks:
 Visual: visual presentation of a 30s target clip
 Textual: textual description of a 30s target clip
• Shared video data from BBC
 2015: 153 video files, about 100.000 shots (9 Mio frames)
Participants:
• Need to find the target clips as quickly as possible
• Get points for each task (the faster the better)
• But only for submission of exact location of target clip
[Schoeffmann, Klaus. "A user-centric media retrieval competition: The video browser showdown 2012-2014." MultiMedia, IEEE 21.4 (2014): 8-13.]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 92
93Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
2012: Klagenfurt
11 teams
2013: Huangshan
6 teams
2014: Dublin
7 teams
2015: Sydney
9 teams
VBS 2016: January 5, 2016, Miami, USA (MMM 2016)
http://www.videobrowsershowdown.org/
Video Browser Showdown 2012
Two examples (of the 11 tools; single video search only)
94
[Xiangyu Chen, Jin Yuan, Liqiang Nie, Zheng-Jun Zha, Shuicheng Yan, and Tat-Seng Chua, "TRECVID 2010
Known-item Search by NUS", in Proceedings of TRECVID 2010 workshop, NIST, Gaithersburgh, USA, 2011
Jin Yuan, Huanbo Luan, Dejun Hou, Han Zhang, Yan-Tao Zheng, Zheng-Jun Zha, and Tat-Seng Chua, "Video
Browser Showdown by NUS", in Proceedings of th 18th International Conference on Multimedia Modeling
(MMM) 2012, Klagenfurt, Austria, pp. 642-645]
• Keyframe extraction (shots)
• ASR and OCR
• HLF (Concepts)
• RF with Related Samples
• Uniform sampled keyframes
(with flexible distance)
• Parallel playback + navigation
[Manfred Del Fabro and Laszlo Böszörmenyi, "AAU Video Browser: Non-
Sequential Hierarchical Video Browsing without Content Analysis", in
Proceedings of th 18th International Conference on Multimedia Modeling
(MMM) 2012, Klagenfurt, Austria, pp. 639-641]
Winner of VBS 2012
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Winner 2014 and 2015
(2014: single video and collection search, 2015: collection only)
95
Color Sketch
(Signature)
Player
2nd Color Sketch
(optional)
[ Lokoč, J., Blažek, A., & Skopal, T. (2014, January). Signature-Based Video Browser. In MultiMedia Modeling (pp. 415-418). Springer International Publishing. ]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Video Browser Showdown 2015
Two examples (of the 9tools, collection search only)
96
Moumtzidou, A., Avgerinakis, K., Apostolidis, E., Markatopoulou, F., Apostolidis, K., Mironidis, T., ... &
Patras, I. (2015, January). VERGE: A Multimodal Interactive Video Search Engine. In MultiMedia Modeling
(pp. 249-254). Springer International Publishing.
• Shot and scene detection
• HLF (Concepts) with
SIFT/SURF and VLAD
• Similarity search
• Uniform sampled frames
• Human computation
Hürst, W., van de Werken, R., & Hoet, M. (2015, January). A Storyboard-Based
Interface for Mobile Video Browsing. In MultiMedia Modeling (pp. 261-265).
Springer International Publishing.
3rd place
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
97Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
URL: http://mklab-services.iti.gr/vss2015/ [Courtesy of Stefanos Vrochidis]
98Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
[Courtesy of Stefanos Vrochidis]
99Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Similarity Search Results
[Courtesy of Stefanos Vrochidis]
100Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
[Courtesy of Stefanos Vrochidis]
Human vs. Machine
• Utrecht University @ VBS 2015
 Wolfgang Huerst et al., The Netherlands
 Strong experience in HCI
• Features
 Uniformly sampled thumbs
(1 second distance)
 Huge storyboard on tablet
 Vertical scrolling, paging
101
625 thumbnails in one screen
[Hürst, W., van de Werken, R., & Hoet, M. (2015, January). A Storyboard-Based Interface for Mobile Video Browsing. In MultiMedia Modeling (pp. 261-265). Springer International Publishing.]
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
Visual Lifelogging
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 102
Slides by Frank Hopfgartner
What is The Quantified Self?
The Quantified Self is about obtaining self-knowledge
through self-tracking.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 103
What is The Quantified Self?
Self-tracking is also referred to as lifelogging, self-
analysis, or self-hacking.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 104
Memex
Bush, Vannevar. "As We May Think." The Atlantic Monthly. July 1945.
ImagesofMemex:http://trevor.smith.name/memex/
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 105
MyLifeBits
• Gordon Bell (Microsoft)
digitized his life:
Books written
Personal documents
Photos
Posters, paintings, photo of
things
Home movies and videos
CD collection
PC files
…
Gordon Bell and Jim Gemmell. Total Recall: How the E-Memory Revolution will change everything, New York, Dutton 2009
http://research.microsoft.com/en-us/projects/mylifebits/
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 106
Creating Personal Lifebraries
A lifebrary consists of heterogeneous data recorded
using many different sensors.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 107
Recording what I eat
Aizawa, Kiyoharu, Maruyama, Yutu, Li, He, and Morikawa, Chamin. “Food Balance Estimation by Using Personal Dietrary Tendencies in a Multimedia Food Log." IEEE Transactions on Multimedia, 15(8):2176-2185, 2013.
Semantic Gap
http://foodlog.jp/
http://mealsnap.com/
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 108
Recording what I see
"LifeGlogging cameras 1998 2004 2006 2013 labeled" by Glogger - Own work. Licensed under CC BY-SA 3.0 via Commons -
https://commons.wikimedia.org/wiki/File:LifeGlogging_cameras_1998_2004_2006_2013_labeled.jpg#/media/File:LifeGlogging_cameras_1998_2004_2006_2013_labeled.jpg
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 109
Visual Lifelogging
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 110
Example: Visual Lifelog of a day
5,500 pictures a day
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 111
[Slide: C. Gurrin, DCU]
Big Data
Cathal Gurrin, Alan F. Smeaton and Aiden R. Doherty (2014), "LifeLogging: Personal Big Data", Foundations and Trends® in Information Retrieval: Vol. 8: No. 1, pp 1-125.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 112
Semantic Analysis
• Context cues help us to
remember (Naaman et al.)
• Context in lifelogging data:
 Location, bluetooth, time, date,
…
 Derived Knowledge (e.g.
activities)
• Approaches:
 Combine cues from different
sources
 Perform content analysis to
identify objects, people, events…
 Annotate lifelogs in form of
narrative text
Mor Naaman, Susumu Harada, QianYing Wang, Hector Garcia-Molina, Andreas Paepcke: Context data in geo-referenced digital photo collections. ACM Multimedia 2004: 196-203
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 113
[Slide: C. Gurrin, DCU]
Visual Feature Extraction
 Steering wheel (72%)
 Shopping (75%)
 Inside of vehicle when not driving (airplane, taxi, car,
bus) (60%)
 Toilet/Bathroom (58%)
 Giving Presentation / Teaching (29%)
 View of Horizon (23%)
 Door (62%)
 Staircase (48%)
 Hands (68%)
 Holding a cup/glass (35%)
 Holding a mobile phone (39%)
 Eating food (41%)
 Screen (computer/laptop/tv) (78%)
 Reading paper/book (58%)
 Meeting (34%)
 Road (47%)
 Vegetation (64%)
 Office Scene (72%)
 Faces (61%)
 People (45%)
 Grass (61%)
 Sky (79%)
 Tree (63%)
Byrne, Daragh, Doherty, Aiden R., Snoek, Cees G. M., Jones, Gareth J. F., Smeaton, Alan F. “Everyday concept detection in visual lifelogs: validation, relationships and trends." Multimedia Tools and Applications, 49(1):119-144, 2010.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 114
A day
This does not work well…
Let’s add event segmentation.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 115
[Slide: C. Gurrin, DCU]
Event Segmentation & Annotation
• Segment 5,500 photos per day into a set of events
 Similar to SBD in digital video processing
 We employ visual features and output of on-device sensors
Multiple Events
Finishing work in
the lab
At the bus stop Chatting at Skylon Hotel lobby Moving to a
room
Tea time On the way
back home
Event Segmentation
Summarization
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 116
[Slide: C. Gurrin, DCU]
Non-supervised Event
Segmentation
2. Arriving
in the office
6. Walking in
the building 12. Leaving
the office
Na Li et al. “Random Matrix Ensembles of Time Correlation Matrices to Analyze Visual Lifelogs." In Proc. Multimedia Modeling Conference, Dublin, Ireland, pp. 400-411, 2014.
Event Segmentation based on the
extraction of low level features and
computation of semantic concepts
requires knowledge about dataset.
Alternative: Highlight “significant events”
by performing time series analysis
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 117
MyLifeBits
Gordon Bell and Jim Gemmell. Total Recall: How the E-Memory Revolution will change everything, New York, Dutton 2009
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 118
MyLifeBits
Gordon Bell and Jim Gemmell. Total Recall: How the E-Memory Revolution will change everything, New York, Dutton 2009
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 119
Virtual reality
“Bad Trip is an immersive virtual
reality installation […] that enables
people to navigate the creator's
mind using a game controller.
Since November 2011, every
moments of his life has been
documented by a video camera
mounted on glasses, producing an
expanding database of digitalized
visual memories. Using custom
virtual reality software, he created a
virtual mindscape where people
could navigate, and experience his
memories and dreams.”
Souce: http://www.kwanalan.com
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 120
Art installations
Kelly, Philip and Doherty, Aiden R. and Smeaton, Alan F. and Gurrin, Cathal and O’Connor, Noel E. “The Colour of Life: Novel Visualisations of Population Lifestyles." In Proc. ACM Multimedia, pp. 1063-1066, 2010.
Image:CourtesyofC.Gurrin
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 121
Video Summary
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 122
[Courtesy of T. Plumbaum]
NTCIR
• Workshop series focusing on research on
Information Access technologies (information
retrieval, question answering, text
summarisation, etc)
• Sponsored by Japan Society for Promotion of
Science (JSPS)
• Organised since 1997 in an 18-months cycle
• NTCIR-12: January 2015 – June 2016
NII Test Collection for IR Systems
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 123
NTCIR-12 Tasks
NTCIR-12
 Second round:
 Search-Intent Mining
 Mobile Click
 Temporal Information Access
 Spoken Query & Spoken Document Retrieval
 QA Lab for Entrance Exam
 First round:
 Medical NLP for Clinical Documents
 Personal Lifelog Access & Retrieval
 Short Text Conversation
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 124
Encourage research advances in organising
and retrieving from lifelog data.
LifeLog @ NTCIR-12
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 125
Multimodal dataset with information
needs
Created by
various
individuals over
10+ days
TESTCOLLECTION
 1,500 images, location, GSR,
heart-rate, others… per
lifelogger per day
 Accompanying output of
1,000 concepts
 Data processed pre-release
(removal of personal content;
face blurring, translation of
concepts)
 Detailed user queries and
judgments generated by the
lifelogging data gatherers
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 126
Tasks
Evaluate different methods of
retrieval and access.
T1:LIFELOGSEMANTICACCESS(LSAT)
 Models the retrieval need
from lifelogs (Known-item
Search)
 Retrieve N segments that
match information need
 Interactive or Automatic
participation
 Interactive: Time limit for fair
and comparative evaluation in
an interactive system with
users
 Automatic: Fully-automatic
retrieval system. Automated
query processing
T2:LIFELOGINSIGHT
 Models the need for
reflection over lifelog data
 Exploratory task, the aim is to:
 Encourage broad
participation
 Novel methods to
visualize and explore
lifelogs
 Same data as LSAT task
 Presented via demo/poster
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 127
Task 1: Lifelog Semantic Access
Find the
moment(s)
where I use my
coffee machine.
Find the
moment(s)
where I am in
the kitchen
Find the
moment(s)
where I am
playing with my
phone.
Find the
moment(s)
where I am
preparing
breakfast.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 128
Task 2: Lifelog Insight Task
Provide insights
on the time I
spend taking
breakfast.
Provide insights
on the time I
spend driving to
work.
Provide insights
on the time I
spend reading a
paper.
Provide insights
on the time I
spend working
on the computer.
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 129
Further information
http://ntcir-lifelog.computing.dcu.ie/
21 Sep 2015: Release of formal run
collection and task data (topics)
15 Dec 2015: Deadline for formal run
submissions
15 Jan 2016: Formal run evaluation
results return
01 Mar 2016: Paper for the Proceedings
7-10 Jun 2016: NCTIR-12 conference
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 130
The End
Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 131

Weitere ähnliche Inhalte

Ähnlich wie Interactive Video Search - Tutorial at ACM Multimedia 2015

Screencasting and Podcasting: Experience of the Yale Medical Library
Screencasting and Podcasting: Experience of the Yale Medical LibraryScreencasting and Podcasting: Experience of the Yale Medical Library
Screencasting and Podcasting: Experience of the Yale Medical Librarywlei
 
Participatory Media Literacy Uwi2009
Participatory Media Literacy Uwi2009Participatory Media Literacy Uwi2009
Participatory Media Literacy Uwi2009urauch
 
Gareth Jones 1209
Gareth Jones 1209Gareth Jones 1209
Gareth Jones 1209Yandex
 
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and ...
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and ...Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and ...
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and ...Brandon Muramatsu
 
Multimedia Information Retrieval: Bytes and pixels meet the challenges of hum...
Multimedia Information Retrieval: Bytes and pixels meet the challenges of hum...Multimedia Information Retrieval: Bytes and pixels meet the challenges of hum...
Multimedia Information Retrieval: Bytes and pixels meet the challenges of hum...maranlar
 
Interactive Video and Adult Education
Interactive Video and Adult EducationInteractive Video and Adult Education
Interactive Video and Adult EducationCarolyn Guertin
 
Learning with (re)Purpose: How to Turn Any Event into Durable Online Video Le...
Learning with (re)Purpose: How to Turn Any Event into Durable Online Video Le...Learning with (re)Purpose: How to Turn Any Event into Durable Online Video Le...
Learning with (re)Purpose: How to Turn Any Event into Durable Online Video Le...Michael Kolowich
 
Kaltura Digital Media Hub Launch - Graham McElearney et. al.
Kaltura Digital Media Hub Launch - Graham McElearney et. al.Kaltura Digital Media Hub Launch - Graham McElearney et. al.
Kaltura Digital Media Hub Launch - Graham McElearney et. al.telshef
 
What can users do for multimedia?
What can users do for multimedia?What can users do for multimedia?
What can users do for multimedia?Lora Aroyo
 
Participatory Media Literacy Diverse2008
Participatory Media Literacy Diverse2008Participatory Media Literacy Diverse2008
Participatory Media Literacy Diverse2008urauch
 
The difference between content curation and content creation
The difference between content curation and content creationThe difference between content curation and content creation
The difference between content curation and content creationSafeWorkday SafetyPoints(R)
 
Final draft sr. proj. paper
Final draft sr. proj. paperFinal draft sr. proj. paper
Final draft sr. proj. paperAshley Toups
 
Creating Media Stories, SSSW July 13
Creating Media Stories, SSSW July 13Creating Media Stories, SSSW July 13
Creating Media Stories, SSSW July 13Lynda Hardman
 
SpokenMedia: Automatic Lecture Transcription and Rich Media Notebooks
SpokenMedia: Automatic Lecture Transcription and Rich Media NotebooksSpokenMedia: Automatic Lecture Transcription and Rich Media Notebooks
SpokenMedia: Automatic Lecture Transcription and Rich Media NotebooksBrandon Muramatsu
 
ALT Podcasting workshop, 14 April 2010
ALT Podcasting workshop, 14 April 2010ALT Podcasting workshop, 14 April 2010
ALT Podcasting workshop, 14 April 2010Andrew Middleton
 
Convenient Discovery of Archived Video Using Audiovisual Hyperlinking
Convenient Discovery of Archived Video Using Audiovisual HyperlinkingConvenient Discovery of Archived Video Using Audiovisual Hyperlinking
Convenient Discovery of Archived Video Using Audiovisual HyperlinkingBenoit HUET
 
CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...
CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...
CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...Juho Kim
 
How to prepare a perfect video abstract for your research paper – Pubrica.pdf
How to prepare a perfect video abstract for your research paper – Pubrica.pdfHow to prepare a perfect video abstract for your research paper – Pubrica.pdf
How to prepare a perfect video abstract for your research paper – Pubrica.pdfPubrica
 

Ähnlich wie Interactive Video Search - Tutorial at ACM Multimedia 2015 (20)

Screencasting and Podcasting: Experience of the Yale Medical Library
Screencasting and Podcasting: Experience of the Yale Medical LibraryScreencasting and Podcasting: Experience of the Yale Medical Library
Screencasting and Podcasting: Experience of the Yale Medical Library
 
Participatory Media Literacy Uwi2009
Participatory Media Literacy Uwi2009Participatory Media Literacy Uwi2009
Participatory Media Literacy Uwi2009
 
Gareth Jones 1209
Gareth Jones 1209Gareth Jones 1209
Gareth Jones 1209
 
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and ...
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and ...Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and ...
Improving the OER Experience: Enabling Rich Media Notebooks of OER Video and ...
 
Multimedia Information Retrieval: Bytes and pixels meet the challenges of hum...
Multimedia Information Retrieval: Bytes and pixels meet the challenges of hum...Multimedia Information Retrieval: Bytes and pixels meet the challenges of hum...
Multimedia Information Retrieval: Bytes and pixels meet the challenges of hum...
 
Interactive Video and Adult Education
Interactive Video and Adult EducationInteractive Video and Adult Education
Interactive Video and Adult Education
 
Learning with (re)Purpose: How to Turn Any Event into Durable Online Video Le...
Learning with (re)Purpose: How to Turn Any Event into Durable Online Video Le...Learning with (re)Purpose: How to Turn Any Event into Durable Online Video Le...
Learning with (re)Purpose: How to Turn Any Event into Durable Online Video Le...
 
Kaltura Digital Media Hub Launch - Graham McElearney et. al.
Kaltura Digital Media Hub Launch - Graham McElearney et. al.Kaltura Digital Media Hub Launch - Graham McElearney et. al.
Kaltura Digital Media Hub Launch - Graham McElearney et. al.
 
What can users do for multimedia?
What can users do for multimedia?What can users do for multimedia?
What can users do for multimedia?
 
Participatory Media Literacy Diverse2008
Participatory Media Literacy Diverse2008Participatory Media Literacy Diverse2008
Participatory Media Literacy Diverse2008
 
Video Thumbnail Selector
Video Thumbnail SelectorVideo Thumbnail Selector
Video Thumbnail Selector
 
Torun2013
Torun2013Torun2013
Torun2013
 
The difference between content curation and content creation
The difference between content curation and content creationThe difference between content curation and content creation
The difference between content curation and content creation
 
Final draft sr. proj. paper
Final draft sr. proj. paperFinal draft sr. proj. paper
Final draft sr. proj. paper
 
Creating Media Stories, SSSW July 13
Creating Media Stories, SSSW July 13Creating Media Stories, SSSW July 13
Creating Media Stories, SSSW July 13
 
SpokenMedia: Automatic Lecture Transcription and Rich Media Notebooks
SpokenMedia: Automatic Lecture Transcription and Rich Media NotebooksSpokenMedia: Automatic Lecture Transcription and Rich Media Notebooks
SpokenMedia: Automatic Lecture Transcription and Rich Media Notebooks
 
ALT Podcasting workshop, 14 April 2010
ALT Podcasting workshop, 14 April 2010ALT Podcasting workshop, 14 April 2010
ALT Podcasting workshop, 14 April 2010
 
Convenient Discovery of Archived Video Using Audiovisual Hyperlinking
Convenient Discovery of Archived Video Using Audiovisual HyperlinkingConvenient Discovery of Archived Video Using Audiovisual Hyperlinking
Convenient Discovery of Archived Video Using Audiovisual Hyperlinking
 
CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...
CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...
CHI2014 Workshop - Leveraging Video Interaction Data and Content Analysis to ...
 
How to prepare a perfect video abstract for your research paper – Pubrica.pdf
How to prepare a perfect video abstract for your research paper – Pubrica.pdfHow to prepare a perfect video abstract for your research paper – Pubrica.pdf
How to prepare a perfect video abstract for your research paper – Pubrica.pdf
 

Mehr von klschoef

Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...klschoef
 
Medical Video Processing (Tutorial)
Medical Video Processing (Tutorial)Medical Video Processing (Tutorial)
Medical Video Processing (Tutorial)klschoef
 
Medical Multimedia Systems and Applications
Medical Multimedia Systems and ApplicationsMedical Multimedia Systems and Applications
Medical Multimedia Systems and Applicationsklschoef
 
Detecting Semantics in Endoscopic Videos with Deep Neural Networks
Detecting Semantics in Endoscopic Videos with Deep Neural NetworksDetecting Semantics in Endoscopic Videos with Deep Neural Networks
Detecting Semantics in Endoscopic Videos with Deep Neural Networksklschoef
 
The Importance of Medical Multimedia
The Importance of Medical MultimediaThe Importance of Medical Multimedia
The Importance of Medical Multimediaklschoef
 
Video Browser Showdown 2018 - Results
Video Browser Showdown 2018 - ResultsVideo Browser Showdown 2018 - Results
Video Browser Showdown 2018 - Resultsklschoef
 
Video Browser Showdown 2018
Video Browser Showdown 2018Video Browser Showdown 2018
Video Browser Showdown 2018klschoef
 
Medical Multimedia Information Systems (ACMMM17 Tutorial)
Medical Multimedia Information Systems (ACMMM17 Tutorial) Medical Multimedia Information Systems (ACMMM17 Tutorial)
Medical Multimedia Information Systems (ACMMM17 Tutorial) klschoef
 

Mehr von klschoef (8)

Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
Relevant Content Detection in Cataract Surgery Videos (Invited Talk 1 at IPTA...
 
Medical Video Processing (Tutorial)
Medical Video Processing (Tutorial)Medical Video Processing (Tutorial)
Medical Video Processing (Tutorial)
 
Medical Multimedia Systems and Applications
Medical Multimedia Systems and ApplicationsMedical Multimedia Systems and Applications
Medical Multimedia Systems and Applications
 
Detecting Semantics in Endoscopic Videos with Deep Neural Networks
Detecting Semantics in Endoscopic Videos with Deep Neural NetworksDetecting Semantics in Endoscopic Videos with Deep Neural Networks
Detecting Semantics in Endoscopic Videos with Deep Neural Networks
 
The Importance of Medical Multimedia
The Importance of Medical MultimediaThe Importance of Medical Multimedia
The Importance of Medical Multimedia
 
Video Browser Showdown 2018 - Results
Video Browser Showdown 2018 - ResultsVideo Browser Showdown 2018 - Results
Video Browser Showdown 2018 - Results
 
Video Browser Showdown 2018
Video Browser Showdown 2018Video Browser Showdown 2018
Video Browser Showdown 2018
 
Medical Multimedia Information Systems (ACMMM17 Tutorial)
Medical Multimedia Information Systems (ACMMM17 Tutorial) Medical Multimedia Information Systems (ACMMM17 Tutorial)
Medical Multimedia Information Systems (ACMMM17 Tutorial)
 

Kürzlich hochgeladen

Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxMohamedFarag457087
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfSumit Kumar yadav
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptxSilpa
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxseri bangash
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Silpa
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsOrtegaSyrineMay
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxSilpa
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....muralinath2
 

Kürzlich hochgeladen (20)

Digital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptxDigital Dentistry.Digital Dentistryvv.pptx
Digital Dentistry.Digital Dentistryvv.pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Human genetics..........................pptx
Human genetics..........................pptxHuman genetics..........................pptx
Human genetics..........................pptx
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
Genome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptxGenome sequencing,shotgun sequencing.pptx
Genome sequencing,shotgun sequencing.pptx
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 

Interactive Video Search - Tutorial at ACM Multimedia 2015

  • 1. Interactive Video Search Klaus Schoeffmann, PhD Klagenfurt University Institute of Information Technology Klagenfurt, Austria Frank Hopfgartner, PhD University of Glasgow School of Humanities Glasgow, UK
  • 2. Outline • Search in video content: motivation and challenges • Video retrieval and its challenges • What is interactive video search and how can it help? • Video browsing • Video navigation • Break • Video content visualization • Ad-hoc similarity search / video exploration • Sketch-based search in video • Evaluation of interactive video search tools • Visual lifelogging Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 2
  • 3. Motivation 3Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 4. Videos Everywhere • Ubiquitous use of videos nowadays Entertainment and commercials Social gaming (screencasts) Personal videos (family, kids, …) Sports documentation and analysis (e.g., GoPro) Product usage instructions (e.g., furniture) Surveillance (buildings, places, street, …) Lifelogging Health care and medical science (endoscopic procedures) • Enormous amount of data! Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 4
  • 5. Video as the Ultimate Media? Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 5 [Mary Meeker, Liang Wu, Internet Trends, D11 Conference, May, 2013] As of 2014, every minute 300 hours of video are uploaded to YouTube!
  • 6. Video Cameras • Increasingly powerful These days you can record 4K content with your mobile! Video sensors use auto-focus, object tracking, color correction, and image stabilization Storage space not a big problem  Current smartphones have 128 GB of memory  NAS devices cheaply available Network bandwidth also dramatically increased over years  Video streaming on the go is simple and common  LTE connections provide 30 Mbit/s and even much more! Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 6
  • 7. 7Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search [Mary Meeker, Liang Wu, Internet Trends, D11 Conference, May, 2013]
  • 8. Challenges • Video data are a continuous media: the content depends on time! often contain several media types: image, text, audio cannot be simply stored and indexed in a data base, requires own indexing and search methods! require huge amount of storage space without compression! may contain a lot of important information, which is, however, often very subjective! Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 8
  • 9. Challenges • Subjectivity of data How many people use it? The more the easier!? • Different levels  Internet scale (YouTube)  Country/region  Company/organization/group  Individual Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 9 Available meta-data How to query for a video clip? How to efficiently retrieve results? How to effectively present content to the user?
  • 10. “Poor Man’s Video Search Tool” Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 10 VCR in the 1970s provided a similar functionality!
  • 11. Video Data Still Tedious to Use • Even with video retrieval tools it is still challenging to find desired video content Especially if it is not a publicly available (and popular) Many problems with querying, in particular for novice users! • The ultimate goal is to make use of and search in video as effective as for text Quickly find relevant content Compare to interactivity of a text book  Index, ToC, list of figures/tables, etc.  Change, extend, copy, bookmark, highlight, etc. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 11
  • 12. Traditional Video Retrieval 12Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search “Query and browse results”
  • 13. Search Example (TRECVID KIS 2010) Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 13 Find the “video of President Bush standing near sea vessels with Coast Guard members talking about his pride of the Coast Guard, immigration, and security issues”. Video from IACC public data set! TRECVID: see later!
  • 14. Video Clip Hidden in Huge Collection 14 Internet Archive with Creative Commons (IACC) data set, as used for TRECVID: 146,788 shots (~9,000 videos) Page 1 2 3 …. 38 39 40 Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 15. 1st Trial at YouTube Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 15
  • 16. 2nd Trial at YouTube Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 16
  • 17. [ Heesch, D., Howarth, P., Magalhaes, J., May, A., Pickering, M., Yavlinsky, A., & Rüger, S. (2004, November). Video retrieval using search and browsing. In TREC Video Retrieval Evaluation Online Proceedings. ] 17 Content- based Feature Example Image Text Ranked list of shots In IACC about 5800 pages.  Temporal Context Traditional Video Retrieval Tool Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 18. More Interactive Retrieval Tool Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 18 [A. Moumtzidou et al., “VERGE: A Multimodal Interactive Video Search Engine”, Proc. of the 21st International Conference on MultiMedia Modeling (MMM 2015), Sydney, 2015] kNN Similarity search based on VLAD vectors Concept detection with SVM and five local descriptors (SIFT, SURF, ORB, ...) and PCA Hierarchical keyframe clustering  Interaction details later
  • 19. Challenges for Video Retrieval 19Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 20. Traditional Video Retrieval Approach “Query-and-Browse-Resoluts” Paradigm Works well if (and only if) users can properly express their needs. content features can sufficiently describe visual content. computer vision can accurately detect semantics. 20 Content- based Search Ranked Results Unfortunately, in practice these assumptions do not hold. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 21. Challenges Content-based features  How to understand semantics from pixels? Semantic Gap Both images show bears in front of a landscape. 21Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 22. Database affinity of concept classifiers Low performance in broad domain P(k) Precision at level k (after k results) rel(k) defines if kth retrieved document is relevant Performance Gap 22 Challenges TRECVID 2013 Semantic Indexing (SIN-500): median “inferred average precision” (infAP) < 0.13 In other words: 88% of the results are not correct! Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 23.  Query-by-concept  Which concept to use? Choose from a long list of results…  Query-by-example  Typically no perfect example available.  Query-by-sketch  Users are no artists   Query-by-text  How to describe a desired image by text? Usability Gap 23 A picture tells a 1000 words. by marfis75 How to describe a video clip by text??? Challenges Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 24. Example: Query by Motion Sketch • Matching based on trajectory descriptor • Challenge: may differ a lot among different users Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 24 [Ghosal, Koustav, and Anoop Namboodiri. "A Sketch-Based Approach To Video Retrieval Using Qualitative Features." Proceedings of the 2014 Indian Conference on Computer Vision Graphics and Image Processing. ACM, 2014.]
  • 25. Well-Known Issues of “Query-and-Browse-Results” Paradigm Users cannot formulate or have no query   provide exploratory search features!  For example: browsing, filtering, similarity search Users expect good results (on first page!)   Use relevance feedback / active learning instead of long lists! Shots have a temporal context Videos are dynamic  Static thumbnails are not informative  Esp. true for long shots and self-similar content   skims and visual summaries (“smart playback”)   sophisticated navigation & content structure visualization Grid interfaces are not always the best choice Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 25 Usability Gap  See later
  • 26. Uniform Sampled Frames from a Video with High Self-Similarity Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 26
  • 27. Needs Special Keyframe Extraction and Object Detection [Klaus Schoeffmann, Manfred Del Fabro, Tibor Szkaliczki, Laszlo Böszörmenyi, and Jörg Keckstein, “Keyframe Extraction in Endoscopic Video“, in Multimedia Tools and Applications, Springer, August, 2014] [Manfred J. Primus, Klaus Schoeffmann, and Laszlo Böszörmenyi, “Instrument Classification in Laparoscopic Videos“, in Proceedings of the International Workshop on Content-Based Multimedia Indexing (CBMI 2015), Prague, Czech Republic, IEEE, 2015, pp. 1-6] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 27
  • 28. And Special Browsing Tools / Visualization Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 28 Agglomerative clustering based on visual similarity and temporal information [Jakub Lokoc, Klaus Schoeffmann, and Manfred Del Fabro, “Dynamic Hierarchical Visualization of Keyframes in Endoscopic Video“, in Proceedings of the 21st International Conference on MultiMedia Modelling 2015 (MMM 2015), Sydney, Australia, Lecture Notes on Computer Science (LNCS), Vol. 8936, Springer International Publishing, 2015, pp. 291-294]
  • 29. Where Is the User in Multimedia Retrieval? 29 [Marcel Worring et al., „Where Is the User in Multimedia Retrieval?“, IEEE Multimedia, Vol. 19, No. 4, Oct.-Dec. 2012, pp. 6-10 ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 30. Interactive vs. “Traditional” Retrieval 30Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 31. Interactive Video Search And how it can help… 31Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 32. Performance Gap Interactive Video Search 32 • Mostly interactive search • Human computation • Simple-to-use • Inflexible and tedious for archives • Low performance (?) • Mostly automatic search • Retrieval engine • Complicated to use • Flexible and easier (?) for archives • Limited performance too! Usability Gap Novices Experts  Combines HCI with CV and MIR Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 33. Interactive Video Search Traditional video retrieval + interactive inspection/exploration/navigation and rich content visualization in order to satisfy an information need 33 Focuses on search and exploration in (i) single videos as well as (ii) video collections Directed Search Find a specific shot or segment in a video Find a specific video in an archive Undirected Search Searching to discover information E.g., browse through a video in order to Learn how the content looks like See if it is interesting Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search Not supported by trad. video retrieval
  • 34. Interactive Video Search • Tries to strongly integrate user into search process Search – Inspect – Think – Repeat  Exploratory search (“will know it when I see it”)  Instead of „query-and-browse-results“ User controls search process  Inspects and interacts  Most meaningful tool for current need, e.g. • Content Browsing/Navigation • Content Visualization and Summarization • Ad-hoc Querying (e.g., by sketch, filtering, ad-hoc example) 34Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 35. User Studies with Significance Tests! • Many interfaces proposed without proper evaluation • Interface A better than interface B?  comparative user study needed!  Perform search tasks in exactly the same setting (data, environment, etc.)  Logging of interaction behavior and task solve time  Questionnaire about subjective workloads  Statistical analysis with proper tests (e.g., t-test, ANOVA, Wilcoxon signed-rank, etc.) • User simulations? • Evaluation competitions  Same data set  Comparative evaluation  TRECVID, MediaEval, Video Browser Showdown (see later) Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 35
  • 36. IVS Tools: Video Browsing 36Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 37. 37 Video Browsing [ F. Arman, R. Depommier, A. Hsu, and M-Y. Chiu, Content-based Browsing of Video Sequences, in Proc. of ACM International Conference on Multimedia, 1994, pp. 97-103 ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 38. The ThumbBrowser Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 38 [Marco Hudelist, Klaus Schoeffmann, Laszlo Böszörmenyi. “Mobile Video Browsing with the ThumbBrowser”, Proc. of the International Conference on Multimedia, 2013, pp. 405-406 ]
  • 39. Video Browser for the Digital Native 39 [Adams, Brett, Stewart Greenhill, and Svetha Venkatesh. "Towards a video browser for the digital native." Multimedia and Expo Workshops (ICMEW), 2012 IEEE International Conference on. IEEE, 2012.] “Temporal Semantic Compression” based on tempo function and shot popularity (insight) Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 40. Video Browser for the Digital Native • User study with 8 participants  Test configuration elements by two tasks (after presentation + 5 minutes training)  (i) Browse a familiar movie to find scenes you remember  (ii) Browse an unfamiliar movie to get a feel for its story or structure  Questionnaire with Likert-scale ratings 40 [Adams, Brett, Stewart Greenhill, and Svetha Venkatesh. "Towards a video browser for the digital native." Multimedia and Expo Workshops (ICMEW), 2012 IEEE International Conference on. IEEE, 2012.] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 41. Thread-Based Browsing of Retrieval Results • Thread: linked seq. of shots in a specified order  Query results, visual similarity, semantic similarity, textual similarity time, … Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 41 [De Rooij, Ork, Cees GM Snoek, and Marcel Worring. "Balancing thread based navigation for targeted video search." Proceedings of the 2008 international conference on Content-based image and video retrieval (CIVR). ACM, 2008.]
  • 42. Thread-Based Browsing of Retrieval Results Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 42 [Ork de Rooij et al.]
  • 43. IVS Tools: Video Navigation 43Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 44. Improving Navigation 44 e.g., on YouTube default window: 640 pixels = frames (25 seconds) Common seeker-bar limits navigation granularity [Huerst et al., ICME 2007] ZoomSlider Improvements (selected): Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 45. Navigation with a Seeker-Bar Idea of the ZoomSlider 45 Wolfgang Hürst, Georg Götz, and Martina Welte, “Interactive video browsing on mobile devices”, in Proceedings of the 15th International Conference on Multimedia (MULTIMEDIA '07). ACM, pp. 247-256, 2007 Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 46. 46Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 47. Improving Navigation 47 e.g., on YouTube default window: 640 pixels = frames (25 seconds) Common seeker-bar limits navigation granularity [Dragicevic et al., CHI 2008] Direct Manipulation [Huerst et al., ICME 2007] ZoomSlider Improvements (selected): Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 48. Relative Flow Dragging Background Stabilization 48 Pierre Dragicevic, Gonzalo Ramos, Jacobo Bibliowitcz, Derek Nowrouzezahrai, Ravin Balakrishnan, and Karan Singh. “Video browsing by direct manipulation”, in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '08). ACM, pp. 237-246, 2008 Video browsing by direct manipulation / relative flow dragging Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 49. Relative Flow Dragging • Evaluation with a user study  16 participants (18-44 years old)  Direct comparison to seeker-bar navigation  Navigation tasks, 2 videos (ladybug, cars)  “Find the position where the ladybug passes over marker X”  “Find the moment when car X starts moving”  Flow dragging significantly faster (RM-ANOVA) by at least 250% (also significantly less errors) 49 Pierre Dragicevic, Gonzalo Ramos, Jacobo Bibliowitcz, Derek Nowrouzezahrai, Ravin Balakrishnan, and Karan Singh. “Video browsing by direct manipulation”, in Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI '08). ACM, pp. 237-246, 2008 Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 50. How Do Users Search in Video With a Common Video Player? 50Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 51. How do Users Search with Video Players (Navigate with Seeker-Bars)? • User study with more than 30 participants • Known Item Search Tasks 51 [Claudiu Cobarzan and Klaus Schoeffmann, “How do Users Search with Basic HTML5 Video Players?“, in Proceedings of The 20th International Conference on MultiMedia Modeling (MMM2014), 2014] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 52. How do Users Search with Video Players (Navigate with Seeker-Bars)? 52 [Claudiu Cobarzan and Klaus Schoeffmann, “How do Users Search with Basic HTML5 Video Players?“, in Proceedings of The 20th International Conference on MultiMedia Modeling (MMM2014), 2014] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 53. How do Users Search with Video Players (Navigate with Seeker-Bars)? 53 [Claudiu Cobarzan and Klaus Schoeffmann, “How do Users Search with Basic HTML5 Video Players?“, in Proceedings of The 20th International Conference on MultiMedia Modeling (MMM2014), 2014] Vast amount of content was checked with normal playback!? Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 54. How do Users Search with Video Players (Navigate with Seeker-Bars)? Timeinvideo(ms) 54 [Claudiu Cobarzan and Klaus Schoeffmann, “How do Users Search with Basic HTML5 Video Players?“, in Proceedings of The 20th International Conference on MultiMedia Modeling (MMM2014), 2014] User start with rough navigation and look more carefully after narrowing down the search area! Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search Target segment
  • 55. Improving Video Navigation on Touch Devices 55Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search Session continues at 3:20 pm
  • 56. 56Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 57. Keyframe Navigation Tree • Consider findings of study on navigation behavior • Basic idea inspired by frame stripes (MO images) • Goal  very compact visualization  not as fine as frames but not as coarse as keyframes of shots  provide different granularity levels for navigation  previous work has shown that users typically navigate in a coarse-to-fine grained manner 57 [ Schoeffmann, K., Taschwer, M., & Boeszoermenyi, L. (2010, February). The video explorer: a tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the first annual ACM SIGMM conference on Multimedia systems (pp. 247-258). ACM. ] [Mueller-Seelich, H., Tan, E.: Visualizing the semantic structure of film and video (2000) ] compact overview but abstract & very high level of detail (frame-based!) [Xiaoxiao Luo, Qing Xu, Mateu Sbert, Klaus Schoeffmann, “F-Divergences Driven Video Key Frame Extraction“, in Proc. of the IEEE Int. Conference on Multimedia & Expo (ICME 2014), Chengdu, China, 2014, pp. 6] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 58. • Keyframe selection based on sub-shots (JSD with color histograms) Cover all important scenes even for long shots (e.g. pans) Excerpts with three levels of detail: L1: narrow, L2: wide, L3: full keyframe Used as seeker-bar with synchronized interaction for all levels Simple touch-based interaction (tap and wipe gestures) 58 L1 (30 shots) L2 (~ 12 shots) L3 (~ 3 shots) Keyframe Navigation Tree [Hudelist, Marco A., Klaus Schoeffmann, and Qing Xu. "Improving interactive known-item search in video with the keyframe navigation tree." MultiMedia Modeling. Springer International Publishing, 2015.] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 59. 59Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 60. Keyframe Navigation Tree • Video Player vs. KNT Browser (iPad, 4th generation, 9.7-inch) • User study with 20 participants (15m/5f)  Age: 18-40 (mean 28.15, s.d. 6.08) • Known-item search tasks  Given 20 seconds long target clip  Find correct clip in 1-h long video as fast as possible  200 search tasks in total  Each participant performed random selection of 10 tasks (5/5)  latin-square principle to avoid familiarization effects  Time-out after 3 minutes (“unanswered”)  Wrong results marked as “erroneous” • Time measurement, logging, questionnaire (Likert-scale ratings) 60Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 61. Keyframe Navigation Tree Search Time & Performance • Task solve time, erroneous trials, unanswered trials… 61 28.11 44.18 KNT Browser statistically significantly faster acc. to dependent paired-samples t-test (t(19) = -3.937 p < 0.005) 7 trials 10 trials 5 trials 22 trials Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search [Hudelist, Marco A., Klaus Schoeffmann, and Qing Xu. "Improving interactive known-item search in video with the keyframe navigation tree." MultiMedia Modeling. Springer International Publishing, 2015.]
  • 62. Keyframe Navigation Tree Subjective Rating • NASA Task-Load-Index (TLX) questionnaires 62 KNT browser significantly better in all 7 categories! Acc. to Wilcoxon signed- rank tests (for details see paper) KNT browser is preferred search tool for 85% of tested users (17/20) Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search [Hudelist, Marco A., Klaus Schoeffmann, and Qing Xu. "Improving interactive known-item search in video with the keyframe navigation tree." MultiMedia Modeling. Springer International Publishing, 2015.]
  • 63. IVS Tools: Content Visualization 63Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 64. Grid Interfaces Aren‘t Enough! • Many video retrieval systems use a Grid interface!? Moreover, a grid interface does not allow for fast human visual search (see later)! 64 A ranked list of results does not convey the temporal content structure! • To which video does a shot belong to? • What is the sequence of shots? • How long is a shot / scene? Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 65. Table of Video Content (TOVC) [Goeau et al., ICME 2007] 65 Squeeze / Fisheye Rapid Visual Serial Presentation (RSVP) Improving Visualization aka “Video Surrogates” [Wildemuth et al., 2003] [Wittenburg et al., 2005] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 66. 66 VideoTree [Jansen et al., CBMI 2008] However, outperformed by simple “grid of keyframes” in terms of search time. Similar concept proposed later [Girgensohn et al., ICMR 2011] • Split-based clustering algorithm with color correlograms. • Tree not directly shown to the user (only one level). Improving Visualization aka “Video Surrogates” Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 67. 3D Ring Interface • Utilization of screen real estate  Large set of images  Minor occlusion, slight distortion • Intuitive interaction  Rotate and zoom • Content-based sorting • “Pop-out images” (in the back) • Further advantages  Immediately continue on miss, scaling 67 Klaus Schoeffmann, David Ahlström, and Marco Andrea Hudelist, “3-D Interfaces to Improve the Performance of Visual Known-Item Search“, in IEEE Transactions on Multimedia, Vol. 16, No. 7, November, 2014, pp. 1942-1951. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 68. 3D Ring Interface - Perspectives Preferred Design acc. to user study 25% Vertical 66% Horizontal 8.3% Frontal 68 Klaus Schoeffmann, David Ahlström, and Marco Andrea Hudelist, “3-D Interfaces to Improve the Performance of Visual Known-Item Search“, in IEEE Transactions on Multimedia, Vol. 16, No. 7, November, 2014, pp. 1942-1951. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 69. 3D interface significantly faster than grid by 12.7% User Study: Grid vs. Ring (both sorted) 150 images, 12 participants, 1440 trials 69 Klaus Schoeffmann, David Ahlström, and Marco Andrea Hudelist, “3-D Interfaces to Improve the Performance of Visual Known-Item Search“, in IEEE Transactions on Multimedia, Vol. 16, No. 7, November, 2014, pp. 1942-1951. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 70. Extension: Multiple Rings with Vertical Scrolling 70 Klaus Schoeffmann. 2014. The Stack-of-Rings Interface for Large-Scale Image Browsing on Mobile Touch Devices. In Proc. of the ACM Int. Conference on Multimedia (MM '14). ACM, New York, NY, USA, 1097-1100. Significantly faster search (by about 48%) than common image browser on iPad! Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 71. IVS Tools: Ad-Hoc Similarity Search 71Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 72. The Video Explorer 72 [ Schoeffmann, K., Taschwer, M., & Boeszoermenyi, L. (2010, February). The video explorer: a tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the first annual ACM SIGMM conference on Multimedia systems (pp. 247-258). ACM. ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 73. Interactive Navigation Summaries Allows a user to quickly identify similar/repeating scenes 73 [ Schoeffmann, K., & Boeszoermenyi, L. (2009, June). Video browsing using interactive navigation summaries. In Content-Based Multimedia Indexing, 2009. CBMI'09. Seventh Int.Workshop on (pp. 243-248). IEEE. ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 74. Motion Layout: Direction + Intensity Motion Vector (µ) classification into K=12 equidistant motion directions Mapping to Hue channel 74 [ Schoeffmann, K., Lux, M., Taschwer, M., & Boeszoermenyi, L. (2009, June). Visualization of video motion in context of video browsing. In Multimedia and Expo, 2009. ICME 2009. IEEE Int. Conf. on (pp. 658-661). IEEE. ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 75. 75 [ Schoeffmann, K., Lux, M., Taschwer, M., & Boeszoermenyi, L. (2009, June). Visualization of video motion in context of video browsing. In Multimedia and Expo, 2009. ICME 2009. IEEE Int. Conf. on (pp. 658-661). IEEE. ] Similarity Search (SOI) with Motion Layout Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 76. • SOI Search  Motion-based search by example sequence  Using Motion Direction histogram Db  User-selected sequence  Find most similar sequences  Compute distance to any possible seq. of same length  Match if below spec. threshold Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 76 Motion Layout (Db) Match 1 Match 2 Match 3 frame 1 frame n Similarity Search (SOI) with Motion Layout
  • 77. Region-of-Interest (ROI) Search  User selects spatial region-of-interest  On search  Compute Euclidian distance of frame F to every other frame f (acc. to selected region)  Based on color layout descriptor … frame F frame 1 frame k frame n User-selected region (I) … d(F,1)=350 d(F,k)=8 d(F,n)=400 77 [ Schoeffmann, K., Taschwer, M., & Boeszoermenyi, L. (2010, February). The video explorer: a tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the first annual ACM SIGMM conference on Multimedia systems (pp. 247-258). ACM. ] Similarity Search (ROI) with Color Layout Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 78. 78 [ Schoeffmann, K., Taschwer, M., & Boeszoermenyi, L. (2010, February). The video explorer: a tool for navigation and searching within a single video based on fast content analysis. In Proceedings of the first annual ACM SIGMM conference on Multimedia systems (pp. 247-258). ACM. ] Similarity Search (ROI) with Color Layout Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 79. IVS Tools: Sketch-Based Search 79Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 80. • Color sketches mapped to feature signatures • Matched to those of keyframes 80 1. Sampling keypoints 2. Description through location (x,y), CIE Lab, contrast and entropy of surrounding pixels 3. K-means clustering Feature Signatures [ Kruliš, M., Lokoč, J. and Skopal, T. (2013). Efficient Extraction of Feature Signatures Using Multi-GPU Architecture. Springer Berlin Heidelberg, LNCS 7733, pp.446-456. ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 81. Feature Signature-Based Video Browser 81 Color Sketch (Signature) Player Winner of Video Browser Showdown 2014 + 2015 Download demo at: http://siret.ms.mff.cuni.cz/lokoc/vbs.zip 2nd Color Sketch (optional) [ Lokoč, J., Blažek, A., & Skopal, T. (2014, January). Signature-Based Video Browser. In MultiMedia Modeling (pp. 415-418). Springer International Publishing. ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 82. Compact visualization Simple color-position sketch Negative example Matched key-frames Time to 2nd sketch 2nd optional sketch Interactive-navigation summaryOn demand neighborhood expansion [Slide: Adam Blazek et al. (siret research group, Czech Republic)] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 82
  • 83. Compact Visualization in Detail Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 83 [Courtesy of Jakub Lokoc et al.]
  • 84. Another Example of a Color-Based Browser Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 84 Browsing Area Map Segment Inspection Color SketchCategories [Kai Uwe Barthel, Nico Hezel, Radek Mackowiak. Navigating a graph of scenes for exploring large video collections, in Proc. of 22nd International Conference on MultiMedia Modeling (MMM 2016), Lecture Notes in Computer Science (LNCS), Vol. tbd, Springer International Publishing, 2016, pp. 1-7]
  • 85. Evaluation of IVS Tools 85Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 86. TRECVID http://trecvid.nist.gov/ • International video retrieval competition evaluation  Annually performed by NIST (Gaithersburg, Maryland, USA)  Funded by NIST and other US government agencies  Benchmark for researchers using same data  Origin in TREC (Text REtrieval Conference, since 1992) • Founded in 2003, by  Alan Smeaton (Dublin City University)  Wessel Kraaij (TNO-ICT, Delft) • International advisory Committee  Alex Hauptmann (CMU)  Michael Lew (Leiden Institute of Advanced Computer Science)  Georges Quenot (LIG, Grenoble)  John Smith (IBM Research)  … • Local organisation  Paul Over (NIST) 86Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 87. TRECVID Known-item Search TRECVID KIS (2010-2012) models the situation in which “someone knows of a video, has seen it before, believes it is contained in a collection, but doesn‘t know where to look” Automatic Search  Text-description about the video  Return ranked list of 100 videos (out of 9000) Interactive Search  Pre-processing based on text query  Searcher browses through result list (e.g., keyframes of shots) • Interactively find target video as fast as possible • Within 5 minutes 87Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 88. TRECVID Known-item Search The Performance of State-of-The-Art Video Retrieval Tools Known items not found by any team: Interactive Automatic out of 2010 5 / 24 21% 69 / 300 22% 15 teams 2011 6 / 25 24% 142 / 391 36% 9 teams 2012 2 / 24 17% 108 / 361 29% 9 teams From: [Alan Smeaton, Paul Over, “Known-Item Search @ TRECVID 2012”, NIST, 2012] 88Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 89. MediaEval 2015 • Search and Anchoring in Video Archives “Search for Multimedia Content”  Multi-model textual and visual descriptions of content of interest “Automatic Anchor Selection”  Predict key elements of videos as anchor points for hyperlinking Professional (BBC) and non-professional content (users) Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 89 http://www.multimediaeval.org
  • 90. Video Browser Showdown (VBS) • Annual performance evaluation competition  Live evaluation of search performance  Special session at Int. Conference on MultiMedia Modeling (MMM)  Demonstrates and evaluates state-of-the-art interactive video search tools  Idea influenced by VideOlympics (Snoek et al., IEEE Multimedia 2008) • Focus  Known-item Search tasks  Target clips are presented on site  Teams search in shared data set  Highly interactive search  Should push research on interfaces and interaction/navigation  Experts and Novices  Easy-to-use tools and methods 90 Teams connected to a server that issues tasks and evaluates submitted results http://videobrowsershowdown.org/ Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 91. Video Browser Showdown (VBS) • Scoring through VBS Server • Score (s) [0-100] for task i and team k is based on Solve time (t) Penalty (p) based on number of submissions (m) 91 Maximum solve time (Tmax) typically 3-5 minutes [Schoeffmann, K., Ahlström, D., Bailer, W., Cobârzan, C., Hopfgartner, F., McGuinness, K., ... & Weiss, W. (2013). The Video Browser Showdown: a live evaluation of interactive video search tools. International Journal of Multimedia Information Retrieval, 1-15. ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 92. Video Browser Showdown 2015 • Search in mid-sized video collections (2016: 200 hours)  Originally only single video search • Two different kind of tasks:  Visual: visual presentation of a 30s target clip  Textual: textual description of a 30s target clip • Shared video data from BBC  2015: 153 video files, about 100.000 shots (9 Mio frames) Participants: • Need to find the target clips as quickly as possible • Get points for each task (the faster the better) • But only for submission of exact location of target clip [Schoeffmann, Klaus. "A user-centric media retrieval competition: The video browser showdown 2012-2014." MultiMedia, IEEE 21.4 (2014): 8-13.] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 92
  • 93. 93Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 2012: Klagenfurt 11 teams 2013: Huangshan 6 teams 2014: Dublin 7 teams 2015: Sydney 9 teams VBS 2016: January 5, 2016, Miami, USA (MMM 2016) http://www.videobrowsershowdown.org/
  • 94. Video Browser Showdown 2012 Two examples (of the 11 tools; single video search only) 94 [Xiangyu Chen, Jin Yuan, Liqiang Nie, Zheng-Jun Zha, Shuicheng Yan, and Tat-Seng Chua, "TRECVID 2010 Known-item Search by NUS", in Proceedings of TRECVID 2010 workshop, NIST, Gaithersburgh, USA, 2011 Jin Yuan, Huanbo Luan, Dejun Hou, Han Zhang, Yan-Tao Zheng, Zheng-Jun Zha, and Tat-Seng Chua, "Video Browser Showdown by NUS", in Proceedings of th 18th International Conference on Multimedia Modeling (MMM) 2012, Klagenfurt, Austria, pp. 642-645] • Keyframe extraction (shots) • ASR and OCR • HLF (Concepts) • RF with Related Samples • Uniform sampled keyframes (with flexible distance) • Parallel playback + navigation [Manfred Del Fabro and Laszlo Böszörmenyi, "AAU Video Browser: Non- Sequential Hierarchical Video Browsing without Content Analysis", in Proceedings of th 18th International Conference on Multimedia Modeling (MMM) 2012, Klagenfurt, Austria, pp. 639-641] Winner of VBS 2012 Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 95. Winner 2014 and 2015 (2014: single video and collection search, 2015: collection only) 95 Color Sketch (Signature) Player 2nd Color Sketch (optional) [ Lokoč, J., Blažek, A., & Skopal, T. (2014, January). Signature-Based Video Browser. In MultiMedia Modeling (pp. 415-418). Springer International Publishing. ] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 96. Video Browser Showdown 2015 Two examples (of the 9tools, collection search only) 96 Moumtzidou, A., Avgerinakis, K., Apostolidis, E., Markatopoulou, F., Apostolidis, K., Mironidis, T., ... & Patras, I. (2015, January). VERGE: A Multimodal Interactive Video Search Engine. In MultiMedia Modeling (pp. 249-254). Springer International Publishing. • Shot and scene detection • HLF (Concepts) with SIFT/SURF and VLAD • Similarity search • Uniform sampled frames • Human computation Hürst, W., van de Werken, R., & Hoet, M. (2015, January). A Storyboard-Based Interface for Mobile Video Browsing. In MultiMedia Modeling (pp. 261-265). Springer International Publishing. 3rd place Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 97. 97Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search URL: http://mklab-services.iti.gr/vss2015/ [Courtesy of Stefanos Vrochidis]
  • 98. 98Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search [Courtesy of Stefanos Vrochidis]
  • 99. 99Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search Similarity Search Results [Courtesy of Stefanos Vrochidis]
  • 100. 100Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search [Courtesy of Stefanos Vrochidis]
  • 101. Human vs. Machine • Utrecht University @ VBS 2015  Wolfgang Huerst et al., The Netherlands  Strong experience in HCI • Features  Uniformly sampled thumbs (1 second distance)  Huge storyboard on tablet  Vertical scrolling, paging 101 625 thumbnails in one screen [Hürst, W., van de Werken, R., & Hoet, M. (2015, January). A Storyboard-Based Interface for Mobile Video Browsing. In MultiMedia Modeling (pp. 261-265). Springer International Publishing.] Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search
  • 102. Visual Lifelogging Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 102 Slides by Frank Hopfgartner
  • 103. What is The Quantified Self? The Quantified Self is about obtaining self-knowledge through self-tracking. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 103
  • 104. What is The Quantified Self? Self-tracking is also referred to as lifelogging, self- analysis, or self-hacking. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 104
  • 105. Memex Bush, Vannevar. "As We May Think." The Atlantic Monthly. July 1945. ImagesofMemex:http://trevor.smith.name/memex/ Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 105
  • 106. MyLifeBits • Gordon Bell (Microsoft) digitized his life: Books written Personal documents Photos Posters, paintings, photo of things Home movies and videos CD collection PC files … Gordon Bell and Jim Gemmell. Total Recall: How the E-Memory Revolution will change everything, New York, Dutton 2009 http://research.microsoft.com/en-us/projects/mylifebits/ Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 106
  • 107. Creating Personal Lifebraries A lifebrary consists of heterogeneous data recorded using many different sensors. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 107
  • 108. Recording what I eat Aizawa, Kiyoharu, Maruyama, Yutu, Li, He, and Morikawa, Chamin. “Food Balance Estimation by Using Personal Dietrary Tendencies in a Multimedia Food Log." IEEE Transactions on Multimedia, 15(8):2176-2185, 2013. Semantic Gap http://foodlog.jp/ http://mealsnap.com/ Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 108
  • 109. Recording what I see "LifeGlogging cameras 1998 2004 2006 2013 labeled" by Glogger - Own work. Licensed under CC BY-SA 3.0 via Commons - https://commons.wikimedia.org/wiki/File:LifeGlogging_cameras_1998_2004_2006_2013_labeled.jpg#/media/File:LifeGlogging_cameras_1998_2004_2006_2013_labeled.jpg Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 109
  • 110. Visual Lifelogging Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 110
  • 111. Example: Visual Lifelog of a day 5,500 pictures a day Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 111 [Slide: C. Gurrin, DCU]
  • 112. Big Data Cathal Gurrin, Alan F. Smeaton and Aiden R. Doherty (2014), "LifeLogging: Personal Big Data", Foundations and Trends® in Information Retrieval: Vol. 8: No. 1, pp 1-125. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 112
  • 113. Semantic Analysis • Context cues help us to remember (Naaman et al.) • Context in lifelogging data:  Location, bluetooth, time, date, …  Derived Knowledge (e.g. activities) • Approaches:  Combine cues from different sources  Perform content analysis to identify objects, people, events…  Annotate lifelogs in form of narrative text Mor Naaman, Susumu Harada, QianYing Wang, Hector Garcia-Molina, Andreas Paepcke: Context data in geo-referenced digital photo collections. ACM Multimedia 2004: 196-203 Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 113 [Slide: C. Gurrin, DCU]
  • 114. Visual Feature Extraction  Steering wheel (72%)  Shopping (75%)  Inside of vehicle when not driving (airplane, taxi, car, bus) (60%)  Toilet/Bathroom (58%)  Giving Presentation / Teaching (29%)  View of Horizon (23%)  Door (62%)  Staircase (48%)  Hands (68%)  Holding a cup/glass (35%)  Holding a mobile phone (39%)  Eating food (41%)  Screen (computer/laptop/tv) (78%)  Reading paper/book (58%)  Meeting (34%)  Road (47%)  Vegetation (64%)  Office Scene (72%)  Faces (61%)  People (45%)  Grass (61%)  Sky (79%)  Tree (63%) Byrne, Daragh, Doherty, Aiden R., Snoek, Cees G. M., Jones, Gareth J. F., Smeaton, Alan F. “Everyday concept detection in visual lifelogs: validation, relationships and trends." Multimedia Tools and Applications, 49(1):119-144, 2010. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 114
  • 115. A day This does not work well… Let’s add event segmentation. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 115 [Slide: C. Gurrin, DCU]
  • 116. Event Segmentation & Annotation • Segment 5,500 photos per day into a set of events  Similar to SBD in digital video processing  We employ visual features and output of on-device sensors Multiple Events Finishing work in the lab At the bus stop Chatting at Skylon Hotel lobby Moving to a room Tea time On the way back home Event Segmentation Summarization Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 116 [Slide: C. Gurrin, DCU]
  • 117. Non-supervised Event Segmentation 2. Arriving in the office 6. Walking in the building 12. Leaving the office Na Li et al. “Random Matrix Ensembles of Time Correlation Matrices to Analyze Visual Lifelogs." In Proc. Multimedia Modeling Conference, Dublin, Ireland, pp. 400-411, 2014. Event Segmentation based on the extraction of low level features and computation of semantic concepts requires knowledge about dataset. Alternative: Highlight “significant events” by performing time series analysis Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 117
  • 118. MyLifeBits Gordon Bell and Jim Gemmell. Total Recall: How the E-Memory Revolution will change everything, New York, Dutton 2009 Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 118
  • 119. MyLifeBits Gordon Bell and Jim Gemmell. Total Recall: How the E-Memory Revolution will change everything, New York, Dutton 2009 Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 119
  • 120. Virtual reality “Bad Trip is an immersive virtual reality installation […] that enables people to navigate the creator's mind using a game controller. Since November 2011, every moments of his life has been documented by a video camera mounted on glasses, producing an expanding database of digitalized visual memories. Using custom virtual reality software, he created a virtual mindscape where people could navigate, and experience his memories and dreams.” Souce: http://www.kwanalan.com Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 120
  • 121. Art installations Kelly, Philip and Doherty, Aiden R. and Smeaton, Alan F. and Gurrin, Cathal and O’Connor, Noel E. “The Colour of Life: Novel Visualisations of Population Lifestyles." In Proc. ACM Multimedia, pp. 1063-1066, 2010. Image:CourtesyofC.Gurrin Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 121
  • 122. Video Summary Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 122 [Courtesy of T. Plumbaum]
  • 123. NTCIR • Workshop series focusing on research on Information Access technologies (information retrieval, question answering, text summarisation, etc) • Sponsored by Japan Society for Promotion of Science (JSPS) • Organised since 1997 in an 18-months cycle • NTCIR-12: January 2015 – June 2016 NII Test Collection for IR Systems Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 123
  • 124. NTCIR-12 Tasks NTCIR-12  Second round:  Search-Intent Mining  Mobile Click  Temporal Information Access  Spoken Query & Spoken Document Retrieval  QA Lab for Entrance Exam  First round:  Medical NLP for Clinical Documents  Personal Lifelog Access & Retrieval  Short Text Conversation Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 124
  • 125. Encourage research advances in organising and retrieving from lifelog data. LifeLog @ NTCIR-12 Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 125
  • 126. Multimodal dataset with information needs Created by various individuals over 10+ days TESTCOLLECTION  1,500 images, location, GSR, heart-rate, others… per lifelogger per day  Accompanying output of 1,000 concepts  Data processed pre-release (removal of personal content; face blurring, translation of concepts)  Detailed user queries and judgments generated by the lifelogging data gatherers Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 126
  • 127. Tasks Evaluate different methods of retrieval and access. T1:LIFELOGSEMANTICACCESS(LSAT)  Models the retrieval need from lifelogs (Known-item Search)  Retrieve N segments that match information need  Interactive or Automatic participation  Interactive: Time limit for fair and comparative evaluation in an interactive system with users  Automatic: Fully-automatic retrieval system. Automated query processing T2:LIFELOGINSIGHT  Models the need for reflection over lifelog data  Exploratory task, the aim is to:  Encourage broad participation  Novel methods to visualize and explore lifelogs  Same data as LSAT task  Presented via demo/poster Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 127
  • 128. Task 1: Lifelog Semantic Access Find the moment(s) where I use my coffee machine. Find the moment(s) where I am in the kitchen Find the moment(s) where I am playing with my phone. Find the moment(s) where I am preparing breakfast. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 128
  • 129. Task 2: Lifelog Insight Task Provide insights on the time I spend taking breakfast. Provide insights on the time I spend driving to work. Provide insights on the time I spend reading a paper. Provide insights on the time I spend working on the computer. Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 129
  • 130. Further information http://ntcir-lifelog.computing.dcu.ie/ 21 Sep 2015: Release of formal run collection and task data (topics) 15 Dec 2015: Deadline for formal run submissions 15 Jan 2016: Formal run evaluation results return 01 Mar 2016: Paper for the Proceedings 7-10 Jun 2016: NCTIR-12 conference Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 130
  • 131. The End Klaus Schoeffmann, Frank Hopfgartner ACM Multimedia 2015 Tutorial: Interactive Video Search 131

Hinweis der Redaktion

  1. http://www.kwanalan.com/#!blank/czod
  2. A video of a lifelog is created by playing back all pictures based on their creation time. An audio layer could be added as well. But as we have seen in the first part of the tutorial, that might not be very efficient. It allows, however, to give you a good overview of the day. This is a video of a trip that my research group a while ago. Klaus: You do not need to play the whole video, just the beginning should be fine to give the idea.
  3. NTCIR is a Japanese version of TREC. It’s main aim is to promote research on information access technologies. It is organised in an 18 months cycle. The current cycle started in January 2015 and will end with a conference in Tokyo in June 2016.