SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Semi-supervised concept detection by learning
the structure of similarity graphs
Symeon Papadopoulos1, Christos Sagonas1, Ioannis Kompatsiaris1, Athena Vakali2
1
  Centre for Research and Technology Hellas, Information Technologies Institute
2
  Aristotle University of Thessaloniki, Informatics Department




    19th International Conference on Multimedia Modeling
    Huangshan, China, Jan 7-9, 2012
IMAGE           TAGS                          CONCEPTS

                chocolate
                cake                          food
                chocolateganachebuttercream
                shamsd




                                              female
                N/A                           indoor
                                              people
                                              portrait


                nature
                landscape                     clouds
                water                         lake
                reflection                    sky
                mirror                        water
                flickrelite
                abigfave
                                              SOURCE: MIR-Flickr
 mklab.iti.gr                 #2
Overview

•    Problem formulation
•    Related work
•    Graph Structure Features Approach
•    Evaluation
      – Synthetic datasets
      – MIR-Flickr
• Conclusions


    mklab.iti.gr             #3
Overview

•    Problem formulation
•    Related work
•    Graph Structure Features Approach
•    Evaluation
      – Synthetic datasets
      – MIR-Flickr
• Conclusions


    mklab.iti.gr             #4
Concept detection

ML perspective
• Given an image, produce a set of relevant concepts

IR perspective
• Given an image collection and a concept of interest,
   rank all images in order of relevance.




 mklab.iti.gr             #5
Semi-supervised learning

• Transductive learning setting
                target concepts

                                    annotated set

                         D-dimensional feature vector from image i
                           concept indicator vector (labels) for image i

                                  set of unknown items

       Predict concepts associated with items of         by processing
       together    and     .

 mklab.iti.gr                        #6
Overview

•    Problem formulation
•    Related work
•    Graph Structure Features Approach
•    Evaluation
      – Synthetic datasets
      – MIR-Flickr
• Conclusions


    mklab.iti.gr             #7
Related work
• Neighborhood similarity (Wang et al., 2009)
   – Uses image similarity graphs in combination with graph-based SSL
     (Zhu, 2005; Zhou et al., 2004) – Not incremental
• Sparse similarity graph by convex optim. (Tang et al., 2009)
   – Applicable to online settings - Computationally intensive training step
• Hashing-based graph construction (Chen et al., 2010)
   – Uses KL divergence multi-label propagation, but relies on iterative
     computational scheme – Difficult to apply in incremental settings
• Social dimensions (Tang & Liu, 2011)
   – Uses LEs for networked classification problems (i.e. when network
     between nodes is explicit) – Not incremental, not applied to
     multimedia


 mklab.iti.gr                      #8
Overview

•    Problem formulation
•    Related work
•    Graph Structure Features Approach
•    Evaluation
      – Synthetic datasets
      – MIR-Flickr
• Conclusions


    mklab.iti.gr             #9
Graph Structure Features (GSF)




 mklab.iti.gr     #10
Graph construction

                     image similarity graph

                     set of nodes-images

                cardinality of node set



Construction options
• full weighted graph
• kNN graph (connect k most similar images)
• εNN graph (connect images < similarity threshold)
 mklab.iti.gr                    #11
Eigenvector/value computation

 Normalized graph Laplacian


                degree matrix (diagonal)
                adjacency matrix
         (typical form of graph Laplacian:                   )

                 non-zero eigenvalues

                           graph structure features*

         by solving
                                                 *aka Laplacian Eigenmaps
 mklab.iti.gr                      #12
Graph structure feature learning


• Each media item is represented by a vector

• At this point, any supervised learning method could be used.
   [note that the whole framework is still SSL since unlabeled items are
      used during graph construction]


• SVM is selected
   – good performance in several problems
   – good implementations available (LibSVM, LIBLINEAR)
   – real-valued output (IR perspective  rank images by concept)


 mklab.iti.gr                     #13
Intuition

                             coast                         coast, person
          coast

                            0.2415                          -0.4552
                                           coast, person                   coast, person
         0.3077    coast

                                             -0.0893
                                                                            -0.4552
                   0.2748


        0.3144                                               -0.4663
         coast              0.2415
                             coast                         coast, person

                  2nd eigenvector of graph Laplacian



 mklab.iti.gr                        #14
Incremental learning setting                               (1)

•      Transductive learning setting often impractical. For
       each new set of unlabeled items:
      1. recompute image similarity matrix
      2. recompute graph structure features (LEs)
      3. use SVM to obtain prediction scores
•      Step 2 is computationally expensive.
•      Devise two incremental schemes:
      –    Linear Projection (LP) :
                                          set of k most similar images


      –    Submanifold Analysis (SA)    [cf. next slide]
    mklab.iti.gr                  #15
Incremental learning setting                      (2)

• Submanifold Analysis [Jia et al., 2009]
   – Construct (k+1)x(k+1) similarity matrix WS between new
     item and k most images from the annotated set
   – Construct sub-diagonal and sub-Laplacian matrices


   – Compute eigenvalues                            and d
     eigenvectors          corresponding to non-zero
     eigenvalues [computation is lightweight since k << n]
   – Minimize reconstruction error:

   – Reconstruct approximate eigenvectors:

 mklab.iti.gr               #16
Fusion of multiple features
                                  Graph struct. feature fusion (F-GSF)


      Feature fusion (F-FEAT)




Similarity graph fusion (F-SIM)          Result fusion (F-RES)

  mklab.iti.gr                  #17
Overview

•    Problem formulation
•    Related work
•    Graph Structure Features Approach
•    Evaluation
      – Synthetic datasets
      – MIR-Flickr
• Conclusions


    mklab.iti.gr             #18
Synthetic data - experiments
• Use of four 2D distributions with limited number of
  samples (thousands) to test many settings
        TWO MOONS        LINES           CIRCLES         GAUSSIANS




• Performance aspects
   – Parameters of approach: number of features (CD), graph
     construction technique (kNN, εNN) and parameters (k, ε)
   – Learning setting (training size, data noise, nr. of classes)
   – Inductive learning (LP vs SA)
   – Fusion method
 mklab.iti.gr                    #19
Role of number of GSF (CD)
                TWO MOONS                        LINES




                                      noise
                                      levels




                  CIRCLES                      GAUSSIANS



                 higher CD  better mAP
                 higher noise  higher CD




 mklab.iti.gr                           #20
Role of graph construction technique




                kNN                                   εNN

                  kNN better and less sensitive than εΝΝ



 mklab.iti.gr                    #21
Role of noise (σ)
           TWO MOONS                               LINES




                           competing
                CIRCLES    methods               GAUSSIANS




     In most cases GSF equal or better than the expensive SVM-RBF.

 mklab.iti.gr                     #22
Role of training samples (α%)
           TWO MOONS                                  LINES




                CIRCLES                            GAUSSIANS




 In most cases few training samples (2-5%) are sufficient for high accuracy.
 mklab.iti.gr                     #23
Number of classes (K)

                LINES                                CIRCLES




             Sufficiently good accuracy wrt. number of classes
         (much better than linear SVM, a bit worse than SVM-RBF).


 mklab.iti.gr                     #24
Scalability wrt. number of features


                             Linearly increasing cost wrt.
                             dimensionality




                              Constant cost wrt.
                              dimensionality




 mklab.iti.gr      #25
Comparison between fusion methods

                LINES                                  CIRCLES




       Even when one feature goes bad, result and GSF fusion still do
                          better than the best.


 mklab.iti.gr                     #26
Incremental schemes               SA much better and less sensitive than LP.
                TWO MOONS                           LINES




                  CIRCLES                         GAUSSIANS




 mklab.iti.gr               #27
Overview

•    Problem formulation
•    Related work
•    Graph Structure Features Approach
•    Evaluation
      – Synthetic datasets
      – MIR-Flickr
• Conclusions


    mklab.iti.gr             #28
Experimental setting

• MIR-Flickr
   – 25,000 images + tags
   – 38 concepts (24 + 14 with two interpretations [strict/rel])


• Benchmark methods
   – Semantic Spaces (SESPA) [Hare & Lewis, 2010]
   – Multiple Kernel Learning (MKL) [Guillaumin et al., 2010]




 mklab.iti.gr                #29
GSF vs SESPA




GSF-F1, F2, F3: Single feature GSF
GSF-C: Graph structure feature fusion
GSF-D1, D2: Result fusion using LIBLINEAR (1) and RBF (2)

 mklab.iti.gr                 #30
GSF vs MKL

                                                                                            VISUAL




                                                   MKL better in: baby, bird, river, sea.



                    Possible thanks to
                scalable behavior wrt.
                                                                                            TAG
                  number of features.




                                               GSF better in: baby, bird, car, dog, river, sea.

 mklab.iti.gr                            #31
Example results




 mklab.iti.gr     #32
Evaluation: adding unlabeled samples (1)


                                   ~6% relative
                                   increase in mAP




                   GIST

 mklab.iti.gr     #33
Evaluation: adding unlabeled samples (2)


                                   ~12% relative
                                   increase in mAP




                DenseSiftV3H1

 mklab.iti.gr       #34
Evaluation: adding unlabeled samples (3)

                                   ~4% relative
                                   increase in mAP




                 TagRaw50

 mklab.iti.gr     #35
Overview

•    Problem formulation
•    Related work
•    Graph Structure Features Approach
•    Evaluation
      – Synthetic datasets
      – MIR-Flickr
• Conclusions


    mklab.iti.gr             #36
Conclusions
• Concept detection approach based on the structure of image
  similarity graphs
   – Transductive learning setting
   – Two variants for online learning
• Thorough experimental analysis
   – Behavior under a variety of settings/parameters
   – Equivalent or better behavior compared to SoA approaches
• Fast:
   – SA with k=5 takes 38.4msec per image (not incl. feature extraction)
   – Future work: further analysis of computational characteristics +
     application to larger scale datasets (NUS-Wide, ImageNet)


 mklab.iti.gr                    #37
Thank you




Further contact:   papadop@iti.gr
                                     www.socialsensor.eu

   mklab.iti.gr                #38
References (1)
• Graph-based semi-supervised learning
   Zhu, X.: Semi-supervised learning with graphs. PhD Thesis, Carnegie
     Mellon University, 0-542-19059-1 (2005)
   Zhou, D., Bousquet, O., Navin Lal, T., Weston, J. Schoelkopf, B.: Learning
     with Local and Global Consistency. Advances in NIPS 16, MIT Press
     (2004) 321-328
• Related approaches
   Wang, M., Hua, X.-S. Tang, J., Hong, R.: Beyond distance measurement:
     constructing neighborhood similarity for video annotation. TMM 11
     (3) (2009), 465-476
   Tang, J. et al.: Inferring semantic concepts from community contributed
     images and noisy tags. ACM Multimedia (2009) 223-232
   Chen, X. et al.: Efficient large scale image annotation by probabilistic
     collaborative multi-label propagation. ACM Multimedia (2010), 35-44
   Tang, L., Liu, H.: Leveraging social media networks for classification. Data
     Mining and Knowledge Discovery 23 (3) (2011), 447-478


 mklab.iti.gr                      #39
References (2)
• Relational classification
   Macskassy, S.A., Provost, F.: Classification in Networked Data: A Toolkit
     and a Univariate Case Study. Journal of Machine Learning Research 8,
     (2007), 935-983

• Laplacian Eigenmaps
   Mikhail, B., Partha, N.: Laplacian Eigenmaps for dimensionality reduction
      and data representation. Neural Computing 15 (6), MIT Press (2003)
      1373-1396
   Jia, P., Yin, J., Huang, X., Hu, D.: Incremental Laplacian eigenmaps by
      preserving adjacent information between data points. PR Letters 30
      (16) (2009), 1457–1463




 mklab.iti.gr                     #40
References (3)
• Tools
   Leyffer, S., Mahajan, A.: Nonlinear Constrained Optimization: Methods and
      Software. Preprint ANL/MCS-P1729-0310 (2010)
   Fan, R., Chang, K., Hsieh, C., Wang, X., Lin, C.: LIBLINEAR: A Library for Large Linear
      Classification. Journal of ML Research 9 (2008), 1871-1874
   Chang, C.-C., Lin, C.-J.: LIBSVM: A library for support vector machines. ACM
      Transactions on Intelligent Systems and Technology 2 (3) (2011), 27:1–27:27
• Dataset
   Huiskes, M.J., Michael S. Lew, M.S.: The MIR Flickr Retrieval Evaluation.
     Proceedings of ACM Intern. Conf. on Multimedia Information Retrieval (2008)
• Competing methods
   Hare, J.S., Lewis, P.H.: Automatically annotating the MIR Flickr dataset. ACM ICMR
     (2010), 547-556
   Guillaumin, M., Verbeek, J., Schmid, C.: Multimodal semi supervised learning for
     image classification. Proceedings of IEEE CVPR Conference (2010), 902-909


 mklab.iti.gr                          #41
mklab.iti.gr   #42

Weitere ähnliche Inhalte

Was ist angesagt?

[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)Susang Kim
 
Visual odometry & slam utilizing indoor structured environments
Visual odometry & slam utilizing indoor structured environmentsVisual odometry & slam utilizing indoor structured environments
Visual odometry & slam utilizing indoor structured environmentsNAVER Engineering
 
[論文紹介] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Ne...
[論文紹介] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Ne...[論文紹介] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Ne...
[論文紹介] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Ne...Seiya Ito
 
Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...Universitat de Barcelona
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Universitat Politècnica de Catalunya
 
高精地图数据协议标准探究
高精地图数据协议标准探究高精地图数据协议标准探究
高精地图数据协议标准探究Weijun Zhong
 
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationDeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationNamHyuk Ahn
 
Convolutional Neural Network and RNN for OCR problem.
Convolutional Neural Network and RNN for OCR problem.Convolutional Neural Network and RNN for OCR problem.
Convolutional Neural Network and RNN for OCR problem.Vishal Mishra
 
Multimodal Residual Learning for Visual QA
Multimodal Residual Learning for Visual QAMultimodal Residual Learning for Visual QA
Multimodal Residual Learning for Visual QANamHyuk Ahn
 
Anchor free object detection by deep learning
Anchor free object detection by deep learningAnchor free object detection by deep learning
Anchor free object detection by deep learningYu Huang
 
CNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesDmytro Mishkin
 
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...Wanjin Yu
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIYu Huang
 
Modern Convolutional Neural Network techniques for image segmentation
Modern Convolutional Neural Network techniques for image segmentationModern Convolutional Neural Network techniques for image segmentation
Modern Convolutional Neural Network techniques for image segmentationGioele Ciaparrone
 
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...Joonhyung Lee
 
3-d interpretation from single 2-d image V
3-d interpretation from single 2-d image V3-d interpretation from single 2-d image V
3-d interpretation from single 2-d image VYu Huang
 
Understanding Convolutional Neural Networks
Understanding Convolutional Neural NetworksUnderstanding Convolutional Neural Networks
Understanding Convolutional Neural NetworksJeremy Nixon
 
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...MYEONGGYU LEE
 

Was ist angesagt? (20)

[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)[Paper] Multiscale Vision Transformers(MVit)
[Paper] Multiscale Vision Transformers(MVit)
 
Visual odometry & slam utilizing indoor structured environments
Visual odometry & slam utilizing indoor structured environmentsVisual odometry & slam utilizing indoor structured environments
Visual odometry & slam utilizing indoor structured environments
 
[論文紹介] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Ne...
[論文紹介] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Ne...[論文紹介] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Ne...
[論文紹介] BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Ne...
 
Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...Deep image retrieval - learning global representations for image search - ub ...
Deep image retrieval - learning global representations for image search - ub ...
 
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
Image Segmentation (D3L1 2017 UPC Deep Learning for Computer Vision)
 
高精地图数据协议标准探究
高精地图数据协议标准探究高精地图数据协议标准探究
高精地图数据协议标准探究
 
Mask R-CNN
Mask R-CNNMask R-CNN
Mask R-CNN
 
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image SegmentationDeconvNet, DecoupledNet, TransferNet in Image Segmentation
DeconvNet, DecoupledNet, TransferNet in Image Segmentation
 
Convolutional Neural Network and RNN for OCR problem.
Convolutional Neural Network and RNN for OCR problem.Convolutional Neural Network and RNN for OCR problem.
Convolutional Neural Network and RNN for OCR problem.
 
Multimodal Residual Learning for Visual QA
Multimodal Residual Learning for Visual QAMultimodal Residual Learning for Visual QA
Multimodal Residual Learning for Visual QA
 
Anchor free object detection by deep learning
Anchor free object detection by deep learningAnchor free object detection by deep learning
Anchor free object detection by deep learning
 
CNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent AdvancesCNNs: from the Basics to Recent Advances
CNNs: from the Basics to Recent Advances
 
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
Intelligent Image Enhancement and Restoration - From Prior Driven Model to Ad...
 
fusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving IIfusion of Camera and lidar for autonomous driving II
fusion of Camera and lidar for autonomous driving II
 
Depth estimation using deep learning
Depth estimation using deep learningDepth estimation using deep learning
Depth estimation using deep learning
 
Modern Convolutional Neural Network techniques for image segmentation
Modern Convolutional Neural Network techniques for image segmentationModern Convolutional Neural Network techniques for image segmentation
Modern Convolutional Neural Network techniques for image segmentation
 
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
DeepLab V3+: Encoder-Decoder with Atrous Separable Convolution for Semantic I...
 
3-d interpretation from single 2-d image V
3-d interpretation from single 2-d image V3-d interpretation from single 2-d image V
3-d interpretation from single 2-d image V
 
Understanding Convolutional Neural Networks
Understanding Convolutional Neural NetworksUnderstanding Convolutional Neural Networks
Understanding Convolutional Neural Networks
 
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
(Paper Review)U-GAT-IT: unsupervised generative attentional networks with ada...
 

Ähnlich wie Semi-supervised concept detection by learning the structure of similarity graphs

Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Jihong Kang
 
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph KernelsDDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph Kernelsivaderivader
 
Collaborative Similarity Measure for Intra-Graph Clustering
Collaborative Similarity Measure for Intra-Graph ClusteringCollaborative Similarity Measure for Intra-Graph Clustering
Collaborative Similarity Measure for Intra-Graph ClusteringWaqas Nawaz
 
Large Scale Image Retrieval 2022.pdf
Large Scale Image Retrieval 2022.pdfLarge Scale Image Retrieval 2022.pdf
Large Scale Image Retrieval 2022.pdfSamuCerezo
 
Large-Scale Machine Learning with Apache Spark
Large-Scale Machine Learning with Apache SparkLarge-Scale Machine Learning with Apache Spark
Large-Scale Machine Learning with Apache SparkDB Tsai
 
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptxthanhdowork
 
Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Databricks
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakLearn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakPyData
 
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...DB Tsai
 
Introduction to computer vision with Convoluted Neural Networks
Introduction to computer vision with Convoluted Neural NetworksIntroduction to computer vision with Convoluted Neural Networks
Introduction to computer vision with Convoluted Neural NetworksMarcinJedyk
 
Recent Developments in Spark MLlib and Beyond
Recent Developments in Spark MLlib and BeyondRecent Developments in Spark MLlib and Beyond
Recent Developments in Spark MLlib and BeyondXiangrui Meng
 
Graph Based Machine Learning with Applications to Media Analytics
Graph Based Machine Learning with Applications to Media AnalyticsGraph Based Machine Learning with Applications to Media Analytics
Graph Based Machine Learning with Applications to Media AnalyticsNYC Predictive Analytics
 
Image Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyImage Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyNUPUR YADAV
 
Introduction to computer vision
Introduction to computer visionIntroduction to computer vision
Introduction to computer visionMarcin Jedyk
 
[PR-325] Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Tran...
[PR-325] Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Tran...[PR-325] Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Tran...
[PR-325] Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Tran...Sunghoon Joo
 
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]Dongmin Choi
 
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...MLconf
 
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...Lucidworks
 

Ähnlich wie Semi-supervised concept detection by learning the structure of similarity graphs (20)

Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331Recent Progress on Object Detection_20170331
Recent Progress on Object Detection_20170331
 
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph KernelsDDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
 
Collaborative Similarity Measure for Intra-Graph Clustering
Collaborative Similarity Measure for Intra-Graph ClusteringCollaborative Similarity Measure for Intra-Graph Clustering
Collaborative Similarity Measure for Intra-Graph Clustering
 
Large Scale Image Retrieval 2022.pdf
Large Scale Image Retrieval 2022.pdfLarge Scale Image Retrieval 2022.pdf
Large Scale Image Retrieval 2022.pdf
 
Fa18_P2.pptx
Fa18_P2.pptxFa18_P2.pptx
Fa18_P2.pptx
 
Large-Scale Machine Learning with Apache Spark
Large-Scale Machine Learning with Apache SparkLarge-Scale Machine Learning with Apache Spark
Large-Scale Machine Learning with Apache Spark
 
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
 
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
Image Retrieval (D4L5 2017 UPC Deep Learning for Computer Vision)
 
Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™Web-Scale Graph Analytics with Apache® Spark™
Web-Scale Graph Analytics with Apache® Spark™
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr TeterwakLearn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
 
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
2015-06-15 Large-Scale Elastic-Net Regularized Generalized Linear Models at S...
 
Introduction to computer vision with Convoluted Neural Networks
Introduction to computer vision with Convoluted Neural NetworksIntroduction to computer vision with Convoluted Neural Networks
Introduction to computer vision with Convoluted Neural Networks
 
Recent Developments in Spark MLlib and Beyond
Recent Developments in Spark MLlib and BeyondRecent Developments in Spark MLlib and Beyond
Recent Developments in Spark MLlib and Beyond
 
Graph Based Machine Learning with Applications to Media Analytics
Graph Based Machine Learning with Applications to Media AnalyticsGraph Based Machine Learning with Applications to Media Analytics
Graph Based Machine Learning with Applications to Media Analytics
 
Image Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A surveyImage Segmentation Using Deep Learning : A survey
Image Segmentation Using Deep Learning : A survey
 
Introduction to computer vision
Introduction to computer visionIntroduction to computer vision
Introduction to computer vision
 
[PR-325] Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Tran...
[PR-325] Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Tran...[PR-325] Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Tran...
[PR-325] Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Tran...
 
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
 
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...
 
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
Image-Based E-Commerce Product Discovery: A Deep Learning Case Study - Denis ...
 

Mehr von Symeon Papadopoulos

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...Symeon Papadopoulos
 
Deepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionDeepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionSymeon Papadopoulos
 
Knowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationKnowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationSymeon Papadopoulos
 
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Symeon Papadopoulos
 
COVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingCOVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingSymeon Papadopoulos
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSymeon Papadopoulos
 
Twitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualityTwitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualitySymeon Papadopoulos
 
Aggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentAggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentSymeon Papadopoulos
 
Verifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetVerifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetSymeon Papadopoulos
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionSymeon Papadopoulos
 
Learning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterLearning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterSymeon Papadopoulos
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersSymeon Papadopoulos
 
Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Symeon Papadopoulos
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Symeon Papadopoulos
 
In-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceIn-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceSymeon Papadopoulos
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Symeon Papadopoulos
 
Web and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsWeb and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsSymeon Papadopoulos
 
Predicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsPredicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsSymeon Papadopoulos
 
Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Symeon Papadopoulos
 

Mehr von Symeon Papadopoulos (20)

DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
DeepFake Detection: Challenges, Progress and Hands-on Demonstration of Techno...
 
Deepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their DetectionDeepfakes: An Emerging Internet Threat and their Detection
Deepfakes: An Emerging Internet Threat and their Detection
 
Knowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering LocalizationKnowledge-based Fusion for Image Tampering Localization
Knowledge-based Fusion for Image Tampering Localization
 
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
Deepfake Detection: The Importance of Training Data Preprocessing and Practic...
 
COVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact TracingCOVID-19 Infodemic vs Contact Tracing
COVID-19 Infodemic vs Contact Tracing
 
Similarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia contentSimilarity-based retrieval of multimedia content
Similarity-based retrieval of multimedia content
 
Twitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air QualityTwitter-based Sensing of City-level Air Quality
Twitter-based Sensing of City-level Air Quality
 
Aggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media ContentAggregating and Analyzing the Context of Social Media Content
Aggregating and Analyzing the Context of Social Media Content
 
Verifying Multimedia Content on the Internet
Verifying Multimedia Content on the InternetVerifying Multimedia Content on the Internet
Verifying Multimedia Content on the Internet
 
A Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering DetectionA Web-based Service for Image Tampering Detection
A Web-based Service for Image Tampering Detection
 
Learning to detect Misleading Content on Twitter
Learning to detect Misleading Content on TwitterLearning to detect Misleading Content on Twitter
Learning to detect Misleading Content on Twitter
 
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN LayersNear-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
Near-Duplicate Video Retrieval by Aggregating Intermediate CNN Layers
 
Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016Verifying Multimedia Use at MediaEval 2016
Verifying Multimedia Use at MediaEval 2016
 
Multimedia Privacy
Multimedia PrivacyMultimedia Privacy
Multimedia Privacy
 
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...Placing Images with Refined Language Models and Similarity Search with PCA-re...
Placing Images with Refined Language Models and Similarity Search with PCA-re...
 
In-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging PerformanceIn-depth Exploration of Geotagging Performance
In-depth Exploration of Geotagging Performance
 
Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...Perceived versus Actual Predictability of Personal Information in Social Netw...
Perceived versus Actual Predictability of Personal Information in Social Netw...
 
Web and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News ProfessionalsWeb and Social Media Image Forensics for News Professionals
Web and Social Media Image Forensics for News Professionals
 
Predicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online DiscussionsPredicting News Popularity by Mining Online Discussions
Predicting News Popularity by Mining Online Discussions
 
Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015Finding Diverse Social Images at MediaEval 2015
Finding Diverse Social Images at MediaEval 2015
 

Kürzlich hochgeladen

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 

Kürzlich hochgeladen (20)

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 

Semi-supervised concept detection by learning the structure of similarity graphs

  • 1. Semi-supervised concept detection by learning the structure of similarity graphs Symeon Papadopoulos1, Christos Sagonas1, Ioannis Kompatsiaris1, Athena Vakali2 1 Centre for Research and Technology Hellas, Information Technologies Institute 2 Aristotle University of Thessaloniki, Informatics Department 19th International Conference on Multimedia Modeling Huangshan, China, Jan 7-9, 2012
  • 2. IMAGE TAGS CONCEPTS chocolate cake food chocolateganachebuttercream shamsd female N/A indoor people portrait nature landscape clouds water lake reflection sky mirror water flickrelite abigfave SOURCE: MIR-Flickr mklab.iti.gr #2
  • 3. Overview • Problem formulation • Related work • Graph Structure Features Approach • Evaluation – Synthetic datasets – MIR-Flickr • Conclusions mklab.iti.gr #3
  • 4. Overview • Problem formulation • Related work • Graph Structure Features Approach • Evaluation – Synthetic datasets – MIR-Flickr • Conclusions mklab.iti.gr #4
  • 5. Concept detection ML perspective • Given an image, produce a set of relevant concepts IR perspective • Given an image collection and a concept of interest, rank all images in order of relevance. mklab.iti.gr #5
  • 6. Semi-supervised learning • Transductive learning setting target concepts annotated set D-dimensional feature vector from image i concept indicator vector (labels) for image i set of unknown items Predict concepts associated with items of by processing together and . mklab.iti.gr #6
  • 7. Overview • Problem formulation • Related work • Graph Structure Features Approach • Evaluation – Synthetic datasets – MIR-Flickr • Conclusions mklab.iti.gr #7
  • 8. Related work • Neighborhood similarity (Wang et al., 2009) – Uses image similarity graphs in combination with graph-based SSL (Zhu, 2005; Zhou et al., 2004) – Not incremental • Sparse similarity graph by convex optim. (Tang et al., 2009) – Applicable to online settings - Computationally intensive training step • Hashing-based graph construction (Chen et al., 2010) – Uses KL divergence multi-label propagation, but relies on iterative computational scheme – Difficult to apply in incremental settings • Social dimensions (Tang & Liu, 2011) – Uses LEs for networked classification problems (i.e. when network between nodes is explicit) – Not incremental, not applied to multimedia mklab.iti.gr #8
  • 9. Overview • Problem formulation • Related work • Graph Structure Features Approach • Evaluation – Synthetic datasets – MIR-Flickr • Conclusions mklab.iti.gr #9
  • 10. Graph Structure Features (GSF) mklab.iti.gr #10
  • 11. Graph construction image similarity graph set of nodes-images cardinality of node set Construction options • full weighted graph • kNN graph (connect k most similar images) • εNN graph (connect images < similarity threshold) mklab.iti.gr #11
  • 12. Eigenvector/value computation Normalized graph Laplacian degree matrix (diagonal) adjacency matrix (typical form of graph Laplacian: ) non-zero eigenvalues graph structure features* by solving *aka Laplacian Eigenmaps mklab.iti.gr #12
  • 13. Graph structure feature learning • Each media item is represented by a vector • At this point, any supervised learning method could be used. [note that the whole framework is still SSL since unlabeled items are used during graph construction] • SVM is selected – good performance in several problems – good implementations available (LibSVM, LIBLINEAR) – real-valued output (IR perspective  rank images by concept) mklab.iti.gr #13
  • 14. Intuition coast coast, person coast 0.2415 -0.4552 coast, person coast, person 0.3077 coast -0.0893 -0.4552 0.2748 0.3144 -0.4663 coast 0.2415 coast coast, person 2nd eigenvector of graph Laplacian mklab.iti.gr #14
  • 15. Incremental learning setting (1) • Transductive learning setting often impractical. For each new set of unlabeled items: 1. recompute image similarity matrix 2. recompute graph structure features (LEs) 3. use SVM to obtain prediction scores • Step 2 is computationally expensive. • Devise two incremental schemes: – Linear Projection (LP) : set of k most similar images – Submanifold Analysis (SA) [cf. next slide] mklab.iti.gr #15
  • 16. Incremental learning setting (2) • Submanifold Analysis [Jia et al., 2009] – Construct (k+1)x(k+1) similarity matrix WS between new item and k most images from the annotated set – Construct sub-diagonal and sub-Laplacian matrices – Compute eigenvalues and d eigenvectors corresponding to non-zero eigenvalues [computation is lightweight since k << n] – Minimize reconstruction error: – Reconstruct approximate eigenvectors: mklab.iti.gr #16
  • 17. Fusion of multiple features Graph struct. feature fusion (F-GSF) Feature fusion (F-FEAT) Similarity graph fusion (F-SIM) Result fusion (F-RES) mklab.iti.gr #17
  • 18. Overview • Problem formulation • Related work • Graph Structure Features Approach • Evaluation – Synthetic datasets – MIR-Flickr • Conclusions mklab.iti.gr #18
  • 19. Synthetic data - experiments • Use of four 2D distributions with limited number of samples (thousands) to test many settings TWO MOONS LINES CIRCLES GAUSSIANS • Performance aspects – Parameters of approach: number of features (CD), graph construction technique (kNN, εNN) and parameters (k, ε) – Learning setting (training size, data noise, nr. of classes) – Inductive learning (LP vs SA) – Fusion method mklab.iti.gr #19
  • 20. Role of number of GSF (CD) TWO MOONS LINES noise levels CIRCLES GAUSSIANS higher CD  better mAP higher noise  higher CD mklab.iti.gr #20
  • 21. Role of graph construction technique kNN εNN kNN better and less sensitive than εΝΝ mklab.iti.gr #21
  • 22. Role of noise (σ) TWO MOONS LINES competing CIRCLES methods GAUSSIANS In most cases GSF equal or better than the expensive SVM-RBF. mklab.iti.gr #22
  • 23. Role of training samples (α%) TWO MOONS LINES CIRCLES GAUSSIANS In most cases few training samples (2-5%) are sufficient for high accuracy. mklab.iti.gr #23
  • 24. Number of classes (K) LINES CIRCLES Sufficiently good accuracy wrt. number of classes (much better than linear SVM, a bit worse than SVM-RBF). mklab.iti.gr #24
  • 25. Scalability wrt. number of features Linearly increasing cost wrt. dimensionality Constant cost wrt. dimensionality mklab.iti.gr #25
  • 26. Comparison between fusion methods LINES CIRCLES Even when one feature goes bad, result and GSF fusion still do better than the best. mklab.iti.gr #26
  • 27. Incremental schemes SA much better and less sensitive than LP. TWO MOONS LINES CIRCLES GAUSSIANS mklab.iti.gr #27
  • 28. Overview • Problem formulation • Related work • Graph Structure Features Approach • Evaluation – Synthetic datasets – MIR-Flickr • Conclusions mklab.iti.gr #28
  • 29. Experimental setting • MIR-Flickr – 25,000 images + tags – 38 concepts (24 + 14 with two interpretations [strict/rel]) • Benchmark methods – Semantic Spaces (SESPA) [Hare & Lewis, 2010] – Multiple Kernel Learning (MKL) [Guillaumin et al., 2010] mklab.iti.gr #29
  • 30. GSF vs SESPA GSF-F1, F2, F3: Single feature GSF GSF-C: Graph structure feature fusion GSF-D1, D2: Result fusion using LIBLINEAR (1) and RBF (2) mklab.iti.gr #30
  • 31. GSF vs MKL VISUAL MKL better in: baby, bird, river, sea. Possible thanks to scalable behavior wrt. TAG number of features. GSF better in: baby, bird, car, dog, river, sea. mklab.iti.gr #31
  • 33. Evaluation: adding unlabeled samples (1) ~6% relative increase in mAP GIST mklab.iti.gr #33
  • 34. Evaluation: adding unlabeled samples (2) ~12% relative increase in mAP DenseSiftV3H1 mklab.iti.gr #34
  • 35. Evaluation: adding unlabeled samples (3) ~4% relative increase in mAP TagRaw50 mklab.iti.gr #35
  • 36. Overview • Problem formulation • Related work • Graph Structure Features Approach • Evaluation – Synthetic datasets – MIR-Flickr • Conclusions mklab.iti.gr #36
  • 37. Conclusions • Concept detection approach based on the structure of image similarity graphs – Transductive learning setting – Two variants for online learning • Thorough experimental analysis – Behavior under a variety of settings/parameters – Equivalent or better behavior compared to SoA approaches • Fast: – SA with k=5 takes 38.4msec per image (not incl. feature extraction) – Future work: further analysis of computational characteristics + application to larger scale datasets (NUS-Wide, ImageNet) mklab.iti.gr #37
  • 38. Thank you Further contact: papadop@iti.gr www.socialsensor.eu mklab.iti.gr #38
  • 39. References (1) • Graph-based semi-supervised learning Zhu, X.: Semi-supervised learning with graphs. PhD Thesis, Carnegie Mellon University, 0-542-19059-1 (2005) Zhou, D., Bousquet, O., Navin Lal, T., Weston, J. Schoelkopf, B.: Learning with Local and Global Consistency. Advances in NIPS 16, MIT Press (2004) 321-328 • Related approaches Wang, M., Hua, X.-S. Tang, J., Hong, R.: Beyond distance measurement: constructing neighborhood similarity for video annotation. TMM 11 (3) (2009), 465-476 Tang, J. et al.: Inferring semantic concepts from community contributed images and noisy tags. ACM Multimedia (2009) 223-232 Chen, X. et al.: Efficient large scale image annotation by probabilistic collaborative multi-label propagation. ACM Multimedia (2010), 35-44 Tang, L., Liu, H.: Leveraging social media networks for classification. Data Mining and Knowledge Discovery 23 (3) (2011), 447-478 mklab.iti.gr #39
  • 40. References (2) • Relational classification Macskassy, S.A., Provost, F.: Classification in Networked Data: A Toolkit and a Univariate Case Study. Journal of Machine Learning Research 8, (2007), 935-983 • Laplacian Eigenmaps Mikhail, B., Partha, N.: Laplacian Eigenmaps for dimensionality reduction and data representation. Neural Computing 15 (6), MIT Press (2003) 1373-1396 Jia, P., Yin, J., Huang, X., Hu, D.: Incremental Laplacian eigenmaps by preserving adjacent information between data points. PR Letters 30 (16) (2009), 1457–1463 mklab.iti.gr #40
  • 41. References (3) • Tools Leyffer, S., Mahajan, A.: Nonlinear Constrained Optimization: Methods and Software. Preprint ANL/MCS-P1729-0310 (2010) Fan, R., Chang, K., Hsieh, C., Wang, X., Lin, C.: LIBLINEAR: A Library for Large Linear Classification. Journal of ML Research 9 (2008), 1871-1874 Chang, C.-C., Lin, C.-J.: LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2 (3) (2011), 27:1–27:27 • Dataset Huiskes, M.J., Michael S. Lew, M.S.: The MIR Flickr Retrieval Evaluation. Proceedings of ACM Intern. Conf. on Multimedia Information Retrieval (2008) • Competing methods Hare, J.S., Lewis, P.H.: Automatically annotating the MIR Flickr dataset. ACM ICMR (2010), 547-556 Guillaumin, M., Verbeek, J., Schmid, C.: Multimodal semi supervised learning for image classification. Proceedings of IEEE CVPR Conference (2010), 902-909 mklab.iti.gr #41