SlideShare ist ein Scribd-Unternehmen logo
1 von 35
A Fuzzy Associative Rule-
    based Approach for Pattern
     Mining and Pattern-based
                 Classification

                                         Ashish Mangalampalli
                                       Advisor: Dr. Vikram Pudi
                                   Centre for Data Engineering
       International Institute of Information Technology (IIIT)
                                                     Hyderabad
1
Outline
       Introduction
       Crisp and Fuzzy Associative Classification

       Pre-Processing and Mining
           Fuzzy Pre-Processing – FPrep
           Fuzzy ARM – FAR-Miner and FAR-HD

       Associative Classification – Our Approach
           FACISME – Fuzzy Adaption of ACME (Maximum Entropy Associative Classifier)
           Simple and Effective Associative Classifier (SEAC)
           Fuzzy Simple and Effective Associative Classifier (FSEAC)

       Associative Classification – Applications
           Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)
           Associative Classifier for Ad-targeting

       Conclusions
    2
Introduction
       Associative classification
           Mines huge amounts of data
           Integrates Association Rule Mining (ARM) with Classification

                         A = a, B = b, C = c → X = x

       Associative classifiers have several advantages
           Frequent itemsets capture dominant relationships between
            items/features
           Statistically significant associations make classification
            framework robust
           Low-frequency patterns (noise) are eliminated during ARM
           Rules are very transparent and easily understood
               Unlike black-box-like approach used in popular classifiers, such as
                SVMs and Artificial Neural Networks

    3
Outline
       Introduction

       Crisp and Fuzzy Associative Classification
       Pre-Processing and Mining
           Fuzzy Pre-Processing – FPrep
           Fuzzy ARM – FAR-Miner and FAR-HD

       Associative Classification – Our Approach
           Simple and Effective Associative Classifier (SEAC)
           Fuzzy Simple and Effective Associative Classifier (FSEAC)

       Associative Classification – Applications
           Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)
           Associative Classifier for Ad-targeting

       Conclusions

    4
Crisp Associative Classification
       Most associative classifiers are crisp
           Most real-life datasets contain binary and numerical attributes
           Use sharp partitioning
           Transform numerical attributes to binary ones, e.g. Income =
            [100K and above]

       Drawbacks of sharp partitioning
           Introduces uncertainty, especially at partition boundaries
           Small changes in intervals lead to misleading results
           Gives rise to polysemy and synonymy
           Intervals do not generally have clear semantics associated

       For example, sharp partitions for the attribute Income
           Up to 20K, 20K-100K, 100K and above
           Income = 50K would fit in the second partition
           But, so would Income = 99K
    5
Fuzzy Associative Classification
       Fuzzy logic
           Used to convert numerical attributes to fuzzy attributes
            (e.g. Income = High)
           Maintains integrity of information conveyed by numerical
            attributes
           Attribute values belong to partitions with some
            membership - interval [0, 1]




    6
Outline
       Introduction

       Crisp and Fuzzy Associative Classification

       Pre-Processing and Mining
           Fuzzy Pre-Processing – FPrep
           Fuzzy ARM – FAR-Miner and FAR-HD

       Associative Classification – Our Approach
           Simple and Effective Associative Classifier (SEAC)
           Fuzzy Simple and Effective Associative Classifier (FSEAC)

       Associative Classification – Applications
           Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)
           Associative Classifier for Ad-targeting

       Conclusions

    7
Pre-Processing and Mining
       Fuzzy pre-processing
           Convert crisp dataset (binary and numerical attributes)
            into fuzzy dataset (binary and fuzzy attributes)
           FPrep Algorithm used

       Efficient and robust Fuzzy ARM algorithms
           Web-scale datasets mandate such algorithms
           Fuzzy Apriori is most popular
           Many efficient crisp ARM algorithms exist like ARMOR
            and FP-Growth
           Algorithms used
               FAR-Miner for normal transactional datasets
               FAR-HD for high dimensional datasets

    8
Outline
    Introduction

    Crisp and Fuzzy Associative Classification

    Pre-Processing and Mining
        Fuzzy Pre-Processing – FPrep
        Fuzzy ARM – FAR-Miner and FAR-HD


    Associative Classification – Our Approach
        Simple and Effective Associative Classifier (SEAC)
        Fuzzy Simple and Effective Associative Classifier (FSEAC)

    Associative Classification – Applications
        Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)
        Associative Classifier for Ad-targeting


    Conclusions



    13
Associative Classification – Our
Approach
    AC algorithms like CPAR and CMAR only mine frequent
     itemsets
        Processed using additional (greedy) algorithms like FOIL and PRM
        Overhead in running time; process more complex

    Association rules directly used for training and scoring
        Exhaustive approach
            Controlled by appropriate support
            Not a time-intensive process
        Rule pruning and ranking take care of huge volume and
         redundancy

    Classifier built in a two-phased manner
        Global rule-mining and training
        Local rule-mining and training
        Provides better accuracy and representation/coverage
    14
Associative Classification – Our
Approach (cont’d)
    Pre-processing to generate fuzzy dataset (for fuzzy
     associative classifiers) using FPrep

    Classification Association Rules (CARs) mining using
     FAR-Miner or FAR-HD

    CARs pruning and classifier training using SEAC or
     FSEAC

    Rule ranking and application (scoring) techniques

    15
Simple and Effective Associative
Classifier (SEAC)
   Direct mining of CARs –
    faster and simpler training

   CARs used directly through
    effective pruning and sorting

   Pruning and rule-ranking
    based on
        Information gain
        Rule-length

   Two-phased manner
        Global rule-mining and training
        Local rule-mining and training


    16
SEAC - Example




Example Dataset

Scoring Example
Unlabeled: B=2, C=2
X=1 → 16, 17, 19 (IG=0.534)
X=2 → 13, 14, 20 (IG=0.657)


                              Ruleset
   17
Fuzzy Simple and Effective Associative
Classifier (FSEAC)
    Amalgamates Fuzzy Logic with Associative Classification

    Pre-processed using FPreP

    CARs mined using FAR-Miner / FAR-HD

    CARs pruned based on Fuzzy Information Gain (FIG)
     and rule length - no sorting required

    Scoring – rules applied taking µ into account
        Sorting done then
        Final score computed



    18
FSEAC - Example


                Format for Fuzzy Version of Dataset




     Example Dataset             Fuzzy Version of Example Dataset
19
FSEAC – Example (cont’d)




               Ruleset
20
SEAC and FSEAC Experimental Setup
    SEAC
        12 classifiers (Associative and non-associative)
        14 UCI ML datasets
        100-5000 records per dataset
        2-10 classes per dataset
        Up to 20 features per dataset
        10-fold Cross Validation

    FSEAC
        17 classifiers (Associative and non-associative; fuzzy and crisp)
        23 UCI ML datasets
        100-5000 records per dataset
        2-10 classes per dataset
        Up to 60 features per dataset
        10-fold Cross Validation

    21
SEAC – Results (10 fold-CV)




                              continued


22
SEAC - Results (10 fold-CV)




23
FSEAC - Results (10 fold-CV)




                               continued
24
FSEAC - Results (10 fold-CV)




25
Outline
    Introduction

    Crisp and Fuzzy Associative Classification

    Pre-Processing and Mining
        Fuzzy Pre-Processing – FPrep
        Fuzzy ARM – FAR-Miner and FAR-HD

    Associative Classification – Our Approach
        Simple and Effective Associative Classifier (SEAC)
        Fuzzy Simple and Effective Associative Classifier (FSEAC)


    Associative Classification – Applications
        Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)
        Associative Classifier for Ad-targeting

    Conclusions

    26
Efficient Fuzzy Associative Classifier for
Object Classes in Images (I-FAC)
    Adapts fuzzy associative classification for Object Class
     Detection in images
         Speeded-Up Robust Features (SURF) - interest point detector
          and descriptor for images
         Fuzzy clusters used as opposed to hard clustering used in Bag-
          of-words

    Only positive class (CP) examples used for mining
         Negative class (CN) in object class detection is very vague
             CN = U – CP


        Rules are pruned and ranked based on Information Gain
         Other AC algorithms use third-party algorithms for rule-
          generation from frequent itemsets
         Top k rules are used for scoring and classification

    27                                         ICPR 2010
I-FAC
    SURF points extracted from positive class images
        FCM applied to derive clusters
        Clusters (with µs) used to generate dataset for mining
            100 fuzzy clusters as opposed to1000-2000 crisp clusters-based algorithms




    ARM generates Classification Association Rules (CARs)
     associated with positive class

    CARs are pruned and sorted using
        Fuzzy Information Gain (FIG) of each rule
        Length of each rule i.e. number of attributes in each rule

    Scoring based on rule-match and FIG
    28                                                ICPR 2010
I-FAC - Performance Study
   Performs well when
    compared to BOW or SVM
       Very well at low FPRs (≤0.3)


   Fuzzy nature helps avoid
    polysemy and synonymy

   Uses only positive class
    for training



    30                                 ICPR 2010
Visual Concept Detection on MIR Flickr
    Revamped version of I-FAC

    Multi-class detection
        38 visual concepts
        e.g. car, sky, clouds, water, building, sea, face

    Experimental evaluation
        First 10K images of MIR Flick dataset
        AUC values for each concept




    31
Experimental Results (3-fold CV)




                                   continued
32
Experimental Results (3-fold CV)




33
Look-alike Modeling using Feature-Pair-
based Associative Classification
    Display-ad targeting currently done using methods which rely
     on publisher-defined segments like Behavior-targeting (BT)

    Look-alike model trained to identify similar users
        Similarity is based on historical user behavior
        Model iteratively rebuilt as more users are added
        Advertiser supplies seed list of users

    Approach for building advertiser specific audience segments
        Complements publisher defined segments such as BT
        Provides advertisers control over the audience definition

    Given a list of target users (e.g., people who clicked or
     converted on a particular category or ad campaign), find other
     similar users.

    34                                         WWW 2011
Look-alike Modeling using Feature-Pair-
based Associative Classification – cont’d
    Enumerate all feature-pairs in training set occurring in at
     least 5 positive-class records
        Feature-pairs modelled as AC rules
        Only rules for positive class used
        Works well in Tail Campaigns


    Affinity measured by Frequency-weighted LLR (F-LLR)
        FLLR = P(f) log(P(f | conv) / P(f | non-conv))
        Rules sorted in descending order by F-LLRs


    Scoring - Top k rules are applied
        Cumulative score from all rules used for classification

    35                                        WWW 2011
Performance Study
    Two pilot campaigns
        300K records each                             Lift
                                      Baseline     (Conversion          Lift (AUC)
        One record per user
                                                      Rate)
        Training window - 14          Random
         days                                         82%                   –
                                      Targeting
        Scoring window - seven      Linear SVM       301%                11%
         days
                                        GBDT          100%                2%

    Works very well for Tail              Results on a Tail Campaign
     Campaigns
        Can find meaningful                                                      Lift
                                        Baseline       Lift (Conversion Rate)
         associations in extremely                                               (AUC)
         sparse and skewed data          Random
                                                                 48%                  –
                                        Targeting
    SVM and GBDT work                 Linear SVM                -12%                 -6%
     well for Head Campaigns              GBDT                   -40%                -14%
                                            Results on a Head Campaign
    36                                             WWW 2011
Outline
    Introduction

    Crisp and Fuzzy Associative Classification

    Pre-Processing and Mining
        Fuzzy Pre-Processing – FPrep
        Fuzzy ARM – FAR-Miner and FAR-HD

    Associative Classification – Our Approach
        Simple and Effective Associative Classifier (SEAC)
        Fuzzy Simple and Effective Associative Classifier (FSEAC)
    Associative Classification – Applications
        Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)
        Associative Classifier for Ad-targeting


    Conclusions

    37
Conclusions
    Fuzzy pre-processing for dataset transformation

    Fuzzy ARM for various types of datasets

    Fuzzy and Crisp Associative Classifiers for various
     domains
        Customizations required for different domains
            Pre-processing
            Pruning
            Rule ranking techniques
            Rule application (scoring) techniques


    38
References
    Ashish Mangalampalli, Adwait Ratnaparkhi, Andrew O. Hatch, Abraham Bagherjeiran,
     Rajesh Parekh, and Vikram Pudi. A Feature-Pair-based Associative Classification
     Approach to Look-alike Modeling for Conversion-Oriented User-Targeting in Tail
     Campaigns. In International World Wide Web Conference (WWW), 2011.

    Ashish Mangalampalli, Vineet Chaoji, and Subhajit Sanyal. I-FAC: Efficient fuzzy
     associative classifier for object classes in images. In International Conference on
     Pattern Recognition (ICPR), 2010.

    Ashish Mangalampalli and Vikram Pudi. FPrep: Fuzzy clustering driven efficient
     automated pre-processing for fuzzy association rule mining. In IEEE International
     Conference on Fuzzy Systems (FUZZ-IEEE), 2010.

    Ashish Mangalampalli and Vikram Pudi. FACISME: Fuzzy associative classification
     using iterative scaling and maximum entropy. In IEEE International Conference on
     Fuzzy Systems (FUZZ-IEEE), 2010.

    Ashish Mangalampalli and Vikram Pudi. Fuzzy Association Rule Mining Algorithm for
     Fast and Efficient Performance on Very Large Datasets. In IEEE International
     Conference on Fuzzy Systems (FUZZ-IEEE), 2009.



    39
Thank You, and
       Questions




40

Weitere ähnliche Inhalte

Andere mochten auch

26 Machine Learning Unsupervised Fuzzy C-Means
26 Machine Learning Unsupervised Fuzzy C-Means26 Machine Learning Unsupervised Fuzzy C-Means
26 Machine Learning Unsupervised Fuzzy C-MeansAndres Mendez-Vazquez
 
PhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsPhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsdgarijo
 
Mining Fuzzy Moving Object Clusters
Mining Fuzzy Moving Object ClustersMining Fuzzy Moving Object Clusters
Mining Fuzzy Moving Object ClustersNhatHai Phan
 
Python programming advance lab api npr 2
Python programming advance lab api npr  2Python programming advance lab api npr  2
Python programming advance lab api npr 2profbnk
 
Fuzzy and Neural Approaches in Engineering MATLAB
Fuzzy and Neural Approaches in Engineering MATLABFuzzy and Neural Approaches in Engineering MATLAB
Fuzzy and Neural Approaches in Engineering MATLABESCOM
 
Market basketanalysis using r
Market basketanalysis using rMarket basketanalysis using r
Market basketanalysis using rYogesh Khandelwal
 
Market Basket Analysis in SAS
Market Basket Analysis in SASMarket Basket Analysis in SAS
Market Basket Analysis in SASAndrew Kramer
 
Data mining- Association Analysis -market basket
Data mining- Association Analysis -market basketData mining- Association Analysis -market basket
Data mining- Association Analysis -market basketSwapnil Soni
 
Masket Basket Analysis
Masket Basket AnalysisMasket Basket Analysis
Masket Basket AnalysisMarc Berman
 
Market Basket Analysis
Market Basket AnalysisMarket Basket Analysis
Market Basket AnalysisMahendra Gupta
 
Real-time Market Basket Analysis for Retail with Hadoop
Real-time Market Basket Analysis for Retail with HadoopReal-time Market Basket Analysis for Retail with Hadoop
Real-time Market Basket Analysis for Retail with HadoopDataWorks Summit
 
Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Namrata Saxena
 
Data mining slides
Data mining slidesData mining slides
Data mining slidessmj
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsJTADrexel
 

Andere mochten auch (16)

26 Machine Learning Unsupervised Fuzzy C-Means
26 Machine Learning Unsupervised Fuzzy C-Means26 Machine Learning Unsupervised Fuzzy C-Means
26 Machine Learning Unsupervised Fuzzy C-Means
 
PhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsPhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflows
 
Mining Fuzzy Moving Object Clusters
Mining Fuzzy Moving Object ClustersMining Fuzzy Moving Object Clusters
Mining Fuzzy Moving Object Clusters
 
Python programming advance lab api npr 2
Python programming advance lab api npr  2Python programming advance lab api npr  2
Python programming advance lab api npr 2
 
Fuzzy and Neural Approaches in Engineering MATLAB
Fuzzy and Neural Approaches in Engineering MATLABFuzzy and Neural Approaches in Engineering MATLAB
Fuzzy and Neural Approaches in Engineering MATLAB
 
Market basketanalysis using r
Market basketanalysis using rMarket basketanalysis using r
Market basketanalysis using r
 
Market Basket Analysis in SAS
Market Basket Analysis in SASMarket Basket Analysis in SAS
Market Basket Analysis in SAS
 
Data mining- Association Analysis -market basket
Data mining- Association Analysis -market basketData mining- Association Analysis -market basket
Data mining- Association Analysis -market basket
 
Market basket analysis
Market basket analysisMarket basket analysis
Market basket analysis
 
Masket Basket Analysis
Masket Basket AnalysisMasket Basket Analysis
Masket Basket Analysis
 
Market Basket Analysis
Market Basket AnalysisMarket Basket Analysis
Market Basket Analysis
 
Real-time Market Basket Analysis for Retail with Hadoop
Real-time Market Basket Analysis for Retail with HadoopReal-time Market Basket Analysis for Retail with Hadoop
Real-time Market Basket Analysis for Retail with Hadoop
 
Market baasket analysis
Market baasket analysisMarket baasket analysis
Market baasket analysis
 
Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)Correlation of subjects in school (b.ed notes)
Correlation of subjects in school (b.ed notes)
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 

Ähnlich wie PhD Defense -- Ashish Mangalampalli

Scaling Up And Speeding Up Video Analytics Inside Database Engine
Scaling Up And Speeding Up Video Analytics Inside Database EngineScaling Up And Speeding Up Video Analytics Inside Database Engine
Scaling Up And Speeding Up Video Analytics Inside Database EngineRui Liu
 
Study of Density Based Clustering Techniques on Data Streams
Study of Density Based Clustering Techniques on Data StreamsStudy of Density Based Clustering Techniques on Data Streams
Study of Density Based Clustering Techniques on Data StreamsIJERA Editor
 
lecture14-learning-ranking.ppt
lecture14-learning-ranking.pptlecture14-learning-ranking.ppt
lecture14-learning-ranking.pptVishalKumar725248
 
lecture14-learning-ranking.ppt
lecture14-learning-ranking.pptlecture14-learning-ranking.ppt
lecture14-learning-ranking.pptVishalKumar725248
 
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET Journal
 
Adaptive Neuro-Fuzzy Inference System (ANFIS) for segmentation of image ROI a...
Adaptive Neuro-Fuzzy Inference System (ANFIS) for segmentation of image ROI a...Adaptive Neuro-Fuzzy Inference System (ANFIS) for segmentation of image ROI a...
Adaptive Neuro-Fuzzy Inference System (ANFIS) for segmentation of image ROI a...IRJET Journal
 
Forecasting time series for business and operations data: A tutorial
Forecasting time series for business and operations data: A tutorialForecasting time series for business and operations data: A tutorial
Forecasting time series for business and operations data: A tutorialColleen Farrelly
 
Real-Time Pertinent Maneuver Recognition for Surveillance
Real-Time Pertinent Maneuver Recognition for SurveillanceReal-Time Pertinent Maneuver Recognition for Surveillance
Real-Time Pertinent Maneuver Recognition for SurveillanceIRJET Journal
 
Multisensor Data Fusion : Techno Briefing
Multisensor Data Fusion : Techno BriefingMultisensor Data Fusion : Techno Briefing
Multisensor Data Fusion : Techno BriefingPaveen Juntama
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningHoa Le
 
Topology Aware Resource Allocation
Topology Aware Resource AllocationTopology Aware Resource Allocation
Topology Aware Resource AllocationSujith Jay Nair
 
SVM-KNN Hybrid Method for MR Image
SVM-KNN Hybrid Method for MR ImageSVM-KNN Hybrid Method for MR Image
SVM-KNN Hybrid Method for MR ImageIRJET Journal
 
Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)
Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)
Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)Eswar Publications
 
Svm Classifier Algorithm for Data Stream Mining Using Hive and R
Svm Classifier Algorithm for Data Stream Mining Using Hive and RSvm Classifier Algorithm for Data Stream Mining Using Hive and R
Svm Classifier Algorithm for Data Stream Mining Using Hive and RIRJET Journal
 
Assisting Code Search with Automatic Query Reformulation for Bug Localization
Assisting Code Search with Automatic Query Reformulation for Bug LocalizationAssisting Code Search with Automatic Query Reformulation for Bug Localization
Assisting Code Search with Automatic Query Reformulation for Bug LocalizationBunyamin Sisman
 
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarksVauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarksMumtaz Hannah Vauhkonen
 

Ähnlich wie PhD Defense -- Ashish Mangalampalli (20)

Scaling Up And Speeding Up Video Analytics Inside Database Engine
Scaling Up And Speeding Up Video Analytics Inside Database EngineScaling Up And Speeding Up Video Analytics Inside Database Engine
Scaling Up And Speeding Up Video Analytics Inside Database Engine
 
E502024047
E502024047E502024047
E502024047
 
E502024047
E502024047E502024047
E502024047
 
Study of Density Based Clustering Techniques on Data Streams
Study of Density Based Clustering Techniques on Data StreamsStudy of Density Based Clustering Techniques on Data Streams
Study of Density Based Clustering Techniques on Data Streams
 
lecture14-learning-ranking.ppt
lecture14-learning-ranking.pptlecture14-learning-ranking.ppt
lecture14-learning-ranking.ppt
 
lecture14-learning-ranking.ppt
lecture14-learning-ranking.pptlecture14-learning-ranking.ppt
lecture14-learning-ranking.ppt
 
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
 
Adaptive Neuro-Fuzzy Inference System (ANFIS) for segmentation of image ROI a...
Adaptive Neuro-Fuzzy Inference System (ANFIS) for segmentation of image ROI a...Adaptive Neuro-Fuzzy Inference System (ANFIS) for segmentation of image ROI a...
Adaptive Neuro-Fuzzy Inference System (ANFIS) for segmentation of image ROI a...
 
Forecasting time series for business and operations data: A tutorial
Forecasting time series for business and operations data: A tutorialForecasting time series for business and operations data: A tutorial
Forecasting time series for business and operations data: A tutorial
 
Real-Time Pertinent Maneuver Recognition for Surveillance
Real-Time Pertinent Maneuver Recognition for SurveillanceReal-Time Pertinent Maneuver Recognition for Surveillance
Real-Time Pertinent Maneuver Recognition for Surveillance
 
Multisensor Data Fusion : Techno Briefing
Multisensor Data Fusion : Techno BriefingMultisensor Data Fusion : Techno Briefing
Multisensor Data Fusion : Techno Briefing
 
AI and Deep Learning
AI and Deep Learning AI and Deep Learning
AI and Deep Learning
 
B4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearningB4UConference_machine learning_deeplearning
B4UConference_machine learning_deeplearning
 
Topology Aware Resource Allocation
Topology Aware Resource AllocationTopology Aware Resource Allocation
Topology Aware Resource Allocation
 
Stock Market Prediction Using ANN
Stock Market Prediction Using ANNStock Market Prediction Using ANN
Stock Market Prediction Using ANN
 
SVM-KNN Hybrid Method for MR Image
SVM-KNN Hybrid Method for MR ImageSVM-KNN Hybrid Method for MR Image
SVM-KNN Hybrid Method for MR Image
 
Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)
Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)
Machine learning in Dynamic Adaptive Streaming over HTTP (DASH)
 
Svm Classifier Algorithm for Data Stream Mining Using Hive and R
Svm Classifier Algorithm for Data Stream Mining Using Hive and RSvm Classifier Algorithm for Data Stream Mining Using Hive and R
Svm Classifier Algorithm for Data Stream Mining Using Hive and R
 
Assisting Code Search with Automatic Query Reformulation for Bug Localization
Assisting Code Search with Automatic Query Reformulation for Bug LocalizationAssisting Code Search with Automatic Query Reformulation for Bug Localization
Assisting Code Search with Automatic Query Reformulation for Bug Localization
 
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarksVauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
VauhkonenVohraMadaan-ProjectDeepLearningBenchMarks
 

Kürzlich hochgeladen

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 

PhD Defense -- Ashish Mangalampalli

  • 1. A Fuzzy Associative Rule- based Approach for Pattern Mining and Pattern-based Classification Ashish Mangalampalli Advisor: Dr. Vikram Pudi Centre for Data Engineering International Institute of Information Technology (IIIT) Hyderabad 1
  • 2. Outline  Introduction  Crisp and Fuzzy Associative Classification  Pre-Processing and Mining  Fuzzy Pre-Processing – FPrep  Fuzzy ARM – FAR-Miner and FAR-HD  Associative Classification – Our Approach  FACISME – Fuzzy Adaption of ACME (Maximum Entropy Associative Classifier)  Simple and Effective Associative Classifier (SEAC)  Fuzzy Simple and Effective Associative Classifier (FSEAC)  Associative Classification – Applications  Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)  Associative Classifier for Ad-targeting  Conclusions 2
  • 3. Introduction  Associative classification  Mines huge amounts of data  Integrates Association Rule Mining (ARM) with Classification A = a, B = b, C = c → X = x  Associative classifiers have several advantages  Frequent itemsets capture dominant relationships between items/features  Statistically significant associations make classification framework robust  Low-frequency patterns (noise) are eliminated during ARM  Rules are very transparent and easily understood  Unlike black-box-like approach used in popular classifiers, such as SVMs and Artificial Neural Networks 3
  • 4. Outline  Introduction  Crisp and Fuzzy Associative Classification  Pre-Processing and Mining  Fuzzy Pre-Processing – FPrep  Fuzzy ARM – FAR-Miner and FAR-HD  Associative Classification – Our Approach  Simple and Effective Associative Classifier (SEAC)  Fuzzy Simple and Effective Associative Classifier (FSEAC)  Associative Classification – Applications  Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)  Associative Classifier for Ad-targeting  Conclusions 4
  • 5. Crisp Associative Classification  Most associative classifiers are crisp  Most real-life datasets contain binary and numerical attributes  Use sharp partitioning  Transform numerical attributes to binary ones, e.g. Income = [100K and above]  Drawbacks of sharp partitioning  Introduces uncertainty, especially at partition boundaries  Small changes in intervals lead to misleading results  Gives rise to polysemy and synonymy  Intervals do not generally have clear semantics associated  For example, sharp partitions for the attribute Income  Up to 20K, 20K-100K, 100K and above  Income = 50K would fit in the second partition  But, so would Income = 99K 5
  • 6. Fuzzy Associative Classification  Fuzzy logic  Used to convert numerical attributes to fuzzy attributes (e.g. Income = High)  Maintains integrity of information conveyed by numerical attributes  Attribute values belong to partitions with some membership - interval [0, 1] 6
  • 7. Outline  Introduction  Crisp and Fuzzy Associative Classification  Pre-Processing and Mining  Fuzzy Pre-Processing – FPrep  Fuzzy ARM – FAR-Miner and FAR-HD  Associative Classification – Our Approach  Simple and Effective Associative Classifier (SEAC)  Fuzzy Simple and Effective Associative Classifier (FSEAC)  Associative Classification – Applications  Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)  Associative Classifier for Ad-targeting  Conclusions 7
  • 8. Pre-Processing and Mining  Fuzzy pre-processing  Convert crisp dataset (binary and numerical attributes) into fuzzy dataset (binary and fuzzy attributes)  FPrep Algorithm used  Efficient and robust Fuzzy ARM algorithms  Web-scale datasets mandate such algorithms  Fuzzy Apriori is most popular  Many efficient crisp ARM algorithms exist like ARMOR and FP-Growth  Algorithms used  FAR-Miner for normal transactional datasets  FAR-HD for high dimensional datasets 8
  • 9. Outline  Introduction  Crisp and Fuzzy Associative Classification  Pre-Processing and Mining  Fuzzy Pre-Processing – FPrep  Fuzzy ARM – FAR-Miner and FAR-HD  Associative Classification – Our Approach  Simple and Effective Associative Classifier (SEAC)  Fuzzy Simple and Effective Associative Classifier (FSEAC)  Associative Classification – Applications  Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)  Associative Classifier for Ad-targeting  Conclusions 13
  • 10. Associative Classification – Our Approach  AC algorithms like CPAR and CMAR only mine frequent itemsets  Processed using additional (greedy) algorithms like FOIL and PRM  Overhead in running time; process more complex  Association rules directly used for training and scoring  Exhaustive approach  Controlled by appropriate support  Not a time-intensive process  Rule pruning and ranking take care of huge volume and redundancy  Classifier built in a two-phased manner  Global rule-mining and training  Local rule-mining and training  Provides better accuracy and representation/coverage 14
  • 11. Associative Classification – Our Approach (cont’d)  Pre-processing to generate fuzzy dataset (for fuzzy associative classifiers) using FPrep  Classification Association Rules (CARs) mining using FAR-Miner or FAR-HD  CARs pruning and classifier training using SEAC or FSEAC  Rule ranking and application (scoring) techniques 15
  • 12. Simple and Effective Associative Classifier (SEAC)  Direct mining of CARs – faster and simpler training  CARs used directly through effective pruning and sorting  Pruning and rule-ranking based on  Information gain  Rule-length  Two-phased manner  Global rule-mining and training  Local rule-mining and training 16
  • 13. SEAC - Example Example Dataset Scoring Example Unlabeled: B=2, C=2 X=1 → 16, 17, 19 (IG=0.534) X=2 → 13, 14, 20 (IG=0.657) Ruleset 17
  • 14. Fuzzy Simple and Effective Associative Classifier (FSEAC)  Amalgamates Fuzzy Logic with Associative Classification  Pre-processed using FPreP  CARs mined using FAR-Miner / FAR-HD  CARs pruned based on Fuzzy Information Gain (FIG) and rule length - no sorting required  Scoring – rules applied taking µ into account  Sorting done then  Final score computed 18
  • 15. FSEAC - Example Format for Fuzzy Version of Dataset Example Dataset Fuzzy Version of Example Dataset 19
  • 16. FSEAC – Example (cont’d) Ruleset 20
  • 17. SEAC and FSEAC Experimental Setup  SEAC  12 classifiers (Associative and non-associative)  14 UCI ML datasets  100-5000 records per dataset  2-10 classes per dataset  Up to 20 features per dataset  10-fold Cross Validation  FSEAC  17 classifiers (Associative and non-associative; fuzzy and crisp)  23 UCI ML datasets  100-5000 records per dataset  2-10 classes per dataset  Up to 60 features per dataset  10-fold Cross Validation 21
  • 18. SEAC – Results (10 fold-CV) continued 22
  • 19. SEAC - Results (10 fold-CV) 23
  • 20. FSEAC - Results (10 fold-CV) continued 24
  • 21. FSEAC - Results (10 fold-CV) 25
  • 22. Outline  Introduction  Crisp and Fuzzy Associative Classification  Pre-Processing and Mining  Fuzzy Pre-Processing – FPrep  Fuzzy ARM – FAR-Miner and FAR-HD  Associative Classification – Our Approach  Simple and Effective Associative Classifier (SEAC)  Fuzzy Simple and Effective Associative Classifier (FSEAC)  Associative Classification – Applications  Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)  Associative Classifier for Ad-targeting  Conclusions 26
  • 23. Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)  Adapts fuzzy associative classification for Object Class Detection in images  Speeded-Up Robust Features (SURF) - interest point detector and descriptor for images  Fuzzy clusters used as opposed to hard clustering used in Bag- of-words  Only positive class (CP) examples used for mining  Negative class (CN) in object class detection is very vague  CN = U – CP  Rules are pruned and ranked based on Information Gain  Other AC algorithms use third-party algorithms for rule- generation from frequent itemsets  Top k rules are used for scoring and classification 27 ICPR 2010
  • 24. I-FAC  SURF points extracted from positive class images  FCM applied to derive clusters  Clusters (with µs) used to generate dataset for mining  100 fuzzy clusters as opposed to1000-2000 crisp clusters-based algorithms  ARM generates Classification Association Rules (CARs) associated with positive class  CARs are pruned and sorted using  Fuzzy Information Gain (FIG) of each rule  Length of each rule i.e. number of attributes in each rule  Scoring based on rule-match and FIG 28 ICPR 2010
  • 25. I-FAC - Performance Study  Performs well when compared to BOW or SVM  Very well at low FPRs (≤0.3)  Fuzzy nature helps avoid polysemy and synonymy  Uses only positive class for training 30 ICPR 2010
  • 26. Visual Concept Detection on MIR Flickr  Revamped version of I-FAC  Multi-class detection  38 visual concepts  e.g. car, sky, clouds, water, building, sea, face  Experimental evaluation  First 10K images of MIR Flick dataset  AUC values for each concept 31
  • 27. Experimental Results (3-fold CV) continued 32
  • 29. Look-alike Modeling using Feature-Pair- based Associative Classification  Display-ad targeting currently done using methods which rely on publisher-defined segments like Behavior-targeting (BT)  Look-alike model trained to identify similar users  Similarity is based on historical user behavior  Model iteratively rebuilt as more users are added  Advertiser supplies seed list of users  Approach for building advertiser specific audience segments  Complements publisher defined segments such as BT  Provides advertisers control over the audience definition  Given a list of target users (e.g., people who clicked or converted on a particular category or ad campaign), find other similar users. 34 WWW 2011
  • 30. Look-alike Modeling using Feature-Pair- based Associative Classification – cont’d  Enumerate all feature-pairs in training set occurring in at least 5 positive-class records  Feature-pairs modelled as AC rules  Only rules for positive class used  Works well in Tail Campaigns  Affinity measured by Frequency-weighted LLR (F-LLR)  FLLR = P(f) log(P(f | conv) / P(f | non-conv))  Rules sorted in descending order by F-LLRs  Scoring - Top k rules are applied  Cumulative score from all rules used for classification 35 WWW 2011
  • 31. Performance Study  Two pilot campaigns  300K records each Lift Baseline (Conversion Lift (AUC)  One record per user Rate)  Training window - 14 Random days 82% – Targeting  Scoring window - seven Linear SVM 301% 11% days GBDT 100% 2%  Works very well for Tail Results on a Tail Campaign Campaigns  Can find meaningful Lift Baseline Lift (Conversion Rate) associations in extremely (AUC) sparse and skewed data Random 48% – Targeting  SVM and GBDT work Linear SVM -12% -6% well for Head Campaigns GBDT -40% -14% Results on a Head Campaign 36 WWW 2011
  • 32. Outline  Introduction  Crisp and Fuzzy Associative Classification  Pre-Processing and Mining  Fuzzy Pre-Processing – FPrep  Fuzzy ARM – FAR-Miner and FAR-HD  Associative Classification – Our Approach  Simple and Effective Associative Classifier (SEAC)  Fuzzy Simple and Effective Associative Classifier (FSEAC)  Associative Classification – Applications  Efficient Fuzzy Associative Classifier for Object Classes in Images (I-FAC)  Associative Classifier for Ad-targeting  Conclusions 37
  • 33. Conclusions  Fuzzy pre-processing for dataset transformation  Fuzzy ARM for various types of datasets  Fuzzy and Crisp Associative Classifiers for various domains  Customizations required for different domains  Pre-processing  Pruning  Rule ranking techniques  Rule application (scoring) techniques 38
  • 34. References  Ashish Mangalampalli, Adwait Ratnaparkhi, Andrew O. Hatch, Abraham Bagherjeiran, Rajesh Parekh, and Vikram Pudi. A Feature-Pair-based Associative Classification Approach to Look-alike Modeling for Conversion-Oriented User-Targeting in Tail Campaigns. In International World Wide Web Conference (WWW), 2011.  Ashish Mangalampalli, Vineet Chaoji, and Subhajit Sanyal. I-FAC: Efficient fuzzy associative classifier for object classes in images. In International Conference on Pattern Recognition (ICPR), 2010.  Ashish Mangalampalli and Vikram Pudi. FPrep: Fuzzy clustering driven efficient automated pre-processing for fuzzy association rule mining. In IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 2010.  Ashish Mangalampalli and Vikram Pudi. FACISME: Fuzzy associative classification using iterative scaling and maximum entropy. In IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 2010.  Ashish Mangalampalli and Vikram Pudi. Fuzzy Association Rule Mining Algorithm for Fast and Efficient Performance on Very Large Datasets. In IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), 2009. 39
  • 35. Thank You, and Questions 40