SlideShare ist ein Scribd-Unternehmen logo
1 von 6
Downloaden Sie, um offline zu lesen
The International Journal of Engineering
And Science (IJES)
||Volume|| 1 ||Issue|| 1 ||Pages|| 95-100 ||2012||
ISSN: 2319 – 1813 ISBN: 2319 – 1805

       Actionable Knowledge Discovery using Multi-Step Mining
                                    1
                                     DharaniK,2Kalpana Gudikandula
                     1
                         Department of CS, JNTU H, DRK College of Engineering and Technology
                                            Hyderabad, Andhra Pradesh, India
                          2
                            Department of IT, JNTU H, DRK Institute of Science and Technology
                                            Hyderabad, Andhra Pradesh, India




--------------------------------------------------------Abstract-------------------------------------------------------------------
Data mining is a process of obtaining trends or patterns in historical data. Such trends form business intelligence
that in turn leads to taking well informed decisions. However, data min ing with a single technique does not yield
actionable knowledge. Th is is because enterprises have huge databases and heterogeneous in nature. They also
have complex data and min ing such data needs multi-step min ing instead of single step mining. When mu ltiple
approaches are involved, they provide business intelligence in all aspects. That kind of information can lead to
actionable knowledge. Recently data min ing has got tremendous usage in the real world. Th e drawback of
existing approaches is that insufficient business intelligence in case of huge enterprises. This paper presents the
combination of existing works and algorithms. We work on multiple data sources, multip le methods and
mu ltip le features. The combined patterns thus obtained from co mplex business data provide actionable
knowledge. A prototype application has been built to test the efficiency of the proposed framework wh ich
combines mult iple data sources, mu ltiple methods and multip le features in min ing process. The empirical results
revealed that the proposed approach is effective and can be used in the real world.
Index Terms : Data mining, act ionable knowledge discovery, multi-method mining, mu lti-feature min ing,
mu lti-source mining
-------------------------------------------------------------------------------------------------------------------------------- -----
 Date of Submission: 19, November, 2012                                         Date of Publication: 30, November 2012
--------------------------------------------------------------------------------------------------------------------------------------

1. Introduction
           Data mining at enterprise level operates on huge amount of data such as government transactions,
banks, insurance companies and so on. Inevitably, these businesses produce complex data that might be
distributed in nature. When mining is made on such data with a single-step, it produces business intelligence as
a particular aspect. However, this is not sufficient in enterprise where different aspects and standpoints are to be
considered before taking business decisions. It is required that the enterprises p erform min ing based on multiple
features, data sources and methods. This is known as combined mining. The co mb ined min ing can produce
patterns that reflect all aspects of the enterprise. Thus the derived intelligence can be used to take business
decisions that lead to profits. This kind of knowledge is known as actionable knowledge. The act ionable
knowledge is discovered through multip le data sources, multip le methods and multip le features. The intelligence
thus obtained is dependable and reliable. As businesses are growing in complex data, there is a need for mining
on complex data. However, it is challenging to discover actionable knowledge using complex data sources. This
is because the generated business intelligence must be comprehensive and informat ion that provides enough
knowledge to take enterprise business decisions. There are many traditional methods being used in data min ing.
Therefore it is very challenging to comb ine them and use them to generate actionable knowledge. The
challenges in the mult i min ing process can be categorized into multip le data sources, mu ltiple methods, mu ltiple
features, post analysis and mining, joining mu ltip le relational database and data sampling as well. Data sampling
is generally not acceptable in real world data mining applications. Due to space and time limits combining
mu ltip le tables or joining them may not be possible. As data mining methods are developed for various data
sources keeping some assumptions in mind, it is challenging to combine them for discovering actionable
knowledge in real t ime applications. Co mbined association rules, comb ined rule clusters and combined rule
pairs concepts areproposed in [1], [22] and [25]. These papers exposed the mining on comp lex data with respect
to mult iple data sets and obtaining comprehensive knowledge. Co mbined association rule is nothing but a set of

www.theijes.com                                               The IJES                                                    Page 95
Actionable Knowledge Discovery Using Multi-Step…

mu ltip le item sets that are heterogeneous in nature. Co mb ined rules pairs are nothing but derived fro m co mbined
association rules by combining ru le clusters. The comb ined rule pairs are also derived fro m co mb ined rules
only. Traditional algorith ms such as FPGro wth [15] can’t be used to derive combined association rules. This
paper aims at presenting a new comprehensive framework for co mbined min ing. As a matter of fact, this p aper
makes use of existing methods or techniques as part of the framework. Therefore it integrates mu lti source
combined min ing, mu lti-method combined mining, and multi-feature co mbined mining. Multip le features might
include demographics of customer, behavior, business impacts and also transactional data. Multi method might
include clustering, classificat ion and so on. Multiple data sources do mean that the mining process takes data
fro m mu ltiple related data sources. The deliverables of the proposed framewo rk co mb ined patterns or
combinedassociation rules. The fo llo wing are the general aspects of comb ined min ing.
 Co mbined patterns generated by using various features can reflect the characteristics of enterprise closely
  and such business intelligence is always reliable.
 The combined min ing with respect to multip le data sources can give patterns that cover many aspects of the
  enterprise as data is taken from different data sources.
 Multiple methods mining results in patterns that reflect the in depth nature of data and also the advantages
  provided by various mining algorith ms are with this kind of co mb ined min ing.
 Metrics are of mult iple interestingness in nature can be applied to generated patterns that can verify the
  significance of generated patterns. Instead of providing a specific algorith m and get results pertaining to that
  algorith m, it is good idea to combine them and get more accurate and actionable knowledge.

          The contributions of this paper include the concept of combined mining that is based on the existing
works. It discusses the framework and various combined min ing such as multi-source combined mining, mult i-
method comb ined mining, and mu lti-feature co mbined mining. Provides various techniques for pattern
interaction and novel patterns such as clustered patterns, combined patterns etc. Interestingness metrics are
evaluated. Discovering practically the combined patterns or actionable knowledge from real world businesses
such as banks.

2. Related Work
          Mining is a process of extracting trends or patterns from historical data. These trends or patterns can
provide business intelligence that leads to actionable knowledge. There are many data mining methods or
algorith ms that exist for mining data to get patterns. However, all the existing algorithms are single-step mining
algorith ms. This means that they provide business intelligence inadequately. They may not be able to reflect the
complex needs of an enterprise to take decisions correctly. When multiple data mining techniques are combined
it is possible to get actionable knowledge that can cater to the needs of an enterprise. In this paper comb ining
mining algorith ms [1] have been implemented using a prototype application that demonstrates the efficiency of
combined min ing. The combined actionable knowledge can’t be provided by existing algorithms such as
FPGrowth [15]. The existing works on data mining operations on complex data or enterprise generally of
different types such as direct mining approaches; post min ing of patterns; data sets with extra feat ures; mu ltiple
methods integration; and also joining mult iple relational tables. Harmony [19] proposed an approach to mine for
discriminative patterns. Other such experiments include contrast patterns [5], model based search tree [6]. These
algorith ms attempted to use multip le features or mining techniques. Comb ined mining is best used to provide
actionable knowledge in spite of comp lex data sets, and features.There are four categories of combined mining
approaches in literature. A common ly used approach [28] is the post mining or post analysis of obtained
patterns. It is best used to prune the rules obtained after mining database or reducing redundancy or even
summarizing the patterns obtained [12]. In [1] co mbined min ing proposed contain direct mining methods. In [1
mu ltisource combined mining, mu lti-method combined mining and multi-feature comb ined mining were
introduced. These algorithms are used in this paper and implemented in the prototype application that is used to
demonstrate the efficiency of the combined mining. In these approaches also post mining of patterns is used
when it is necessary. Multisource combined mining considers data of various natures. It does mean that it
combines mu ltiple data sets as it is required by an enterprise which has data of different kinds. The result of
mu ltisource data mining is the actionable knowledge that reflects different angles of an enterprise.

         Multi-feature data mining is used to have data with multip le features. This kind of min ing also leads to
actionable knowledge that reflects the comp lex needs of an enterprise. Here mu ltip le features are used. For
instance the prototype application in this paper has been implemented using banking data containing customers’
demographic data and transactional data. Multi-method combined mining mixes up many existing data mining
techniques that are very useful independently for a particular purpose only. However, in mu lti-method combined

www.theijes.com                                       The IJES                                             Page 96
Actionable Knowledge Discovery Using Multi-Step…

mining various min ing techniques are combined to obtain actionable knowledge. For instance apriori and ID3
can be combined. The aprio ri algorith m provides association rules wh ile the ID3 is meant for p roviding decision
tree. The co mbination of these two can provide actionable knowledge that helps in taking well informed
decisions. Other data mining algorith ms that are existed and can be used in combination are clustering, rarity
mining, regression, association rule mining, sequence classification [15], rule based mining [7] etc. W ith the
help of combined min ing sequential and cluster patterns can be used further. It does mean that the result of one
method can be used in another method and the process can be repeated if required. It is also possible to have a
chain of mining operations until the desired actionable knowledge is derived.

3. PROPOSED MINING FRAMEWORK
           The proposed framework co mbines the existing algorith ms or methods to integrate multiple features,
mu ltip le data sources and multiple min ing methods like classification. The co mbined mining can be done with
mu ltip le features, mult iple data sources and multip le methods. The architecture of the proposed framework is
shown in fig. 1 which is co mmon framework for all these combined mining approaches.




                            Fig. 1 – A rchitecture of proposed min ing framework [1]

          The common arch itecture for mu lti-feature mining, mu lti-method mining and multi-source mining are
described here. There are multip le min ing approaches that make use of given data source (s) and perform one or
more min ing algorith ms or methods. Each mining approach produces a set of patterns. All the patterns obtained
fro m mu ltip le mining approaches are merged together. This is done by a component known as pattern merger.
Pattern merger is a program that combines all pattern sets obtained from various min ing techniques. The result
of pattern merging process is the final deliverables. The deliverables are nothing but actionable knowledge that
facilitates comprehensive decision making. Th is framework is useful to all enterprises where there is comp lexity
in business data and business intelligence has to cover many aspects of that business. Especially this framewo rk
is suitable for do mains like banking, insurance where monetary transactions are stored and maintained. The
historical data has to be mined and right decisions are to be made in case of a ll monetary decisions such as
issuing loans and other customer centric services. The algorith ms of [1] are used for the experiments made using
a prototype application. The ensuing sections present algorith ms and the experimental results.

4. Algorithms
       This section provides description and algorithms for mu ltisource combined mining, mult i-feature
combined mining and mult i-method combined mining.

4.1 Multisource Combined Mi ning
         The combination of mult iple data sources (D): The comb ined pattern set P consists o f mult iple ato mic
patterns identified in several data sources. By mining mu ltiple data sources, combined patterns are generated
which reflect mu ltip le aspects of nature across the business lines.




www.theijes.com                                      The IJES                                            Page 97
Actionable Knowledge Discovery Using Multi-Step…




                              Fig. 2 – A lgorith m for mu ltisource co mbined mining

4.2 Multi-Feature Combined Mini ng
          The combination of multip le features (F): The combined pattern set involves multiple features, namely,
e.g., features of customer demographics and behavior. By involv ing mu ltiple heterogeneous features, combined
patterns are generated which reflect mult iple aspects of concerns and characteristics in businesses.




                             Fig. 3 – A lgorith m for mu lti-feature co mb ined min ing

4.3 Multi-method Combi ned Mining
         Multi-method comb ined mining is another approach to discover more informat ive knowledge in
complex data. The focus of mult i-method combined min ing is on combining mu ltiple data mining algorithms as
needed in order to generate more informat ive knowledge. The co mbination of mu lt iple methods (R): The
patterns in the combined set reflect the results mined by multip le data min ing methods like association mining
and classification. By applying mu ltiple methods in pattern min ing, combined patterns are generated which
disclose a deep and comprehensive essence of data by taking advantage of different methods. The algorithms
used for multi-method combined mining are Apriori Algorith m and ID3 Algorithm. These two algorithms are
well known and existing algorith ms in data mining do main.

5. Experimental Results
         The environment used to built the prototype application used to demonstrate the efficiency of various
combined mining algorith ms described in the previous section is Java Standard Edition (JSE 6.0) , Net Beans
IDE and Oracle 10G Exp ress Edition. The hardware used is a PC with 2GB RAM, and 2.9x GHz processor. The
dataset used for the experiments is pertaining to banking do main.




                                   Fig. 4 – Input Screen for synthetic dataset


www.theijes.com                                       The IJES                                          Page 98
Actionable Knowledge Discovery Using Multi-Step…

         The dataset used in related to banking domain. The data min ing makes use of customers’ demographic
and also transactional data in order to find patterns and provide actionable knowledge finally.




                                 Fig. 5 – Results of mult isource combined mining

 As can be seen in fig. 5, (a) shows patterns while (b) shows actionable knowledge that is whether loan can be
                                         given to given customer or not.




                                Fig. 6 – Dynamics of Incremental Cluster Patterns

     As can be seen in fig. 6, it v isualizes the dynamics of clusters patterns in terms of confidence, support,
           contribution and impact. The PLN, DOC, DOC, REA, IES represent a series of activities.

6. Conclusion
          Enterprises generally are distributed in nature with mult iple databases, multip le servers and so on. The
data in such businesses is very comple x. Often the applications exhib it multip le features and the applications
also heterogeneous in nature. For instance the database might have details pertaining to business impact,
business appearance, service usage, behavior, preferences and demographics. Considering complex business
scenarios and requirements of business intelligence, it is time consuming to have multiple single mining
methods or features or data sources. The single piece of informat ion fro m a single min ing approach may not
reflect actual requirement of the enterprise. Therefore, we proposed a mining framework that combines multiple
data sources, mult iple methods and multiple in order to obtain mult iple pattern sets and finally the deliverables
that is actionable knowledge that helps in taking comprehensive and well informed decisions. Such combined
mining frameworks yield best actionable knowledge that can provide profits to organizations that use it. Thus
the proposed framewo rk can be used in domains like banking, insurance etc. to take effective decisions in
monetary matters. Multiple data sources, features, and methods used in the proposed framework are not entirely
new things. They are taken from existing works and customized to get the required framework for co mbined
mining. The data mining framework is applied to many real time solutions and the experiments made with
prototype application revealed that the framework is very effect ive and able to provide actionable knowledge.

www.theijes.com                                       The IJES                                              Page 99
Actionable Knowledge Discovery Using Multi-Step…

References
[1].   Longbing Cao, Senior Member, IEEE, Huaifeng Zhang, Member, IEEE, Yanchang Zhao, Member, IEEE,
       Dan Luo, and Chengqi Zhang, Senior Member, IEEE. Co mbined Mining: Discovering Informat ive
       Knowledge in Co mplex Data. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—
       PART B: CYBERNETICS, VOL. 41, NO. 3, JUNE 2011
[2].   H. Zhang, Y. Zhao, L. Cao, and C. Zhang, ―Comb ined association rule mining,‖ in Proc. PAKDD, 2008,
       pp. 1069–1074.
[3].   Y. Zhao, H. Zhang, L. Cao, C. Zhang, and H. Bohlscheid, ―Comb ined pattern mining : Fro m learned rules
       to actionable knowledge,‖ in Proc. AI, 2008, pp. 393–403.
[4].   J. Pei, J. Han, B. Mortazav i-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal, and M.-C. Hsu, ―Min ing
       sequential patterns by pattern-growth: The PrefixSpan approach,‖ IEEE Trans. Knowl. Data Eng., vol.
       16, no. 11, pp. 1424–1440, Nov. 2004.
[5].   J. Wang and G. Karypis, ―HARM ONY: Efficiently mining the best rules for classificat ion,‖ in Proc.
       SDM, 2005, pp. 205– 216.
[6].   G. Dong and J. Li, ―Efficient min ing of emerg ing patterns: Discovering trends and differences,‖ in Proc.
       KDD, 1999, pp. 43–52.
[7].   W. Fan, K. Zhang, J. Gao, X. Yan, J. Han, P. Yu, and O. Verscheure, ―Direct min ing of discriminative
       and essential graphical and itemset features via model-based search tree,‖ in Proc. KDD, 2008, pp. 230–
       238.
[8].   Y. Zhao, C. Zhang, and L. Cao, Eds., Post-Mining of Association Rules: Techniques for Effective
       Knowledge Extraction. Hershey, PA: Inf. Sci.Ref., 2009.
[9].   B. Liu, W. Hsu, and Y. Ma, ―Pruning and summarizing the discovered associations,‖ in Proc. KDD,
       1999, pp. 125– 134.

[10]. K. K. R. Hewawasam, K. Premaratne, and M.-L. Shyu, ―Ru le mining and classification in a situation
      assessment application: A belief-theoretic approach for handling data imperfect ions,‖ IEEE Trans. Syst.,
      Man, Cybern.B, Cybern., vol. 37, no. 6, pp. 1446–1459, Dec. 2007.

                                                  AUTHORS

                                                     K. Dharani is a student of
                                                     DRK co llege of Engineering
                                                     and     Technology,    Ranga
                                                     Reddy, Andhra Pradesh, India.
                                                     He has received B.Tech
                                                     degree in Co mputer Science
                                                     and Engineering and M.Tech
                                                     Degree in Co mputer Science.
                                                     Her main research interest
                                                     includes             Software
                                                     Engineering.
                                                     Kalpana     Gudikandula     is
                                                     working      as      Associate
                                                     Professor at DRK Institute of
                                                     Science & Technology, Ranga
                                                     Reddy, and Andhra Pradesh,
                                                     India. She has received
                                                     M.Tech Degree in Co mputer
                                                     Science. His Main Interest
                                                     includes Cloud Co mputing,
                                                     Software Eng ineering.




www.theijes.com                                     The IJES                                          Page 100

Weitere ähnliche Inhalte

Was ist angesagt?

New approaches of Data Mining for the Internet of things with systems: Litera...
New approaches of Data Mining for the Internet of things with systems: Litera...New approaches of Data Mining for the Internet of things with systems: Litera...
New approaches of Data Mining for the Internet of things with systems: Litera...IRJET Journal
 
IRJET- Improving the Performance of Smart Heterogeneous Big Data
IRJET- Improving the Performance of Smart Heterogeneous Big DataIRJET- Improving the Performance of Smart Heterogeneous Big Data
IRJET- Improving the Performance of Smart Heterogeneous Big DataIRJET Journal
 
Recommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceRecommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceIJDKP
 
A statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentA statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentIJDKP
 
Effective data mining for proper
Effective data mining for properEffective data mining for proper
Effective data mining for properIJDKP
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection methodIJSRD
 
A new hybrid algorithm for business intelligence recommender system
A new hybrid algorithm for business intelligence recommender systemA new hybrid algorithm for business intelligence recommender system
A new hybrid algorithm for business intelligence recommender systemIJNSA Journal
 
Mayer_R_212017705
Mayer_R_212017705Mayer_R_212017705
Mayer_R_212017705Ryno Mayer
 
6 ijaems sept-2015-6-a review of data security primitives in data mining
6 ijaems sept-2015-6-a review of data security primitives in data mining6 ijaems sept-2015-6-a review of data security primitives in data mining
6 ijaems sept-2015-6-a review of data security primitives in data miningINFOGAIN PUBLICATION
 
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...ijsrd.com
 
Study of Data Mining Methods and its Applications
Study of  Data Mining Methods and its ApplicationsStudy of  Data Mining Methods and its Applications
Study of Data Mining Methods and its ApplicationsIRJET Journal
 
Data Mining based on Hashing Technique
Data Mining based on Hashing TechniqueData Mining based on Hashing Technique
Data Mining based on Hashing Techniqueijtsrd
 
Variance rover system
Variance rover systemVariance rover system
Variance rover systemeSAT Journals
 
Variance rover system web analytics tool using data
Variance rover system web analytics tool using dataVariance rover system web analytics tool using data
Variance rover system web analytics tool using dataeSAT Publishing House
 
Ec3212561262
Ec3212561262Ec3212561262
Ec3212561262IJMER
 
Data Mining of Project Management Data: An Analysis of Applied Research Studies.
Data Mining of Project Management Data: An Analysis of Applied Research Studies.Data Mining of Project Management Data: An Analysis of Applied Research Studies.
Data Mining of Project Management Data: An Analysis of Applied Research Studies.Gurdal Ertek
 

Was ist angesagt? (18)

New approaches of Data Mining for the Internet of things with systems: Litera...
New approaches of Data Mining for the Internet of things with systems: Litera...New approaches of Data Mining for the Internet of things with systems: Litera...
New approaches of Data Mining for the Internet of things with systems: Litera...
 
IRJET- Improving the Performance of Smart Heterogeneous Big Data
IRJET- Improving the Performance of Smart Heterogeneous Big DataIRJET- Improving the Performance of Smart Heterogeneous Big Data
IRJET- Improving the Performance of Smart Heterogeneous Big Data
 
Recommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduceRecommendation system using bloom filter in mapreduce
Recommendation system using bloom filter in mapreduce
 
A statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environmentA statistical data fusion technique in virtual data integration environment
A statistical data fusion technique in virtual data integration environment
 
Effective data mining for proper
Effective data mining for properEffective data mining for proper
Effective data mining for proper
 
Introduction to feature subset selection method
Introduction to feature subset selection methodIntroduction to feature subset selection method
Introduction to feature subset selection method
 
A new hybrid algorithm for business intelligence recommender system
A new hybrid algorithm for business intelligence recommender systemA new hybrid algorithm for business intelligence recommender system
A new hybrid algorithm for business intelligence recommender system
 
Mayer_R_212017705
Mayer_R_212017705Mayer_R_212017705
Mayer_R_212017705
 
Data mining
Data miningData mining
Data mining
 
6 ijaems sept-2015-6-a review of data security primitives in data mining
6 ijaems sept-2015-6-a review of data security primitives in data mining6 ijaems sept-2015-6-a review of data security primitives in data mining
6 ijaems sept-2015-6-a review of data security primitives in data mining
 
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
Multiple Minimum Support Implementations with Dynamic Matrix Apriori Algorith...
 
Study of Data Mining Methods and its Applications
Study of  Data Mining Methods and its ApplicationsStudy of  Data Mining Methods and its Applications
Study of Data Mining Methods and its Applications
 
Data Mining based on Hashing Technique
Data Mining based on Hashing TechniqueData Mining based on Hashing Technique
Data Mining based on Hashing Technique
 
Variance rover system
Variance rover systemVariance rover system
Variance rover system
 
Variance rover system web analytics tool using data
Variance rover system web analytics tool using dataVariance rover system web analytics tool using data
Variance rover system web analytics tool using data
 
Ec3212561262
Ec3212561262Ec3212561262
Ec3212561262
 
4113ijaia09
4113ijaia094113ijaia09
4113ijaia09
 
Data Mining of Project Management Data: An Analysis of Applied Research Studies.
Data Mining of Project Management Data: An Analysis of Applied Research Studies.Data Mining of Project Management Data: An Analysis of Applied Research Studies.
Data Mining of Project Management Data: An Analysis of Applied Research Studies.
 

Andere mochten auch

The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science
The International Journal of Engineering and ScienceThe International Journal of Engineering and Science
The International Journal of Engineering and Sciencetheijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science
The International Journal of Engineering and ScienceThe International Journal of Engineering and Science
The International Journal of Engineering and Sciencetheijes
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)theijes
 
Y01201770183
Y01201770183Y01201770183
Y01201770183theijes
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)theijes
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)theijes
 

Andere mochten auch (19)

The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science
The International Journal of Engineering and ScienceThe International Journal of Engineering and Science
The International Journal of Engineering and Science
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science
The International Journal of Engineering and ScienceThe International Journal of Engineering and Science
The International Journal of Engineering and Science
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 
Y01201770183
Y01201770183Y01201770183
Y01201770183
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)The International Journal of Engineering and Science (The IJES)
The International Journal of Engineering and Science (The IJES)
 
The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)The International Journal of Engineering and Science (IJES)
The International Journal of Engineering and Science (IJES)
 

Ähnlich wie The International Journal of Engineering and Science

Mining Social Media Data for Understanding Drugs Usage
Mining Social Media Data for Understanding Drugs  UsageMining Social Media Data for Understanding Drugs  Usage
Mining Social Media Data for Understanding Drugs UsageIRJET Journal
 
McKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docxMcKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docxandreecapon
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET Journal
 
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...Editor IJMTER
 
Predictive Modelling Analytics through Data Mining
Predictive Modelling Analytics through Data MiningPredictive Modelling Analytics through Data Mining
Predictive Modelling Analytics through Data MiningIRJET Journal
 
Extract Business Process Performance using Data Mining
Extract Business Process Performance using Data MiningExtract Business Process Performance using Data Mining
Extract Business Process Performance using Data MiningIJERA Editor
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data MiningIOSR Journals
 
TTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining TechniqueTTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining TechniqueMehmet Beyaz
 
The Survey of Data Mining Applications And Feature Scope
The Survey of Data Mining Applications  And Feature Scope The Survey of Data Mining Applications  And Feature Scope
The Survey of Data Mining Applications And Feature Scope IJCSEIT Journal
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective ApproachIRJET Journal
 
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEIJDKP
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...theijes
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applicationsSubrat Swain
 
Data modeling techniques used for big data in enterprise networks
Data modeling techniques used for big data in enterprise networksData modeling techniques used for big data in enterprise networks
Data modeling techniques used for big data in enterprise networksDr. Richard Otieno
 
Mining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce FrameworkMining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce FrameworkIRJET Journal
 
Combined mining approach to generate patterns for complex data
Combined mining approach to generate patterns for complex dataCombined mining approach to generate patterns for complex data
Combined mining approach to generate patterns for complex datacsandit
 
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATACOMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATAcscpconf
 

Ähnlich wie The International Journal of Engineering and Science (20)

F035431037
F035431037F035431037
F035431037
 
Mining Social Media Data for Understanding Drugs Usage
Mining Social Media Data for Understanding Drugs  UsageMining Social Media Data for Understanding Drugs  Usage
Mining Social Media Data for Understanding Drugs Usage
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
 
McKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docxMcKinsey Global Institute Big data The next frontier for innova.docx
McKinsey Global Institute Big data The next frontier for innova.docx
 
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...IRJET-  	  Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
IRJET- Improved Model for Big Data Analytics using Dynamic Multi-Swarm Op...
 
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
PERFORMING DATA MINING IN (SRMS) THROUGH VERTICAL APPROACH WITH ASSOCIATION R...
 
Predictive Modelling Analytics through Data Mining
Predictive Modelling Analytics through Data MiningPredictive Modelling Analytics through Data Mining
Predictive Modelling Analytics through Data Mining
 
Extract Business Process Performance using Data Mining
Extract Business Process Performance using Data MiningExtract Business Process Performance using Data Mining
Extract Business Process Performance using Data Mining
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data Mining
 
TTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining TechniqueTTG Int.LTD Data Mining Technique
TTG Int.LTD Data Mining Technique
 
The Survey of Data Mining Applications And Feature Scope
The Survey of Data Mining Applications  And Feature Scope The Survey of Data Mining Applications  And Feature Scope
The Survey of Data Mining Applications And Feature Scope
 
Data Mining – A Perspective Approach
Data Mining – A Perspective ApproachData Mining – A Perspective Approach
Data Mining – A Perspective Approach
 
I1802055259
I1802055259I1802055259
I1802055259
 
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
 
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
Applying K-Means Clustering Algorithm to Discover Knowledge from Insurance Da...
 
Fundamentals of data mining and its applications
Fundamentals of data mining and its applicationsFundamentals of data mining and its applications
Fundamentals of data mining and its applications
 
Data modeling techniques used for big data in enterprise networks
Data modeling techniques used for big data in enterprise networksData modeling techniques used for big data in enterprise networks
Data modeling techniques used for big data in enterprise networks
 
Mining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce FrameworkMining High Utility Patterns in Large Databases using Mapreduce Framework
Mining High Utility Patterns in Large Databases using Mapreduce Framework
 
Combined mining approach to generate patterns for complex data
Combined mining approach to generate patterns for complex dataCombined mining approach to generate patterns for complex data
Combined mining approach to generate patterns for complex data
 
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATACOMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
COMBINED MINING APPROACH TO GENERATE PATTERNS FOR COMPLEX DATA
 

The International Journal of Engineering and Science

  • 1. The International Journal of Engineering And Science (IJES) ||Volume|| 1 ||Issue|| 1 ||Pages|| 95-100 ||2012|| ISSN: 2319 – 1813 ISBN: 2319 – 1805 Actionable Knowledge Discovery using Multi-Step Mining 1 DharaniK,2Kalpana Gudikandula 1 Department of CS, JNTU H, DRK College of Engineering and Technology Hyderabad, Andhra Pradesh, India 2 Department of IT, JNTU H, DRK Institute of Science and Technology Hyderabad, Andhra Pradesh, India --------------------------------------------------------Abstract------------------------------------------------------------------- Data mining is a process of obtaining trends or patterns in historical data. Such trends form business intelligence that in turn leads to taking well informed decisions. However, data min ing with a single technique does not yield actionable knowledge. Th is is because enterprises have huge databases and heterogeneous in nature. They also have complex data and min ing such data needs multi-step min ing instead of single step mining. When mu ltiple approaches are involved, they provide business intelligence in all aspects. That kind of information can lead to actionable knowledge. Recently data min ing has got tremendous usage in the real world. Th e drawback of existing approaches is that insufficient business intelligence in case of huge enterprises. This paper presents the combination of existing works and algorithms. We work on multiple data sources, multip le methods and mu ltip le features. The combined patterns thus obtained from co mplex business data provide actionable knowledge. A prototype application has been built to test the efficiency of the proposed framework wh ich combines mult iple data sources, mu ltiple methods and multip le features in min ing process. The empirical results revealed that the proposed approach is effective and can be used in the real world. Index Terms : Data mining, act ionable knowledge discovery, multi-method mining, mu lti-feature min ing, mu lti-source mining -------------------------------------------------------------------------------------------------------------------------------- ----- Date of Submission: 19, November, 2012 Date of Publication: 30, November 2012 -------------------------------------------------------------------------------------------------------------------------------------- 1. Introduction Data mining at enterprise level operates on huge amount of data such as government transactions, banks, insurance companies and so on. Inevitably, these businesses produce complex data that might be distributed in nature. When mining is made on such data with a single-step, it produces business intelligence as a particular aspect. However, this is not sufficient in enterprise where different aspects and standpoints are to be considered before taking business decisions. It is required that the enterprises p erform min ing based on multiple features, data sources and methods. This is known as combined mining. The co mb ined min ing can produce patterns that reflect all aspects of the enterprise. Thus the derived intelligence can be used to take business decisions that lead to profits. This kind of knowledge is known as actionable knowledge. The act ionable knowledge is discovered through multip le data sources, multip le methods and multip le features. The intelligence thus obtained is dependable and reliable. As businesses are growing in complex data, there is a need for mining on complex data. However, it is challenging to discover actionable knowledge using complex data sources. This is because the generated business intelligence must be comprehensive and informat ion that provides enough knowledge to take enterprise business decisions. There are many traditional methods being used in data min ing. Therefore it is very challenging to comb ine them and use them to generate actionable knowledge. The challenges in the mult i min ing process can be categorized into multip le data sources, mu ltiple methods, mu ltiple features, post analysis and mining, joining mu ltip le relational database and data sampling as well. Data sampling is generally not acceptable in real world data mining applications. Due to space and time limits combining mu ltip le tables or joining them may not be possible. As data mining methods are developed for various data sources keeping some assumptions in mind, it is challenging to combine them for discovering actionable knowledge in real t ime applications. Co mbined association rules, comb ined rule clusters and combined rule pairs concepts areproposed in [1], [22] and [25]. These papers exposed the mining on comp lex data with respect to mult iple data sets and obtaining comprehensive knowledge. Co mbined association rule is nothing but a set of www.theijes.com The IJES Page 95
  • 2. Actionable Knowledge Discovery Using Multi-Step… mu ltip le item sets that are heterogeneous in nature. Co mb ined rules pairs are nothing but derived fro m co mbined association rules by combining ru le clusters. The comb ined rule pairs are also derived fro m co mb ined rules only. Traditional algorith ms such as FPGro wth [15] can’t be used to derive combined association rules. This paper aims at presenting a new comprehensive framework for co mbined min ing. As a matter of fact, this p aper makes use of existing methods or techniques as part of the framework. Therefore it integrates mu lti source combined min ing, mu lti-method combined mining, and multi-feature co mbined mining. Multip le features might include demographics of customer, behavior, business impacts and also transactional data. Multi method might include clustering, classificat ion and so on. Multiple data sources do mean that the mining process takes data fro m mu ltiple related data sources. The deliverables of the proposed framewo rk co mb ined patterns or combinedassociation rules. The fo llo wing are the general aspects of comb ined min ing.  Co mbined patterns generated by using various features can reflect the characteristics of enterprise closely and such business intelligence is always reliable.  The combined min ing with respect to multip le data sources can give patterns that cover many aspects of the enterprise as data is taken from different data sources.  Multiple methods mining results in patterns that reflect the in depth nature of data and also the advantages provided by various mining algorith ms are with this kind of co mb ined min ing.  Metrics are of mult iple interestingness in nature can be applied to generated patterns that can verify the significance of generated patterns. Instead of providing a specific algorith m and get results pertaining to that algorith m, it is good idea to combine them and get more accurate and actionable knowledge. The contributions of this paper include the concept of combined mining that is based on the existing works. It discusses the framework and various combined min ing such as multi-source combined mining, mult i- method comb ined mining, and mu lti-feature co mbined mining. Provides various techniques for pattern interaction and novel patterns such as clustered patterns, combined patterns etc. Interestingness metrics are evaluated. Discovering practically the combined patterns or actionable knowledge from real world businesses such as banks. 2. Related Work Mining is a process of extracting trends or patterns from historical data. These trends or patterns can provide business intelligence that leads to actionable knowledge. There are many data mining methods or algorith ms that exist for mining data to get patterns. However, all the existing algorithms are single-step mining algorith ms. This means that they provide business intelligence inadequately. They may not be able to reflect the complex needs of an enterprise to take decisions correctly. When multiple data mining techniques are combined it is possible to get actionable knowledge that can cater to the needs of an enterprise. In this paper comb ining mining algorith ms [1] have been implemented using a prototype application that demonstrates the efficiency of combined min ing. The combined actionable knowledge can’t be provided by existing algorithms such as FPGrowth [15]. The existing works on data mining operations on complex data or enterprise generally of different types such as direct mining approaches; post min ing of patterns; data sets with extra feat ures; mu ltiple methods integration; and also joining mult iple relational tables. Harmony [19] proposed an approach to mine for discriminative patterns. Other such experiments include contrast patterns [5], model based search tree [6]. These algorith ms attempted to use multip le features or mining techniques. Comb ined mining is best used to provide actionable knowledge in spite of comp lex data sets, and features.There are four categories of combined mining approaches in literature. A common ly used approach [28] is the post mining or post analysis of obtained patterns. It is best used to prune the rules obtained after mining database or reducing redundancy or even summarizing the patterns obtained [12]. In [1] co mbined min ing proposed contain direct mining methods. In [1 mu ltisource combined mining, mu lti-method combined mining and multi-feature comb ined mining were introduced. These algorithms are used in this paper and implemented in the prototype application that is used to demonstrate the efficiency of the combined mining. In these approaches also post mining of patterns is used when it is necessary. Multisource combined mining considers data of various natures. It does mean that it combines mu ltiple data sets as it is required by an enterprise which has data of different kinds. The result of mu ltisource data mining is the actionable knowledge that reflects different angles of an enterprise. Multi-feature data mining is used to have data with multip le features. This kind of min ing also leads to actionable knowledge that reflects the comp lex needs of an enterprise. Here mu ltip le features are used. For instance the prototype application in this paper has been implemented using banking data containing customers’ demographic data and transactional data. Multi-method combined mining mixes up many existing data mining techniques that are very useful independently for a particular purpose only. However, in mu lti-method combined www.theijes.com The IJES Page 96
  • 3. Actionable Knowledge Discovery Using Multi-Step… mining various min ing techniques are combined to obtain actionable knowledge. For instance apriori and ID3 can be combined. The aprio ri algorith m provides association rules wh ile the ID3 is meant for p roviding decision tree. The co mbination of these two can provide actionable knowledge that helps in taking well informed decisions. Other data mining algorith ms that are existed and can be used in combination are clustering, rarity mining, regression, association rule mining, sequence classification [15], rule based mining [7] etc. W ith the help of combined min ing sequential and cluster patterns can be used further. It does mean that the result of one method can be used in another method and the process can be repeated if required. It is also possible to have a chain of mining operations until the desired actionable knowledge is derived. 3. PROPOSED MINING FRAMEWORK The proposed framework co mbines the existing algorith ms or methods to integrate multiple features, mu ltip le data sources and multiple min ing methods like classification. The co mbined mining can be done with mu ltip le features, mult iple data sources and multip le methods. The architecture of the proposed framework is shown in fig. 1 which is co mmon framework for all these combined mining approaches. Fig. 1 – A rchitecture of proposed min ing framework [1] The common arch itecture for mu lti-feature mining, mu lti-method mining and multi-source mining are described here. There are multip le min ing approaches that make use of given data source (s) and perform one or more min ing algorith ms or methods. Each mining approach produces a set of patterns. All the patterns obtained fro m mu ltip le mining approaches are merged together. This is done by a component known as pattern merger. Pattern merger is a program that combines all pattern sets obtained from various min ing techniques. The result of pattern merging process is the final deliverables. The deliverables are nothing but actionable knowledge that facilitates comprehensive decision making. Th is framework is useful to all enterprises where there is comp lexity in business data and business intelligence has to cover many aspects of that business. Especially this framewo rk is suitable for do mains like banking, insurance where monetary transactions are stored and maintained. The historical data has to be mined and right decisions are to be made in case of a ll monetary decisions such as issuing loans and other customer centric services. The algorith ms of [1] are used for the experiments made using a prototype application. The ensuing sections present algorith ms and the experimental results. 4. Algorithms This section provides description and algorithms for mu ltisource combined mining, mult i-feature combined mining and mult i-method combined mining. 4.1 Multisource Combined Mi ning The combination of mult iple data sources (D): The comb ined pattern set P consists o f mult iple ato mic patterns identified in several data sources. By mining mu ltiple data sources, combined patterns are generated which reflect mu ltip le aspects of nature across the business lines. www.theijes.com The IJES Page 97
  • 4. Actionable Knowledge Discovery Using Multi-Step… Fig. 2 – A lgorith m for mu ltisource co mbined mining 4.2 Multi-Feature Combined Mini ng The combination of multip le features (F): The combined pattern set involves multiple features, namely, e.g., features of customer demographics and behavior. By involv ing mu ltiple heterogeneous features, combined patterns are generated which reflect mult iple aspects of concerns and characteristics in businesses. Fig. 3 – A lgorith m for mu lti-feature co mb ined min ing 4.3 Multi-method Combi ned Mining Multi-method comb ined mining is another approach to discover more informat ive knowledge in complex data. The focus of mult i-method combined min ing is on combining mu ltiple data mining algorithms as needed in order to generate more informat ive knowledge. The co mbination of mu lt iple methods (R): The patterns in the combined set reflect the results mined by multip le data min ing methods like association mining and classification. By applying mu ltiple methods in pattern min ing, combined patterns are generated which disclose a deep and comprehensive essence of data by taking advantage of different methods. The algorithms used for multi-method combined mining are Apriori Algorith m and ID3 Algorithm. These two algorithms are well known and existing algorith ms in data mining do main. 5. Experimental Results The environment used to built the prototype application used to demonstrate the efficiency of various combined mining algorith ms described in the previous section is Java Standard Edition (JSE 6.0) , Net Beans IDE and Oracle 10G Exp ress Edition. The hardware used is a PC with 2GB RAM, and 2.9x GHz processor. The dataset used for the experiments is pertaining to banking do main. Fig. 4 – Input Screen for synthetic dataset www.theijes.com The IJES Page 98
  • 5. Actionable Knowledge Discovery Using Multi-Step… The dataset used in related to banking domain. The data min ing makes use of customers’ demographic and also transactional data in order to find patterns and provide actionable knowledge finally. Fig. 5 – Results of mult isource combined mining As can be seen in fig. 5, (a) shows patterns while (b) shows actionable knowledge that is whether loan can be given to given customer or not. Fig. 6 – Dynamics of Incremental Cluster Patterns As can be seen in fig. 6, it v isualizes the dynamics of clusters patterns in terms of confidence, support, contribution and impact. The PLN, DOC, DOC, REA, IES represent a series of activities. 6. Conclusion Enterprises generally are distributed in nature with mult iple databases, multip le servers and so on. The data in such businesses is very comple x. Often the applications exhib it multip le features and the applications also heterogeneous in nature. For instance the database might have details pertaining to business impact, business appearance, service usage, behavior, preferences and demographics. Considering complex business scenarios and requirements of business intelligence, it is time consuming to have multiple single mining methods or features or data sources. The single piece of informat ion fro m a single min ing approach may not reflect actual requirement of the enterprise. Therefore, we proposed a mining framework that combines multiple data sources, mult iple methods and multiple in order to obtain mult iple pattern sets and finally the deliverables that is actionable knowledge that helps in taking comprehensive and well informed decisions. Such combined mining frameworks yield best actionable knowledge that can provide profits to organizations that use it. Thus the proposed framewo rk can be used in domains like banking, insurance etc. to take effective decisions in monetary matters. Multiple data sources, features, and methods used in the proposed framework are not entirely new things. They are taken from existing works and customized to get the required framework for co mbined mining. The data mining framework is applied to many real time solutions and the experiments made with prototype application revealed that the framework is very effect ive and able to provide actionable knowledge. www.theijes.com The IJES Page 99
  • 6. Actionable Knowledge Discovery Using Multi-Step… References [1]. Longbing Cao, Senior Member, IEEE, Huaifeng Zhang, Member, IEEE, Yanchang Zhao, Member, IEEE, Dan Luo, and Chengqi Zhang, Senior Member, IEEE. Co mbined Mining: Discovering Informat ive Knowledge in Co mplex Data. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS— PART B: CYBERNETICS, VOL. 41, NO. 3, JUNE 2011 [2]. H. Zhang, Y. Zhao, L. Cao, and C. Zhang, ―Comb ined association rule mining,‖ in Proc. PAKDD, 2008, pp. 1069–1074. [3]. Y. Zhao, H. Zhang, L. Cao, C. Zhang, and H. Bohlscheid, ―Comb ined pattern mining : Fro m learned rules to actionable knowledge,‖ in Proc. AI, 2008, pp. 393–403. [4]. J. Pei, J. Han, B. Mortazav i-Asl, J. Wang, H. Pinto, Q. Chen, U. Dayal, and M.-C. Hsu, ―Min ing sequential patterns by pattern-growth: The PrefixSpan approach,‖ IEEE Trans. Knowl. Data Eng., vol. 16, no. 11, pp. 1424–1440, Nov. 2004. [5]. J. Wang and G. Karypis, ―HARM ONY: Efficiently mining the best rules for classificat ion,‖ in Proc. SDM, 2005, pp. 205– 216. [6]. G. Dong and J. Li, ―Efficient min ing of emerg ing patterns: Discovering trends and differences,‖ in Proc. KDD, 1999, pp. 43–52. [7]. W. Fan, K. Zhang, J. Gao, X. Yan, J. Han, P. Yu, and O. Verscheure, ―Direct min ing of discriminative and essential graphical and itemset features via model-based search tree,‖ in Proc. KDD, 2008, pp. 230– 238. [8]. Y. Zhao, C. Zhang, and L. Cao, Eds., Post-Mining of Association Rules: Techniques for Effective Knowledge Extraction. Hershey, PA: Inf. Sci.Ref., 2009. [9]. B. Liu, W. Hsu, and Y. Ma, ―Pruning and summarizing the discovered associations,‖ in Proc. KDD, 1999, pp. 125– 134. [10]. K. K. R. Hewawasam, K. Premaratne, and M.-L. Shyu, ―Ru le mining and classification in a situation assessment application: A belief-theoretic approach for handling data imperfect ions,‖ IEEE Trans. Syst., Man, Cybern.B, Cybern., vol. 37, no. 6, pp. 1446–1459, Dec. 2007. AUTHORS K. Dharani is a student of DRK co llege of Engineering and Technology, Ranga Reddy, Andhra Pradesh, India. He has received B.Tech degree in Co mputer Science and Engineering and M.Tech Degree in Co mputer Science. Her main research interest includes Software Engineering. Kalpana Gudikandula is working as Associate Professor at DRK Institute of Science & Technology, Ranga Reddy, and Andhra Pradesh, India. She has received M.Tech Degree in Co mputer Science. His Main Interest includes Cloud Co mputing, Software Eng ineering. www.theijes.com The IJES Page 100