SlideShare ist ein Scribd-Unternehmen logo
1 von 13
Downloaden Sie, um offline zu lesen
Ertek, G., Kuruca, C., Aydin, C., Erel, B.F., Dogan, H., Duman, M., Ocal, M., and Ok, Z.D. (2004).
"Visual and analytical mining of sales transaction data for production planning and marketing.”
4th International Symposium on Intelligent Manufacturing Systems, Sakarya, Turkey.

Note: This is the final draft version of this paper. Please cite this paper (or this final draft) as
above. You can download this final draft from http://research.sabanciuniv.edu.




                Visual and analytical mining of transactions data

                for production planning for production planning

                                         and marketing

                               Gurdal Ertek, Can Kuruca, Cenk Aydin,

                         Besim Ferit Erel, Harun Dogan, Mustafa Duman,

                                    Mete Ocal, Zeynep Damla Ok




                                          Sabanci University

                                           Istanbul, Turkey
Proceedings of 4th International Symposium on
Intelligent Manufacturing Systems, September 6-8, 2004: 848-859
©Sakarya University, Department of Industrial Engineering



      Visual and analytical mining of sales transaction data for production planning and
                                          marketing

 Gurdal Ertek*, Can Kuruca, Cenk Aydin, Besim Ferit Erel, Harun Dogan, Mustafa Duman, Mete
                                  Ocal, Zeynep Damla Ok



                                                  (*) Corresponding author

                   Sabanci University, Faculty of Engineering and Natural Sciences,
                               Orhanli, Tuzla, 34956, Istanbul, Turkey

                                                 Tel: +90(216)483-9568
                                                 Fax: +90(216)483-9550
                                             Email: ertekg@sabanciuniv.edu




                                                                  848
Visual and analytical mining of transactions data for production planning and marketing

 Gurdal Ertek, Can Kuruca, Cenk Aydin, Besim Ferit Erel, Harun Dogan, Mustafa Duman, Mete
                                 Ocal, Zeynep Damla Ok



Abstract

Recent developments in information technology paved the way for the collection of large
amounts of data pertaining to various aspects of an enterprise. The greatest challenge faced in
processing these massive amounts of raw data gathered turns out to be the effective
management of data with the ultimate purpose of deriving necessary and meaningful information
out of it. The following paper presents an attempt to illustrate the combination of visual and
analytical data mining techniques for planning of marketing and production activities. The
primary phases of the proposed framework consist of filtering, clustering and comparison steps
implemented using interactive pie charts, K-Means algorithm and parallel coordinate plots
respectively. A prototype decision support system is developed and a sample analysis session is
conducted to demonstrate the applicability of the framework.

Submission areas: Decision support systems, data mining, information technologies




                                              849
Introduction

Widespread use of information technology has resulted in massive collections of data regarding
most aspects of an enterprise. The amount of data on the marketing side has exploded due to
widespread usage of barcode systems, accounting and Enterprise Resorce Planning (ERP)
software and also due to collection of Business-to-Consumer (B2C) and Business-to-Business
(B2B) electronic commerce data. The amount of data that comes from manufacturing processes
has also exploded, due to application of Computer Integrated Manufacturing (CIM) systems,
barcode and radio frequency technology, which provide bulky amounts of real time data.

Effective collection, management, reporting, interpretive analysis and mining of enterprise data
can help in establishing effective control of manufacturing activities, achieving effective
production planning and increased sales, and consequently increasing the firm’s profitability.
Data mining can also serve the purpose of increasing customer satisfaction by offering and
timely delivering them products that they are willing to purchase. Keeping existing customers is
typically much more profitable than trying to acquire new customers. This observation is indeed
the underlying concept of Customer Relationship Management (CRM) systems (Shaw et al.,
2001).

As suggested earlier, data from various areas of an enterprise is now widely available; however,
in this paper, only sales transactions data is considered. The reason for choosing this particular
data type is that sales transactions data is collected and archived in almost every firm and it is
actually the essential input to two very critical aspects of enterprise planning, namely marketing
and production planning. A framework for the analysis of this type of data is proposed and
implemented in the software developed, namely CuReMa. The main contribution of the
framework and the prototype software is the integration of visual and analytical data mining
techniques for marketing and production planning. Kreuseler and Schumann (2002) present a
similar approach combining visual and analytical data mining techniques without focusing on
particular enterprise data.

The paper starts by offering a brief review of the literature concerned with visual and analytical
data mining techniques. In the following sections, we present the framework we propose and
explain how it is implemented in CuReMa. Before concluding, the applicability of the proposed
framework is demonstrated with a sample analysis session with the software.

Literature Review

Analytical data mining techniques are widely used and implemented (Han and Kamber, 2001). In
recent years, visual mining of data has also gained importance. The traditional exploratory
graphical data analysis methods such as scatter plots, box plots (Chambers et al., 1983) have
been enriched with a wide array of new visual representations and methods (de Oliveira and
Levkowitz, 2003). The field of computer science involved in such representations and methods is
referred to as information visualization. The number of journal articles in information visualization
has shown significant increase from 1990’s to present (Chen, 2002), indicating that the field is a
promising branch of computer science.

Work related to this paper can be grouped in two categories, based on their scope:
  a) Data mining for marketing:



                                                850
Shaw et al. (2001) provide a recent survey of data mining applications for marketing.
       They present a taxonomy of data mining tasks and provide five broad categories:
       Dependency analysis, class identification, concept description, deviation detection, and
       data visualization. Our study suggests a framework that relates to all but the first of these
       categories.
       Keim et al. (2002) develop a visualization technique named “pixel bar charts” for analysis
       of very large multi-attribute data sets and demonstrate applicability of their approach by
       analyzing real-world e-commerce data sets.
    b) Data mining for manufacturing:
       Applications of information visualization in the domain of manufacturing include data
       representation for engineering design and data analysis for predicting product failure
       rates (Spence, 2001, p28—30 and p60—61, respectively).
       Dabbas and Chen (2001) present an integrated relational database approach for
       semiconductor wafer fabrication, which resulted in improved manufacturing performance.
       They describe how they combined multiple data sources and various reports under the
       integrated approach.

Data mining tools that can handle fairly large amounts of data and allow derivation of insights
are numerous. Spotfire, Miner3D and XLMiner1 are among successful commercial products that
can be used to analyze data from a variety of domains. Software libraries which allow building
customized visual interfaces to domain-specific applications are also available: For instance,
Eick (2000) presents ADVIZOR as “a flexible software environment for visual information
discovery” that allows creating visual query and analysis applications. Our implementation has
some similarities to these products: For example, one of the similarities between CuReMa,
Spotfire and Miner3D is that all of them allow the user to conduct a query on products or
customers within a given range through the use of sliders.

Proposed Framework

An approach that integrates visual data mining methods with analytical methods for mining sales
transaction data is proposed to perform the three critical functions listed below, and to answer
the given questions as well as many other unlisted ones:

1) Filtering

This initial step includes filtering of data in different dimensions and displaying and analyzing the
filtered data. In this stage of the analysis, the products or the customers within a cluster or a
group are represented with pie charts. This representation allows the identification of significant
items and outliers in the cluster or the group. The answers to the following questions can be
obtained as a result of filtering:
     • What are the total sales of top-selling n products for the top-purchasing c customers
         within time interval (t1, t2) and what is the share of each product?
     • What are the total sales of n slowest-moving products for a given set of customers
         (selected from a list sorted based on sales) within time interval (t1, t2) and what is the
         share of each product?
     • How much sales were generated for each product represented in the pie chart?


1
 The information regarding these products can be found on http://www.spotfire.com,
http://www.miner3d.com, and http://www.resample.com/xlminer/ respectively.



                                                 851
•   What are the total purchases of top-purchasing c customers from a given set of products
         (selected from a list sorted based on sales) within time interval (t1, t2), and what is the
         share of each customer?
     •   How many purchases were made by each of the customers represented in the pie chart?

2) Clustering

The second step involves clustering products and customers with respect to selected measures
and generating related reports. This phase provides the answer for the questions such as:
    • How can the products be grouped with respect to seasonal sales patterns?
    • How can the market be segmented with respect to seasonal purchasing patterns of
       customers?
Many other useful statistics about each cluster may be acquired after clustering. These statistics
include the revenue generated by each product, the product with the highest sales level, and the
product purchased by the greatest number of customers.

3) Comparison

Finally, clusters are compared with respect to selected measures. One of the questions
answered through comparison would be the following:
   • How do the products differ from each other with respect to seasonal sales patterns? Are
        there significant differences among given clusters?

The questions above focus on customer and product clusters, yet CuReMa can answer the
same questions based on customer and product groups, which could come with the dataset. The
sample analysis session will demonstrate how some of these questions are answered using
CuReMa.

An Implementation of the Framework: CuReMa

The data mining framework has been implemented as a prototype decision support system to
demonstrate the viability of the proposed approach. Real world data from a regional distributor of
automotive spare parts, covering approximately 18 months of sales transactions, has been
analyzed with the developed software. The software, named CuReMa after Customer
Relationship Management, allows mining of the dataset for marketing and production planning
purposes.

The implementation has been done in Java programming language (Doke et al., 2002) using the
Eclipse Integrated Development Environment (IDE)2 under Microsoft Windows operating system.
CuReMa was built based on a 3-tier design, separating data access, interface and business
classes (Doke et at. 2002). MySQL is employed as the database server and MySQL Control
Center3 is used in constructing and maintaining the database. Java and MySQL were selected
primarily due to their platform independence, which allows porting the developed system to
various operating systems, such as Linux and MacOS. Java language also has the advantages
of being purely object oriented and having extensive libraries that allow rapid prototyping. Being
interpreted at runtime -as opposed to being executed as native code- makes Java programs run


2
    The information on Eclipse IDE can be retrieved from http://www.eclipse.org
3
    MySQL Control Center can be accessed through http://www.mysql.com/products/mysqlcc/



                                                 852
slower than programs written in some other languages. However, implementation of distributed
shared memory and remote method invocation (not implemented in CuReMa) can enable
programmers to build scalable Java programs (Kielmann et al., 2001).

The relational database in MySQL contains a number of tables. The table “transactions”, (shown
in Figure 1) contains the sales transactions data and forms the source of all the other tables.




       Figure 1. Snapshot of the database, illustrating the fields of the “transactions” table

We now present how each function proposed in the framework is implemented:

Filtering: Interactive Visual Querying

In CuReMa, the user can perform visual query of the dataset using pull down menus and sliders.
The sliders enable filtering by time, by customer and by product name. The customers and the
products are sorted in decreasing order of total sales from the left to the right. Positions of the
sliders are translated into SQL (Structured Query Language) (Elmasri and Navathe, 1994, Ch7)
statements and reflected on the pie charts in real time. One such statement (that corresponds to
the filtering illustrated in Figure 2) is the following:

SELECT ItemName AS NAME,
SUM(TOTAL) AS TOT FROM deneme3.reference
WHERE DATE <='2003.6.6' AND DATE >= '2003.3.5'
AND CustomerNo>= 1 AND 173 >= CustomerNo
AND 1 <= ItemNo AND 259 >= ItemNo
GROUP BY NAME ORDER BY TOT DESC

Pie charts display the share of each customer/product within the selected ranges.

Clustering: Analytical Data Mining

The analytical data mining method implemented in CuReMa is K-Means clustering (Han and
Kamber, 2001, Ch8). This method starts with a set of entities, accompanied with given attribute



                                                853
values, and proceeds by clustering the entities into a specified number of clusters. The attribute
values are the average sales in each month of the year.

K- Means algorithm initially selects random values as the K-cluster means. Then, it examines
each object pertaining to the dataset and assigns the object to the closest cluster possible,
whose proximity is measured by the aggregate distance between the cluster mean and the
object's actual values. An epoch consists of N iterations where N is the number of elements in
the dataset. As the new objects are added to the clusters for various epochs, the means are
dynamically updated. If the total number of moves of all data objects from a cluster to another is
zero at the end of an epoch, the algorithm ceases and the partitioning is completed.

The algorithm may end up with fewer clusters than specified: For instance, while exploration of
the automotive spare parts dataset in CuReMa, it has been observed that a customer clustering
run with a desired number of 7 clusters resulted in only 3 clusters.

Comparison: Visual Data Mining

Parallel coordinate plots (Inselberg and Dimsdale, 1990) are used for comparison of
product/customer clusters, and are implemented in Mondrian and XMDV4 software. Parallel
coordinate plot “maps a k-dimensional data or object space onto the 2D display by drawing k
equally spaced axes in parallel” (de Oliveira and Levkowitz, 2003). In the plot, each axis
corresponds to an attribute, and each line corresponds to an element of the dataset. In CuReMa,
customer purchases and product sales can be compared in a parallel coordinate plot based on
monthly sales.

Sample Analysis Session

In this section we present how the user can interact with CuReMa in an analysis session.




4
 Mondrian and XMDV can be accessed from http://www.theusrus.de/Mondrian/index.html and
http://davis.wpi.edu/~xmdv/ respectively.



                                                854
Figure 2. Filtering of products




Figure 3. Clustering of products



              855
The analysis begins with Filtering, where the user selects slicing based on products from the
pull-down menus in the Graph Frame (Figure 2). The user selects the data for the last 3 months
for all customers (with sales in the last 3 months) and top-selling 100 products from the sliders.
The top-selling product is observed to be “MOT.YAG” (motor oil) by selecting the largest slice in
the pie chart.

Next, the user performs clustering of the products from the previous analysis by clicking the
“Cluster” button on the Graph Frame (Figure 3). Since products are displayed, the check box
next to Product is selected. The user also selects clustering criteria, number of clusters, and the
time range.

Clustering is completed in approximately 2 minutes on a PC with Intel Pentium IIIE 933 MHz
processor and 384 MB of RAM. Then the user selects the item named
“Top100ProductsInLast3Months” in the Query Frame, “Parallel Coordinate Plot” from Graphics
menu and clicks “Draw Charts” button to obtain the plot in Figure 4. In this plot, products in
clusters 2 and 3 are observed to exhibit almost constant sales throughout the selected months
whereas products in cluster 1 have the largest amount of sales, but also a greater variance as
compared to others. A report can be generated inside Reports Frame by clicking “Generate
Report” button in the left bottom corner.




                            Figure 4. Comparison of product clusters

The user can also analyze customers, clustering them into similar-behaving groups. Figure 5
offers the result of one such clustering. In this stage of the analysis, all the customers are
selected at all the times, and clustering is performed. It can be observed that cluster 1 shows




                                                856
stable sales, with minimal variability. Trying to increase the sales volume to these customers can
be a profitable strategy, since these customers do not cause big fluctuations in production
schedules. Clusters 2 and 3 are observed to have very high levels of sales in March, while
clusters 0 and 4 exhibit high sales volumes in August and September respectively. These
fluctuations should be further investigated by examining the reports generated and by filtering
the sales transaction data for these months.




                           Figure 5. Comparison of customer clusters

Conclusions and Future Work

In this paper a framework is presented for the analysis of sales transaction data using visual and
analytical data mining techniques. The framework suggests applying filtering, clustering and
comparison through interactive pie charts, K-Means method and parallel coordinate plots,
respectively. The framework is implemented in a software to demonstrate how the analysis can
be carried out. A sample session is exhibited.

The framework proposed in this paper can be extended to answer a greater range of questions.
Visual metaphors reported in information visualization survey papers (e.g. de Oliveira and
Levkowitz, 2003) can be adapted to the current framework, allowing observation and query of
other aspects regarding the data. From an analytical point of view, other data mining techniques
such as deriving association rules for revealing patterns (Changchien and Lu, 2001) and
performing cross-tabulation analysis (Verhoef et al., 2002) can be made part of the framework
and implemented in CuReMa.




                                               857
CuReMa can be tested and used with a variety of transactional datasets coming from different
industries (e.g. from an e-commerce website) and the scope of the program can be broadened
to include other data fields, such as ages and income levels of customers, and marginal profits
of the products. On the methodological side, clustering algorithms other than K-Means can be
built into the software and their performance can be compared to the performance of K-Means.
One such method is SOM (Self Organized Maps), which is implemented in coherence within a
visual data mining framework by Kreuseler and Schumann (2002). The clustering function in the
software can be extended by taking into consideration other attributes besides monthly sales
averages. One such attribute can be recency of purchases by customers and by products:
Verhoef et al. (2002) report that recency and frequency of purchases are among the most
popular variables used for clustering while carrying out market segmentation analysis.

Data analyzed can have domain-specific considerations. For example, in many countries around
the world, inflation is a real-world fact that cannot be neglected if the analyst attempts to perform
a valid and insightful analysis. Adding tables into the database that keep track of the inflation
indices on a monthly basis and adjusting calculations based on these indices would be pretty
valuable for the dataset analyzed in such a study. Another domain-specific consideration would
be the effects of holidays. One could take into account religious holidays in Turkey which turn
out to be around ten days earlier from the previous year. This and similar considerations pose
significant challenges.

One possible area of future work is to investigate the implemented software from the point of
view of human-computer interaction (Shneiderman, 1998). One can investigate the performance
of various visual components and their arrangements by formal usability tests on groups of users
and analyze results.

Acknowledgement

The authors would like to thank Ender Yalcin for providing the sales transaction data and for
explaining the processes involved in decision making. The authors would also like to thank Selim
Balcisoy, Pelin Gulsah Canbolat and the anonymous referees for their valuable suggestions and
remarks.

References

Chambers, J. M., Cleveland, W. S., Kleiner, B. and Tukey, P. A. (1983) Graphical Methods for
Data Analysis, Wadsworth & Brooks/Cole, Pacific Grove, California.

Changchien, S. W. and Lu, T. (2001) Mining association rules procedure to support on-line
recommendation by customers and products fragmentation. Expert Systems with Applications,
20, 325--335.

Chen, C. (2002) Information visualization. Information Visualization, 1, 1--4.

Dabbas, R. M. and Chen, H. (2001) Mining semiconductor manufacturing data for productivity
improvement – an integrated relational database approach. Computers in Industry, 45, 29--44.

De Oliveira, M. C. F. and Levkowitz, H. (2003) From visual data exploration to visual data
mining: A survey. IEEE Transactions on Visualization and Computer Graphics, 9, 378--394.




                                                 858
Doke, E. R., Satzinger, J. W. and Williams, S. R. (2002) Object-oriented application
development using Java, Course Technology, Boston, Massachusetts.

Eick, S. G. (2000) Visual discovery and analysis. IEEE Transactions on Visualization and
Computer Graphics, 6, 44--58.

Elmasri, R. and Navathe, S. (1994) Fundamentals of Database Systems, Addison-Wesley,
Menlo Park, California.

Han, J. and Kamber, M. (2001) Data Mining: Concepts and Techniques, Morgan Kaufmann, San
Francisco.

Inselberg, A. and Dimsdale, B. (1990) Parallel Coordinates: A tool for visualizing
multidimensional geometry. Proc. IEEE Visualization ’90, 361--375.

Keim, D. A., Hao, M. C., Dayal, U. and Hsu, M. (2002) Pixel bar charts: a visualization technique
for very large multi-attribute data sets. Information Visualization, 1, 20—34.

Kielmann, T. K., Hatcher, P., Bouge, L. and Bal, H. E. (2001) Enabling Java for high-
performance computing. Communications of the ACM, 44, 110--117.

Kreuseler, M. and Schumann, H. (2002) A flexible approach for visual data mining. IEEE
Transactions on Visualization and Computer Graphics, 8, 39--51.

Shaw, M. J., Subramaniam, C., Tan, G. W. and Welge, M. E. (2001) Knowledge management
and data mining for marketing. Decision Support Systems, 31, 127--137.

Shneiderman, B. (1998) Designing the User Interface: Strategies for Effective Human-Computer
Interaction, Addison Wesley Longman, Reading, Massachusetts.

Spence, R. (2001) Information Visualization, ACM Press, London.

Verhoef, P. C., Spring, P. N., Hoekstra, J. C. and Leeflang, P. S.H. (2002) The commercial use
of segmentation and predictive modeling techniques for database marketing in the Netherlands.
Decision Support Systems, 34, 471--481.




                                               859

Weitere ähnliche Inhalte

Was ist angesagt?

A study of supply chain management
A study of supply chain managementA study of supply chain management
A study of supply chain managementchiragkolambe
 
Cheap assignment help USA
Cheap assignment help USACheap assignment help USA
Cheap assignment help USANicole Valerio
 
Selection of the Best Proposal using FAHP: Case of Procurement of IT Master P...
Selection of the Best Proposal using FAHP: Case of Procurement of IT Master P...Selection of the Best Proposal using FAHP: Case of Procurement of IT Master P...
Selection of the Best Proposal using FAHP: Case of Procurement of IT Master P...IJECEIAES
 
A Study on Impact of Designation & Employment Role Consumption of Multi-Funct...
A Study on Impact of Designation & Employment Role Consumption of Multi-Funct...A Study on Impact of Designation & Employment Role Consumption of Multi-Funct...
A Study on Impact of Designation & Employment Role Consumption of Multi-Funct...inventionjournals
 
MODEL OF MULTIPLE ARTIFICIAL NEURAL NETWORKS ORIENTED ON SALES PREDICTION AND...
MODEL OF MULTIPLE ARTIFICIAL NEURAL NETWORKS ORIENTED ON SALES PREDICTION AND...MODEL OF MULTIPLE ARTIFICIAL NEURAL NETWORKS ORIENTED ON SALES PREDICTION AND...
MODEL OF MULTIPLE ARTIFICIAL NEURAL NETWORKS ORIENTED ON SALES PREDICTION AND...ijscai
 
Selection of Articles using Data Analytics for Behavioral Dissertation Resear...
Selection of Articles using Data Analytics for Behavioral Dissertation Resear...Selection of Articles using Data Analytics for Behavioral Dissertation Resear...
Selection of Articles using Data Analytics for Behavioral Dissertation Resear...PhD Assistance
 
3.roadmapping the
3.roadmapping the3.roadmapping the
3.roadmapping thelibfsb
 
A NEW MULTI CRITERIA DECISION MAKING METHOD : APPROACH OF LOGARITHMIC CONCEPT...
A NEW MULTI CRITERIA DECISION MAKING METHOD : APPROACH OF LOGARITHMIC CONCEPT...A NEW MULTI CRITERIA DECISION MAKING METHOD : APPROACH OF LOGARITHMIC CONCEPT...
A NEW MULTI CRITERIA DECISION MAKING METHOD : APPROACH OF LOGARITHMIC CONCEPT...ijaia
 
MACHINE LEARNING ALGORITHMS FOR HETEROGENEOUS DATA: A COMPARATIVE STUDY
MACHINE LEARNING ALGORITHMS FOR HETEROGENEOUS DATA: A COMPARATIVE STUDYMACHINE LEARNING ALGORITHMS FOR HETEROGENEOUS DATA: A COMPARATIVE STUDY
MACHINE LEARNING ALGORITHMS FOR HETEROGENEOUS DATA: A COMPARATIVE STUDYIAEME Publication
 
Thesis - Mechanizing optimization of warehouses by implementation of machine ...
Thesis - Mechanizing optimization of warehouses by implementation of machine ...Thesis - Mechanizing optimization of warehouses by implementation of machine ...
Thesis - Mechanizing optimization of warehouses by implementation of machine ...Shrikant Samarth
 
MINING DISCIPLINARY RECORDS OF STUDENT WELFARE AND FORMATION OFFICE: AN EXPLO...
MINING DISCIPLINARY RECORDS OF STUDENT WELFARE AND FORMATION OFFICE: AN EXPLO...MINING DISCIPLINARY RECORDS OF STUDENT WELFARE AND FORMATION OFFICE: AN EXPLO...
MINING DISCIPLINARY RECORDS OF STUDENT WELFARE AND FORMATION OFFICE: AN EXPLO...IJITCA Journal
 
Image-Based Date Fruit Classification
Image-Based Date Fruit ClassificationImage-Based Date Fruit Classification
Image-Based Date Fruit Classificationtoukaigi
 
Impact and Implications of Operations Research in Stock Market
Impact and Implications of Operations Research in Stock MarketImpact and Implications of Operations Research in Stock Market
Impact and Implications of Operations Research in Stock Marketinventionjournals
 

Was ist angesagt? (15)

A study of supply chain management
A study of supply chain managementA study of supply chain management
A study of supply chain management
 
Cheap assignment help USA
Cheap assignment help USACheap assignment help USA
Cheap assignment help USA
 
Selection of the Best Proposal using FAHP: Case of Procurement of IT Master P...
Selection of the Best Proposal using FAHP: Case of Procurement of IT Master P...Selection of the Best Proposal using FAHP: Case of Procurement of IT Master P...
Selection of the Best Proposal using FAHP: Case of Procurement of IT Master P...
 
A Study on Impact of Designation & Employment Role Consumption of Multi-Funct...
A Study on Impact of Designation & Employment Role Consumption of Multi-Funct...A Study on Impact of Designation & Employment Role Consumption of Multi-Funct...
A Study on Impact of Designation & Employment Role Consumption of Multi-Funct...
 
MODEL OF MULTIPLE ARTIFICIAL NEURAL NETWORKS ORIENTED ON SALES PREDICTION AND...
MODEL OF MULTIPLE ARTIFICIAL NEURAL NETWORKS ORIENTED ON SALES PREDICTION AND...MODEL OF MULTIPLE ARTIFICIAL NEURAL NETWORKS ORIENTED ON SALES PREDICTION AND...
MODEL OF MULTIPLE ARTIFICIAL NEURAL NETWORKS ORIENTED ON SALES PREDICTION AND...
 
Selection of Articles using Data Analytics for Behavioral Dissertation Resear...
Selection of Articles using Data Analytics for Behavioral Dissertation Resear...Selection of Articles using Data Analytics for Behavioral Dissertation Resear...
Selection of Articles using Data Analytics for Behavioral Dissertation Resear...
 
3.roadmapping the
3.roadmapping the3.roadmapping the
3.roadmapping the
 
50120140503019
5012014050301950120140503019
50120140503019
 
A NEW MULTI CRITERIA DECISION MAKING METHOD : APPROACH OF LOGARITHMIC CONCEPT...
A NEW MULTI CRITERIA DECISION MAKING METHOD : APPROACH OF LOGARITHMIC CONCEPT...A NEW MULTI CRITERIA DECISION MAKING METHOD : APPROACH OF LOGARITHMIC CONCEPT...
A NEW MULTI CRITERIA DECISION MAKING METHOD : APPROACH OF LOGARITHMIC CONCEPT...
 
MACHINE LEARNING ALGORITHMS FOR HETEROGENEOUS DATA: A COMPARATIVE STUDY
MACHINE LEARNING ALGORITHMS FOR HETEROGENEOUS DATA: A COMPARATIVE STUDYMACHINE LEARNING ALGORITHMS FOR HETEROGENEOUS DATA: A COMPARATIVE STUDY
MACHINE LEARNING ALGORITHMS FOR HETEROGENEOUS DATA: A COMPARATIVE STUDY
 
Thesis - Mechanizing optimization of warehouses by implementation of machine ...
Thesis - Mechanizing optimization of warehouses by implementation of machine ...Thesis - Mechanizing optimization of warehouses by implementation of machine ...
Thesis - Mechanizing optimization of warehouses by implementation of machine ...
 
MINING DISCIPLINARY RECORDS OF STUDENT WELFARE AND FORMATION OFFICE: AN EXPLO...
MINING DISCIPLINARY RECORDS OF STUDENT WELFARE AND FORMATION OFFICE: AN EXPLO...MINING DISCIPLINARY RECORDS OF STUDENT WELFARE AND FORMATION OFFICE: AN EXPLO...
MINING DISCIPLINARY RECORDS OF STUDENT WELFARE AND FORMATION OFFICE: AN EXPLO...
 
COMPARATIVE ANALYSIS OF DIFFERENT MACHINE LEARNING ALGORITHMS FOR PLANT DISEA...
COMPARATIVE ANALYSIS OF DIFFERENT MACHINE LEARNING ALGORITHMS FOR PLANT DISEA...COMPARATIVE ANALYSIS OF DIFFERENT MACHINE LEARNING ALGORITHMS FOR PLANT DISEA...
COMPARATIVE ANALYSIS OF DIFFERENT MACHINE LEARNING ALGORITHMS FOR PLANT DISEA...
 
Image-Based Date Fruit Classification
Image-Based Date Fruit ClassificationImage-Based Date Fruit Classification
Image-Based Date Fruit Classification
 
Impact and Implications of Operations Research in Stock Market
Impact and Implications of Operations Research in Stock MarketImpact and Implications of Operations Research in Stock Market
Impact and Implications of Operations Research in Stock Market
 

Andere mochten auch

Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...ertekg
 
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...
Financial Benchmarking Of Transportation Companies  In The New York Stock Exc...Financial Benchmarking Of Transportation Companies  In The New York Stock Exc...
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...ertekg
 
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Simulation Modeling For Quality And Productivity In Steel Cord ManufacturingSimulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturingertekg
 
Depolama Sistemleri
Depolama SistemleriDepolama Sistemleri
Depolama Sistemleriertekg
 
Rule-based expert systems for supporting university students
Rule-based expert systems for supporting university studentsRule-based expert systems for supporting university students
Rule-based expert systems for supporting university studentsertekg
 
Application of local search methods for solving a quadratic assignment proble...
Application of local search methods for solving a quadratic assignment proble...Application of local search methods for solving a quadratic assignment proble...
Application of local search methods for solving a quadratic assignment proble...ertekg
 
Gorsel Veri Madenciligi’nde bir Vaka Calısması: OSS Verilerinin Analizi
Gorsel Veri Madenciligi’nde bir Vaka Calısması: OSS Verilerinin AnaliziGorsel Veri Madenciligi’nde bir Vaka Calısması: OSS Verilerinin Analizi
Gorsel Veri Madenciligi’nde bir Vaka Calısması: OSS Verilerinin Analiziertekg
 
Optimizing Waste Collection In An Organized Industrial Region: A Case Study
Optimizing Waste Collection  In An Organized Industrial Region: A Case StudyOptimizing Waste Collection  In An Organized Industrial Region: A Case Study
Optimizing Waste Collection In An Organized Industrial Region: A Case Studyertekg
 
Design Requirements For a Tendon Rehabilitation Robot: Results From a Survey ...
Design Requirements For a Tendon Rehabilitation Robot: Results From a Survey ...Design Requirements For a Tendon Rehabilitation Robot: Results From a Survey ...
Design Requirements For a Tendon Rehabilitation Robot: Results From a Survey ...ertekg
 
Teaching Warehousing Concepts through Interactive Animations and 3-D Models
Teaching Warehousing Concepts through Interactive Animations  and 3-D ModelsTeaching Warehousing Concepts through Interactive Animations  and 3-D Models
Teaching Warehousing Concepts through Interactive Animations and 3-D Modelsertekg
 
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...ertekg
 
The Bullwhip Effect In Supply Chain Reflections After A Decade
The Bullwhip Effect In Supply Chain  Reflections After A DecadeThe Bullwhip Effect In Supply Chain  Reflections After A Decade
The Bullwhip Effect In Supply Chain Reflections After A Decadeertekg
 
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...
Application Of Local Search Methods For Solving  A Quadratic Assignment Probl...Application Of Local Search Methods For Solving  A Quadratic Assignment Probl...
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...ertekg
 
Compiere kurulumu
Compiere kurulumuCompiere kurulumu
Compiere kurulumuertekg
 
Re-Mining Item Associations: Methodology and a Case Study in Apparel Retailing
Re-Mining Item Associations: Methodology and a Case Study in Apparel RetailingRe-Mining Item Associations: Methodology and a Case Study in Apparel Retailing
Re-Mining Item Associations: Methodology and a Case Study in Apparel Retailingertekg
 
Impact of Cross Aisles in a Rectangular Warehouse: A Computational Study
Impact of Cross Aisles in a Rectangular Warehouse:  A Computational StudyImpact of Cross Aisles in a Rectangular Warehouse:  A Computational Study
Impact of Cross Aisles in a Rectangular Warehouse: A Computational Studyertekg
 
Statistical Scoring Algorithm for Learning and Study Skills
Statistical Scoring Algorithm for Learning and Study SkillsStatistical Scoring Algorithm for Learning and Study Skills
Statistical Scoring Algorithm for Learning and Study Skillsertekg
 
A Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple DatabasesA Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple Databasesertekg
 
Supplier and Buyer Driven Channels in a Two-Stage Supply Chain
Supplier and Buyer Driven Channels in a  Two-Stage Supply ChainSupplier and Buyer Driven Channels in a  Two-Stage Supply Chain
Supplier and Buyer Driven Channels in a Two-Stage Supply Chainertekg
 
Re-mining Positive and Negative Association Mining Results
Re-mining Positive and Negative Association Mining ResultsRe-mining Positive and Negative Association Mining Results
Re-mining Positive and Negative Association Mining Resultsertekg
 

Andere mochten auch (20)

Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
 
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...
Financial Benchmarking Of Transportation Companies  In The New York Stock Exc...Financial Benchmarking Of Transportation Companies  In The New York Stock Exc...
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...
 
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Simulation Modeling For Quality And Productivity In Steel Cord ManufacturingSimulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturing
 
Depolama Sistemleri
Depolama SistemleriDepolama Sistemleri
Depolama Sistemleri
 
Rule-based expert systems for supporting university students
Rule-based expert systems for supporting university studentsRule-based expert systems for supporting university students
Rule-based expert systems for supporting university students
 
Application of local search methods for solving a quadratic assignment proble...
Application of local search methods for solving a quadratic assignment proble...Application of local search methods for solving a quadratic assignment proble...
Application of local search methods for solving a quadratic assignment proble...
 
Gorsel Veri Madenciligi’nde bir Vaka Calısması: OSS Verilerinin Analizi
Gorsel Veri Madenciligi’nde bir Vaka Calısması: OSS Verilerinin AnaliziGorsel Veri Madenciligi’nde bir Vaka Calısması: OSS Verilerinin Analizi
Gorsel Veri Madenciligi’nde bir Vaka Calısması: OSS Verilerinin Analizi
 
Optimizing Waste Collection In An Organized Industrial Region: A Case Study
Optimizing Waste Collection  In An Organized Industrial Region: A Case StudyOptimizing Waste Collection  In An Organized Industrial Region: A Case Study
Optimizing Waste Collection In An Organized Industrial Region: A Case Study
 
Design Requirements For a Tendon Rehabilitation Robot: Results From a Survey ...
Design Requirements For a Tendon Rehabilitation Robot: Results From a Survey ...Design Requirements For a Tendon Rehabilitation Robot: Results From a Survey ...
Design Requirements For a Tendon Rehabilitation Robot: Results From a Survey ...
 
Teaching Warehousing Concepts through Interactive Animations and 3-D Models
Teaching Warehousing Concepts through Interactive Animations  and 3-D ModelsTeaching Warehousing Concepts through Interactive Animations  and 3-D Models
Teaching Warehousing Concepts through Interactive Animations and 3-D Models
 
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
 
The Bullwhip Effect In Supply Chain Reflections After A Decade
The Bullwhip Effect In Supply Chain  Reflections After A DecadeThe Bullwhip Effect In Supply Chain  Reflections After A Decade
The Bullwhip Effect In Supply Chain Reflections After A Decade
 
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...
Application Of Local Search Methods For Solving  A Quadratic Assignment Probl...Application Of Local Search Methods For Solving  A Quadratic Assignment Probl...
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...
 
Compiere kurulumu
Compiere kurulumuCompiere kurulumu
Compiere kurulumu
 
Re-Mining Item Associations: Methodology and a Case Study in Apparel Retailing
Re-Mining Item Associations: Methodology and a Case Study in Apparel RetailingRe-Mining Item Associations: Methodology and a Case Study in Apparel Retailing
Re-Mining Item Associations: Methodology and a Case Study in Apparel Retailing
 
Impact of Cross Aisles in a Rectangular Warehouse: A Computational Study
Impact of Cross Aisles in a Rectangular Warehouse:  A Computational StudyImpact of Cross Aisles in a Rectangular Warehouse:  A Computational Study
Impact of Cross Aisles in a Rectangular Warehouse: A Computational Study
 
Statistical Scoring Algorithm for Learning and Study Skills
Statistical Scoring Algorithm for Learning and Study SkillsStatistical Scoring Algorithm for Learning and Study Skills
Statistical Scoring Algorithm for Learning and Study Skills
 
A Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple DatabasesA Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple Databases
 
Supplier and Buyer Driven Channels in a Two-Stage Supply Chain
Supplier and Buyer Driven Channels in a  Two-Stage Supply ChainSupplier and Buyer Driven Channels in a  Two-Stage Supply Chain
Supplier and Buyer Driven Channels in a Two-Stage Supply Chain
 
Re-mining Positive and Negative Association Mining Results
Re-mining Positive and Negative Association Mining ResultsRe-mining Positive and Negative Association Mining Results
Re-mining Positive and Negative Association Mining Results
 

Ähnlich wie Visual and analytical mining of transactions data for production planning for production planning and marketing

Visual and analytical mining of sales transaction data for production plannin...
Visual and analytical mining of sales transaction data for production plannin...Visual and analytical mining of sales transaction data for production plannin...
Visual and analytical mining of sales transaction data for production plannin...Gurdal Ertek
 
A machine learning model for predicting innovation effort of firms
A machine learning model for predicting innovation effort of  firmsA machine learning model for predicting innovation effort of  firms
A machine learning model for predicting innovation effort of firmsIJECEIAES
 
DESIGN, DEVELOPMENT & IMPLEMENTATION OF ONTOLOGICAL KNOWLEDGE BASED SYSTEM FO...
DESIGN, DEVELOPMENT & IMPLEMENTATION OF ONTOLOGICAL KNOWLEDGE BASED SYSTEM FO...DESIGN, DEVELOPMENT & IMPLEMENTATION OF ONTOLOGICAL KNOWLEDGE BASED SYSTEM FO...
DESIGN, DEVELOPMENT & IMPLEMENTATION OF ONTOLOGICAL KNOWLEDGE BASED SYSTEM FO...IJDKP
 
Electronics health records and business analytics a cloud based approach
Electronics health records and business analytics a cloud based approachElectronics health records and business analytics a cloud based approach
Electronics health records and business analytics a cloud based approachIAEME Publication
 
Data Mining and Knowledge Management
Data Mining and Knowledge ManagementData Mining and Knowledge Management
Data Mining and Knowledge ManagementIRJET Journal
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data MiningIOSR Journals
 
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...ijaia
 
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...IRJET Journal
 
Continuity in the development of seamless mobility: An approach for a system-...
Continuity in the development of seamless mobility: An approach for a system-...Continuity in the development of seamless mobility: An approach for a system-...
Continuity in the development of seamless mobility: An approach for a system-...IRJET Journal
 
big-book-of-data-science-2ndedition.pdf
big-book-of-data-science-2ndedition.pdfbig-book-of-data-science-2ndedition.pdf
big-book-of-data-science-2ndedition.pdfssuserd397dd
 
Rule-based Information Extraction for Airplane Crashes Reports
Rule-based Information Extraction for Airplane Crashes ReportsRule-based Information Extraction for Airplane Crashes Reports
Rule-based Information Extraction for Airplane Crashes ReportsCSCJournals
 
Rule-based Information Extraction for Airplane Crashes Reports
Rule-based Information Extraction for Airplane Crashes ReportsRule-based Information Extraction for Airplane Crashes Reports
Rule-based Information Extraction for Airplane Crashes ReportsCSCJournals
 
STOCKSENTIX: A MACHINE LEARNING APPROACH TO STOCKMARKET
STOCKSENTIX: A MACHINE LEARNING APPROACH TO STOCKMARKETSTOCKSENTIX: A MACHINE LEARNING APPROACH TO STOCKMARKET
STOCKSENTIX: A MACHINE LEARNING APPROACH TO STOCKMARKETIRJET Journal
 
Mining Social Media Data for Understanding Drugs Usage
Mining Social Media Data for Understanding Drugs  UsageMining Social Media Data for Understanding Drugs  Usage
Mining Social Media Data for Understanding Drugs UsageIRJET Journal
 
Impact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsImpact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsIAEME Publication
 
Impact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsImpact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsIAEME Publication
 
Impact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsImpact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsiaemedu
 

Ähnlich wie Visual and analytical mining of transactions data for production planning for production planning and marketing (20)

Visual and analytical mining of sales transaction data for production plannin...
Visual and analytical mining of sales transaction data for production plannin...Visual and analytical mining of sales transaction data for production plannin...
Visual and analytical mining of sales transaction data for production plannin...
 
A machine learning model for predicting innovation effort of firms
A machine learning model for predicting innovation effort of  firmsA machine learning model for predicting innovation effort of  firms
A machine learning model for predicting innovation effort of firms
 
DESIGN, DEVELOPMENT & IMPLEMENTATION OF ONTOLOGICAL KNOWLEDGE BASED SYSTEM FO...
DESIGN, DEVELOPMENT & IMPLEMENTATION OF ONTOLOGICAL KNOWLEDGE BASED SYSTEM FO...DESIGN, DEVELOPMENT & IMPLEMENTATION OF ONTOLOGICAL KNOWLEDGE BASED SYSTEM FO...
DESIGN, DEVELOPMENT & IMPLEMENTATION OF ONTOLOGICAL KNOWLEDGE BASED SYSTEM FO...
 
50320140502003
5032014050200350320140502003
50320140502003
 
Electronics health records and business analytics a cloud based approach
Electronics health records and business analytics a cloud based approachElectronics health records and business analytics a cloud based approach
Electronics health records and business analytics a cloud based approach
 
Data Mining and Knowledge Management
Data Mining and Knowledge ManagementData Mining and Knowledge Management
Data Mining and Knowledge Management
 
A Survey on Data Mining
A Survey on Data MiningA Survey on Data Mining
A Survey on Data Mining
 
Ijetcas14 409
Ijetcas14 409Ijetcas14 409
Ijetcas14 409
 
50320140502003
5032014050200350320140502003
50320140502003
 
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
A CASE STUDY OF INNOVATION OF AN INFORMATION COMMUNICATION SYSTEM AND UPGRADE...
 
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...IRJET-  	  Comparative Analysis of Various Tools for Data Mining and Big Data...
IRJET- Comparative Analysis of Various Tools for Data Mining and Big Data...
 
Continuity in the development of seamless mobility: An approach for a system-...
Continuity in the development of seamless mobility: An approach for a system-...Continuity in the development of seamless mobility: An approach for a system-...
Continuity in the development of seamless mobility: An approach for a system-...
 
big-book-of-data-science-2ndedition.pdf
big-book-of-data-science-2ndedition.pdfbig-book-of-data-science-2ndedition.pdf
big-book-of-data-science-2ndedition.pdf
 
Rule-based Information Extraction for Airplane Crashes Reports
Rule-based Information Extraction for Airplane Crashes ReportsRule-based Information Extraction for Airplane Crashes Reports
Rule-based Information Extraction for Airplane Crashes Reports
 
Rule-based Information Extraction for Airplane Crashes Reports
Rule-based Information Extraction for Airplane Crashes ReportsRule-based Information Extraction for Airplane Crashes Reports
Rule-based Information Extraction for Airplane Crashes Reports
 
STOCKSENTIX: A MACHINE LEARNING APPROACH TO STOCKMARKET
STOCKSENTIX: A MACHINE LEARNING APPROACH TO STOCKMARKETSTOCKSENTIX: A MACHINE LEARNING APPROACH TO STOCKMARKET
STOCKSENTIX: A MACHINE LEARNING APPROACH TO STOCKMARKET
 
Mining Social Media Data for Understanding Drugs Usage
Mining Social Media Data for Understanding Drugs  UsageMining Social Media Data for Understanding Drugs  Usage
Mining Social Media Data for Understanding Drugs Usage
 
Impact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsImpact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systems
 
Impact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsImpact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systems
 
Impact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systemsImpact of business intelligence tools in executive information systems
Impact of business intelligence tools in executive information systems
 

Mehr von ertekg

Optimizing the electric charge station network of EŞARJ
Optimizing the electric charge station network of EŞARJOptimizing the electric charge station network of EŞARJ
Optimizing the electric charge station network of EŞARJertekg
 
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...ertekg
 
Industrial Benchmarking through Information Visualization and Data Envelopmen...
Industrial Benchmarking through Information Visualization and Data Envelopmen...Industrial Benchmarking through Information Visualization and Data Envelopmen...
Industrial Benchmarking through Information Visualization and Data Envelopmen...ertekg
 
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining AnalysisRisk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysisertekg
 
Text Mining with RapidMiner
Text Mining with RapidMinerText Mining with RapidMiner
Text Mining with RapidMinerertekg
 
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...ertekg
 
A Tutorial On Crossdocking
A Tutorial On CrossdockingA Tutorial On Crossdocking
A Tutorial On Crossdockingertekg
 
Demonstrating Warehousing Concepts Through Interactive Animations
Demonstrating Warehousing Concepts Through Interactive AnimationsDemonstrating Warehousing Concepts Through Interactive Animations
Demonstrating Warehousing Concepts Through Interactive Animationsertekg
 
A Framework for Visualizing Association Mining Results
A Framework for Visualizing  Association Mining ResultsA Framework for Visualizing  Association Mining Results
A Framework for Visualizing Association Mining Resultsertekg
 
Application of the Cutting Stock Problem to a Construction Company: A Case Study
Application of the Cutting Stock Problem to a Construction Company: A Case StudyApplication of the Cutting Stock Problem to a Construction Company: A Case Study
Application of the Cutting Stock Problem to a Construction Company: A Case Studyertekg
 
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...ertekg
 
An Open Source Java Code For Visualizing Supply Chain Problems
An Open Source Java Code For Visualizing Supply Chain ProblemsAn Open Source Java Code For Visualizing Supply Chain Problems
An Open Source Java Code For Visualizing Supply Chain Problemsertekg
 
Dairy Logistics: A Tutorial
Dairy Logistics: A TutorialDairy Logistics: A Tutorial
Dairy Logistics: A Tutorialertekg
 
A Taxonomy Of Logistics Innovations
A Taxonomy Of Logistics InnovationsA Taxonomy Of Logistics Innovations
A Taxonomy Of Logistics Innovationsertekg
 
Innovation in Product Form And Function: Customer Perception Of Their Value
Innovation in Product Form And Function: Customer Perception Of Their ValueInnovation in Product Form And Function: Customer Perception Of Their Value
Innovation in Product Form And Function: Customer Perception Of Their Valueertekg
 
Developing Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data MiningDeveloping Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data Miningertekg
 
Kimya Sanayinde Su Tasarrufu İçin Karar Destek Sistemi
Kimya Sanayinde Su Tasarrufu İçin Karar  Destek Sistemi Kimya Sanayinde Su Tasarrufu İçin Karar  Destek Sistemi
Kimya Sanayinde Su Tasarrufu İçin Karar Destek Sistemi ertekg
 

Mehr von ertekg (17)

Optimizing the electric charge station network of EŞARJ
Optimizing the electric charge station network of EŞARJOptimizing the electric charge station network of EŞARJ
Optimizing the electric charge station network of EŞARJ
 
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
 
Industrial Benchmarking through Information Visualization and Data Envelopmen...
Industrial Benchmarking through Information Visualization and Data Envelopmen...Industrial Benchmarking through Information Visualization and Data Envelopmen...
Industrial Benchmarking through Information Visualization and Data Envelopmen...
 
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining AnalysisRisk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
 
Text Mining with RapidMiner
Text Mining with RapidMinerText Mining with RapidMiner
Text Mining with RapidMiner
 
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
 
A Tutorial On Crossdocking
A Tutorial On CrossdockingA Tutorial On Crossdocking
A Tutorial On Crossdocking
 
Demonstrating Warehousing Concepts Through Interactive Animations
Demonstrating Warehousing Concepts Through Interactive AnimationsDemonstrating Warehousing Concepts Through Interactive Animations
Demonstrating Warehousing Concepts Through Interactive Animations
 
A Framework for Visualizing Association Mining Results
A Framework for Visualizing  Association Mining ResultsA Framework for Visualizing  Association Mining Results
A Framework for Visualizing Association Mining Results
 
Application of the Cutting Stock Problem to a Construction Company: A Case Study
Application of the Cutting Stock Problem to a Construction Company: A Case StudyApplication of the Cutting Stock Problem to a Construction Company: A Case Study
Application of the Cutting Stock Problem to a Construction Company: A Case Study
 
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
 
An Open Source Java Code For Visualizing Supply Chain Problems
An Open Source Java Code For Visualizing Supply Chain ProblemsAn Open Source Java Code For Visualizing Supply Chain Problems
An Open Source Java Code For Visualizing Supply Chain Problems
 
Dairy Logistics: A Tutorial
Dairy Logistics: A TutorialDairy Logistics: A Tutorial
Dairy Logistics: A Tutorial
 
A Taxonomy Of Logistics Innovations
A Taxonomy Of Logistics InnovationsA Taxonomy Of Logistics Innovations
A Taxonomy Of Logistics Innovations
 
Innovation in Product Form And Function: Customer Perception Of Their Value
Innovation in Product Form And Function: Customer Perception Of Their ValueInnovation in Product Form And Function: Customer Perception Of Their Value
Innovation in Product Form And Function: Customer Perception Of Their Value
 
Developing Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data MiningDeveloping Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data Mining
 
Kimya Sanayinde Su Tasarrufu İçin Karar Destek Sistemi
Kimya Sanayinde Su Tasarrufu İçin Karar  Destek Sistemi Kimya Sanayinde Su Tasarrufu İçin Karar  Destek Sistemi
Kimya Sanayinde Su Tasarrufu İçin Karar Destek Sistemi
 

Kürzlich hochgeladen

FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607dollysharma2066
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadIslamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadAyesha Khan
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy Verified Accounts
 
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...lizamodels9
 
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,noida100girls
 
2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis UsageNeil Kimberley
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,noida100girls
 
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCRashishs7044
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckHajeJanKamps
 
Organizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessOrganizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessSeta Wicaksana
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Pereraictsugar
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCRashishs7044
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africaictsugar
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfrichard876048
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesKeppelCorporation
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCRashishs7044
 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...lizamodels9
 

Kürzlich hochgeladen (20)

FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadIslamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
 
Buy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail AccountsBuy gmail accounts.pdf Buy Old Gmail Accounts
Buy gmail accounts.pdf Buy Old Gmail Accounts
 
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
 
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Greater Noida ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
 
2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage2024 Numerator Consumer Study of Cannabis Usage
2024 Numerator Consumer Study of Cannabis Usage
 
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
BEST Call Girls In Old Faridabad ✨ 9773824855 ✨ Escorts Service In Delhi Ncr,
 
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
8447779800, Low rate Call girls in Uttam Nagar Delhi NCR
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
 
Organizational Structure Running A Successful Business
Organizational Structure Running A Successful BusinessOrganizational Structure Running A Successful Business
Organizational Structure Running A Successful Business
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Perera
 
Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africa
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdf
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation Slides
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR
 
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
Call Girls In Radisson Blu Hotel New Delhi Paschim Vihar ❤️8860477959 Escorts...
 
Corporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information TechnologyCorporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information Technology
 

Visual and analytical mining of transactions data for production planning for production planning and marketing

  • 1. Ertek, G., Kuruca, C., Aydin, C., Erel, B.F., Dogan, H., Duman, M., Ocal, M., and Ok, Z.D. (2004). "Visual and analytical mining of sales transaction data for production planning and marketing.” 4th International Symposium on Intelligent Manufacturing Systems, Sakarya, Turkey. Note: This is the final draft version of this paper. Please cite this paper (or this final draft) as above. You can download this final draft from http://research.sabanciuniv.edu. Visual and analytical mining of transactions data for production planning for production planning and marketing Gurdal Ertek, Can Kuruca, Cenk Aydin, Besim Ferit Erel, Harun Dogan, Mustafa Duman, Mete Ocal, Zeynep Damla Ok Sabanci University Istanbul, Turkey
  • 2. Proceedings of 4th International Symposium on Intelligent Manufacturing Systems, September 6-8, 2004: 848-859 ©Sakarya University, Department of Industrial Engineering Visual and analytical mining of sales transaction data for production planning and marketing Gurdal Ertek*, Can Kuruca, Cenk Aydin, Besim Ferit Erel, Harun Dogan, Mustafa Duman, Mete Ocal, Zeynep Damla Ok (*) Corresponding author Sabanci University, Faculty of Engineering and Natural Sciences, Orhanli, Tuzla, 34956, Istanbul, Turkey Tel: +90(216)483-9568 Fax: +90(216)483-9550 Email: ertekg@sabanciuniv.edu 848
  • 3. Visual and analytical mining of transactions data for production planning and marketing Gurdal Ertek, Can Kuruca, Cenk Aydin, Besim Ferit Erel, Harun Dogan, Mustafa Duman, Mete Ocal, Zeynep Damla Ok Abstract Recent developments in information technology paved the way for the collection of large amounts of data pertaining to various aspects of an enterprise. The greatest challenge faced in processing these massive amounts of raw data gathered turns out to be the effective management of data with the ultimate purpose of deriving necessary and meaningful information out of it. The following paper presents an attempt to illustrate the combination of visual and analytical data mining techniques for planning of marketing and production activities. The primary phases of the proposed framework consist of filtering, clustering and comparison steps implemented using interactive pie charts, K-Means algorithm and parallel coordinate plots respectively. A prototype decision support system is developed and a sample analysis session is conducted to demonstrate the applicability of the framework. Submission areas: Decision support systems, data mining, information technologies 849
  • 4. Introduction Widespread use of information technology has resulted in massive collections of data regarding most aspects of an enterprise. The amount of data on the marketing side has exploded due to widespread usage of barcode systems, accounting and Enterprise Resorce Planning (ERP) software and also due to collection of Business-to-Consumer (B2C) and Business-to-Business (B2B) electronic commerce data. The amount of data that comes from manufacturing processes has also exploded, due to application of Computer Integrated Manufacturing (CIM) systems, barcode and radio frequency technology, which provide bulky amounts of real time data. Effective collection, management, reporting, interpretive analysis and mining of enterprise data can help in establishing effective control of manufacturing activities, achieving effective production planning and increased sales, and consequently increasing the firm’s profitability. Data mining can also serve the purpose of increasing customer satisfaction by offering and timely delivering them products that they are willing to purchase. Keeping existing customers is typically much more profitable than trying to acquire new customers. This observation is indeed the underlying concept of Customer Relationship Management (CRM) systems (Shaw et al., 2001). As suggested earlier, data from various areas of an enterprise is now widely available; however, in this paper, only sales transactions data is considered. The reason for choosing this particular data type is that sales transactions data is collected and archived in almost every firm and it is actually the essential input to two very critical aspects of enterprise planning, namely marketing and production planning. A framework for the analysis of this type of data is proposed and implemented in the software developed, namely CuReMa. The main contribution of the framework and the prototype software is the integration of visual and analytical data mining techniques for marketing and production planning. Kreuseler and Schumann (2002) present a similar approach combining visual and analytical data mining techniques without focusing on particular enterprise data. The paper starts by offering a brief review of the literature concerned with visual and analytical data mining techniques. In the following sections, we present the framework we propose and explain how it is implemented in CuReMa. Before concluding, the applicability of the proposed framework is demonstrated with a sample analysis session with the software. Literature Review Analytical data mining techniques are widely used and implemented (Han and Kamber, 2001). In recent years, visual mining of data has also gained importance. The traditional exploratory graphical data analysis methods such as scatter plots, box plots (Chambers et al., 1983) have been enriched with a wide array of new visual representations and methods (de Oliveira and Levkowitz, 2003). The field of computer science involved in such representations and methods is referred to as information visualization. The number of journal articles in information visualization has shown significant increase from 1990’s to present (Chen, 2002), indicating that the field is a promising branch of computer science. Work related to this paper can be grouped in two categories, based on their scope: a) Data mining for marketing: 850
  • 5. Shaw et al. (2001) provide a recent survey of data mining applications for marketing. They present a taxonomy of data mining tasks and provide five broad categories: Dependency analysis, class identification, concept description, deviation detection, and data visualization. Our study suggests a framework that relates to all but the first of these categories. Keim et al. (2002) develop a visualization technique named “pixel bar charts” for analysis of very large multi-attribute data sets and demonstrate applicability of their approach by analyzing real-world e-commerce data sets. b) Data mining for manufacturing: Applications of information visualization in the domain of manufacturing include data representation for engineering design and data analysis for predicting product failure rates (Spence, 2001, p28—30 and p60—61, respectively). Dabbas and Chen (2001) present an integrated relational database approach for semiconductor wafer fabrication, which resulted in improved manufacturing performance. They describe how they combined multiple data sources and various reports under the integrated approach. Data mining tools that can handle fairly large amounts of data and allow derivation of insights are numerous. Spotfire, Miner3D and XLMiner1 are among successful commercial products that can be used to analyze data from a variety of domains. Software libraries which allow building customized visual interfaces to domain-specific applications are also available: For instance, Eick (2000) presents ADVIZOR as “a flexible software environment for visual information discovery” that allows creating visual query and analysis applications. Our implementation has some similarities to these products: For example, one of the similarities between CuReMa, Spotfire and Miner3D is that all of them allow the user to conduct a query on products or customers within a given range through the use of sliders. Proposed Framework An approach that integrates visual data mining methods with analytical methods for mining sales transaction data is proposed to perform the three critical functions listed below, and to answer the given questions as well as many other unlisted ones: 1) Filtering This initial step includes filtering of data in different dimensions and displaying and analyzing the filtered data. In this stage of the analysis, the products or the customers within a cluster or a group are represented with pie charts. This representation allows the identification of significant items and outliers in the cluster or the group. The answers to the following questions can be obtained as a result of filtering: • What are the total sales of top-selling n products for the top-purchasing c customers within time interval (t1, t2) and what is the share of each product? • What are the total sales of n slowest-moving products for a given set of customers (selected from a list sorted based on sales) within time interval (t1, t2) and what is the share of each product? • How much sales were generated for each product represented in the pie chart? 1 The information regarding these products can be found on http://www.spotfire.com, http://www.miner3d.com, and http://www.resample.com/xlminer/ respectively. 851
  • 6. What are the total purchases of top-purchasing c customers from a given set of products (selected from a list sorted based on sales) within time interval (t1, t2), and what is the share of each customer? • How many purchases were made by each of the customers represented in the pie chart? 2) Clustering The second step involves clustering products and customers with respect to selected measures and generating related reports. This phase provides the answer for the questions such as: • How can the products be grouped with respect to seasonal sales patterns? • How can the market be segmented with respect to seasonal purchasing patterns of customers? Many other useful statistics about each cluster may be acquired after clustering. These statistics include the revenue generated by each product, the product with the highest sales level, and the product purchased by the greatest number of customers. 3) Comparison Finally, clusters are compared with respect to selected measures. One of the questions answered through comparison would be the following: • How do the products differ from each other with respect to seasonal sales patterns? Are there significant differences among given clusters? The questions above focus on customer and product clusters, yet CuReMa can answer the same questions based on customer and product groups, which could come with the dataset. The sample analysis session will demonstrate how some of these questions are answered using CuReMa. An Implementation of the Framework: CuReMa The data mining framework has been implemented as a prototype decision support system to demonstrate the viability of the proposed approach. Real world data from a regional distributor of automotive spare parts, covering approximately 18 months of sales transactions, has been analyzed with the developed software. The software, named CuReMa after Customer Relationship Management, allows mining of the dataset for marketing and production planning purposes. The implementation has been done in Java programming language (Doke et al., 2002) using the Eclipse Integrated Development Environment (IDE)2 under Microsoft Windows operating system. CuReMa was built based on a 3-tier design, separating data access, interface and business classes (Doke et at. 2002). MySQL is employed as the database server and MySQL Control Center3 is used in constructing and maintaining the database. Java and MySQL were selected primarily due to their platform independence, which allows porting the developed system to various operating systems, such as Linux and MacOS. Java language also has the advantages of being purely object oriented and having extensive libraries that allow rapid prototyping. Being interpreted at runtime -as opposed to being executed as native code- makes Java programs run 2 The information on Eclipse IDE can be retrieved from http://www.eclipse.org 3 MySQL Control Center can be accessed through http://www.mysql.com/products/mysqlcc/ 852
  • 7. slower than programs written in some other languages. However, implementation of distributed shared memory and remote method invocation (not implemented in CuReMa) can enable programmers to build scalable Java programs (Kielmann et al., 2001). The relational database in MySQL contains a number of tables. The table “transactions”, (shown in Figure 1) contains the sales transactions data and forms the source of all the other tables. Figure 1. Snapshot of the database, illustrating the fields of the “transactions” table We now present how each function proposed in the framework is implemented: Filtering: Interactive Visual Querying In CuReMa, the user can perform visual query of the dataset using pull down menus and sliders. The sliders enable filtering by time, by customer and by product name. The customers and the products are sorted in decreasing order of total sales from the left to the right. Positions of the sliders are translated into SQL (Structured Query Language) (Elmasri and Navathe, 1994, Ch7) statements and reflected on the pie charts in real time. One such statement (that corresponds to the filtering illustrated in Figure 2) is the following: SELECT ItemName AS NAME, SUM(TOTAL) AS TOT FROM deneme3.reference WHERE DATE <='2003.6.6' AND DATE >= '2003.3.5' AND CustomerNo>= 1 AND 173 >= CustomerNo AND 1 <= ItemNo AND 259 >= ItemNo GROUP BY NAME ORDER BY TOT DESC Pie charts display the share of each customer/product within the selected ranges. Clustering: Analytical Data Mining The analytical data mining method implemented in CuReMa is K-Means clustering (Han and Kamber, 2001, Ch8). This method starts with a set of entities, accompanied with given attribute 853
  • 8. values, and proceeds by clustering the entities into a specified number of clusters. The attribute values are the average sales in each month of the year. K- Means algorithm initially selects random values as the K-cluster means. Then, it examines each object pertaining to the dataset and assigns the object to the closest cluster possible, whose proximity is measured by the aggregate distance between the cluster mean and the object's actual values. An epoch consists of N iterations where N is the number of elements in the dataset. As the new objects are added to the clusters for various epochs, the means are dynamically updated. If the total number of moves of all data objects from a cluster to another is zero at the end of an epoch, the algorithm ceases and the partitioning is completed. The algorithm may end up with fewer clusters than specified: For instance, while exploration of the automotive spare parts dataset in CuReMa, it has been observed that a customer clustering run with a desired number of 7 clusters resulted in only 3 clusters. Comparison: Visual Data Mining Parallel coordinate plots (Inselberg and Dimsdale, 1990) are used for comparison of product/customer clusters, and are implemented in Mondrian and XMDV4 software. Parallel coordinate plot “maps a k-dimensional data or object space onto the 2D display by drawing k equally spaced axes in parallel” (de Oliveira and Levkowitz, 2003). In the plot, each axis corresponds to an attribute, and each line corresponds to an element of the dataset. In CuReMa, customer purchases and product sales can be compared in a parallel coordinate plot based on monthly sales. Sample Analysis Session In this section we present how the user can interact with CuReMa in an analysis session. 4 Mondrian and XMDV can be accessed from http://www.theusrus.de/Mondrian/index.html and http://davis.wpi.edu/~xmdv/ respectively. 854
  • 9. Figure 2. Filtering of products Figure 3. Clustering of products 855
  • 10. The analysis begins with Filtering, where the user selects slicing based on products from the pull-down menus in the Graph Frame (Figure 2). The user selects the data for the last 3 months for all customers (with sales in the last 3 months) and top-selling 100 products from the sliders. The top-selling product is observed to be “MOT.YAG” (motor oil) by selecting the largest slice in the pie chart. Next, the user performs clustering of the products from the previous analysis by clicking the “Cluster” button on the Graph Frame (Figure 3). Since products are displayed, the check box next to Product is selected. The user also selects clustering criteria, number of clusters, and the time range. Clustering is completed in approximately 2 minutes on a PC with Intel Pentium IIIE 933 MHz processor and 384 MB of RAM. Then the user selects the item named “Top100ProductsInLast3Months” in the Query Frame, “Parallel Coordinate Plot” from Graphics menu and clicks “Draw Charts” button to obtain the plot in Figure 4. In this plot, products in clusters 2 and 3 are observed to exhibit almost constant sales throughout the selected months whereas products in cluster 1 have the largest amount of sales, but also a greater variance as compared to others. A report can be generated inside Reports Frame by clicking “Generate Report” button in the left bottom corner. Figure 4. Comparison of product clusters The user can also analyze customers, clustering them into similar-behaving groups. Figure 5 offers the result of one such clustering. In this stage of the analysis, all the customers are selected at all the times, and clustering is performed. It can be observed that cluster 1 shows 856
  • 11. stable sales, with minimal variability. Trying to increase the sales volume to these customers can be a profitable strategy, since these customers do not cause big fluctuations in production schedules. Clusters 2 and 3 are observed to have very high levels of sales in March, while clusters 0 and 4 exhibit high sales volumes in August and September respectively. These fluctuations should be further investigated by examining the reports generated and by filtering the sales transaction data for these months. Figure 5. Comparison of customer clusters Conclusions and Future Work In this paper a framework is presented for the analysis of sales transaction data using visual and analytical data mining techniques. The framework suggests applying filtering, clustering and comparison through interactive pie charts, K-Means method and parallel coordinate plots, respectively. The framework is implemented in a software to demonstrate how the analysis can be carried out. A sample session is exhibited. The framework proposed in this paper can be extended to answer a greater range of questions. Visual metaphors reported in information visualization survey papers (e.g. de Oliveira and Levkowitz, 2003) can be adapted to the current framework, allowing observation and query of other aspects regarding the data. From an analytical point of view, other data mining techniques such as deriving association rules for revealing patterns (Changchien and Lu, 2001) and performing cross-tabulation analysis (Verhoef et al., 2002) can be made part of the framework and implemented in CuReMa. 857
  • 12. CuReMa can be tested and used with a variety of transactional datasets coming from different industries (e.g. from an e-commerce website) and the scope of the program can be broadened to include other data fields, such as ages and income levels of customers, and marginal profits of the products. On the methodological side, clustering algorithms other than K-Means can be built into the software and their performance can be compared to the performance of K-Means. One such method is SOM (Self Organized Maps), which is implemented in coherence within a visual data mining framework by Kreuseler and Schumann (2002). The clustering function in the software can be extended by taking into consideration other attributes besides monthly sales averages. One such attribute can be recency of purchases by customers and by products: Verhoef et al. (2002) report that recency and frequency of purchases are among the most popular variables used for clustering while carrying out market segmentation analysis. Data analyzed can have domain-specific considerations. For example, in many countries around the world, inflation is a real-world fact that cannot be neglected if the analyst attempts to perform a valid and insightful analysis. Adding tables into the database that keep track of the inflation indices on a monthly basis and adjusting calculations based on these indices would be pretty valuable for the dataset analyzed in such a study. Another domain-specific consideration would be the effects of holidays. One could take into account religious holidays in Turkey which turn out to be around ten days earlier from the previous year. This and similar considerations pose significant challenges. One possible area of future work is to investigate the implemented software from the point of view of human-computer interaction (Shneiderman, 1998). One can investigate the performance of various visual components and their arrangements by formal usability tests on groups of users and analyze results. Acknowledgement The authors would like to thank Ender Yalcin for providing the sales transaction data and for explaining the processes involved in decision making. The authors would also like to thank Selim Balcisoy, Pelin Gulsah Canbolat and the anonymous referees for their valuable suggestions and remarks. References Chambers, J. M., Cleveland, W. S., Kleiner, B. and Tukey, P. A. (1983) Graphical Methods for Data Analysis, Wadsworth & Brooks/Cole, Pacific Grove, California. Changchien, S. W. and Lu, T. (2001) Mining association rules procedure to support on-line recommendation by customers and products fragmentation. Expert Systems with Applications, 20, 325--335. Chen, C. (2002) Information visualization. Information Visualization, 1, 1--4. Dabbas, R. M. and Chen, H. (2001) Mining semiconductor manufacturing data for productivity improvement – an integrated relational database approach. Computers in Industry, 45, 29--44. De Oliveira, M. C. F. and Levkowitz, H. (2003) From visual data exploration to visual data mining: A survey. IEEE Transactions on Visualization and Computer Graphics, 9, 378--394. 858
  • 13. Doke, E. R., Satzinger, J. W. and Williams, S. R. (2002) Object-oriented application development using Java, Course Technology, Boston, Massachusetts. Eick, S. G. (2000) Visual discovery and analysis. IEEE Transactions on Visualization and Computer Graphics, 6, 44--58. Elmasri, R. and Navathe, S. (1994) Fundamentals of Database Systems, Addison-Wesley, Menlo Park, California. Han, J. and Kamber, M. (2001) Data Mining: Concepts and Techniques, Morgan Kaufmann, San Francisco. Inselberg, A. and Dimsdale, B. (1990) Parallel Coordinates: A tool for visualizing multidimensional geometry. Proc. IEEE Visualization ’90, 361--375. Keim, D. A., Hao, M. C., Dayal, U. and Hsu, M. (2002) Pixel bar charts: a visualization technique for very large multi-attribute data sets. Information Visualization, 1, 20—34. Kielmann, T. K., Hatcher, P., Bouge, L. and Bal, H. E. (2001) Enabling Java for high- performance computing. Communications of the ACM, 44, 110--117. Kreuseler, M. and Schumann, H. (2002) A flexible approach for visual data mining. IEEE Transactions on Visualization and Computer Graphics, 8, 39--51. Shaw, M. J., Subramaniam, C., Tan, G. W. and Welge, M. E. (2001) Knowledge management and data mining for marketing. Decision Support Systems, 31, 127--137. Shneiderman, B. (1998) Designing the User Interface: Strategies for Effective Human-Computer Interaction, Addison Wesley Longman, Reading, Massachusetts. Spence, R. (2001) Information Visualization, ACM Press, London. Verhoef, P. C., Spring, P. N., Hoekstra, J. C. and Leeflang, P. S.H. (2002) The commercial use of segmentation and predictive modeling techniques for database marketing in the Netherlands. Decision Support Systems, 34, 471--481. 859