SlideShare ist ein Scribd-Unternehmen logo
1 von 37
Downloaden Sie, um offline zu lesen
Presented by:
Harish Chand
Data Mining and Data
Warehousing
Data Mining
• Data mining, the extraction of hidden predictive information from large databases,
• The overall goal of the data mining process is to extract information from a data
set and transform it into an understandable structure for further use
• Often Not to be confused with analytics, information extraction, or data analysis
• But its goal is the extraction of patterns and knowledge from large amount of
data, not the extraction of data itself
DATA MINIING TECHIQUES
• It extract interesting patterns such as groups of data records(cluster analysis ),
unusual records(anamoly detection)
• These patterns can then be seen as a kind of summary of the input data, and may
be used in further analysis or, for example, in machine learning and predictive
analytics
• For example, the data mining step might identify multiple groups in the data,
which can then be used to obtain more accurate prediction results by a decision
support system
Data Warehousing
• also known as an enterprise data warehouse (EDW), is a system used
for reporting and data analysis. DWs are central repositories of integrated data
from one or more disparate sources
• Storing company data in a secondary location which is typically away from
production systems.
• Location= where information can be proactively reported and queried against
• Goal =creation of single logical view of data that may reside in many different
physical database
• it exists to help users understand and enhance their organization's performance
• It is designed for query and analysis rather than for transaction processing,
• and usually contains historical data derived from transaction data, but can include
data from other sources
TYPES OF DATA WAREHOUSING
• Data Mart
• Online Transaction Processing
• Online Analytical Processing.
Data Mart
• A data mart is a simple form of a data warehouse that is focused on functional
area such as sales, finance or marketing
.
• Data marts are often built and controlled by a single department within an
organization.
• Given their single-subject focus, data marts usually draw data from only a few
sources.
• The sources could be internal operational systems, a central data warehouse, or
external data.
Online Analytical
Processing(OLAP)
• Is characterized by a relatively low volume of transactions. Queries are often very
complex and involve aggregations.
• For OLAP systems, response time is an effectiveness measure.
• OLAP databases store aggregated, historical data in multi-dimensional schemas
(usually star schemas).
• OLAP systems typically have data latency of a few hours, as opposed to data
marts, where latency is expected to be closer to one day.
Online Transaction
Processing(OTLP)
• Is characterized by a large number of short on-line transactions (INSERT,
UPDATE, DELETE).
• OLTP systems emphasize very fast query processing and maintaining data
integrity in multi-access environments.
• For OLTP systems, effectiveness is measured by the number of transactions per
second.
• OLTP databases contain detailed and current data.
• The schema used to store transactional databases is the entity model
(usually 3NF).
Data mining VS Data
warehousing
Data warehouse Data mining
Process of storing data in order in given
dataset
Process of finding pattern in given
dataset.
Data warehousing is the process of
extracting and storing data to allow easier
reporting.
Data mining is the use of pattern
recognition logic to identity trends within
a sample data set and extrapolate this
information against the larger data pool
The tools in data warehousing are
designed to extract data and store it in a
method designed to provide enhanced
system performance
A typical use of data mining is to create
targeted marketing programs, identify
financial fraud,
Helps in identifing the certain data in a
collection of data
Helps in figuring out a certain pattern of a
data or a cluster of data
Benefits of Data Warehouse
• A Data Warehouse Delivers Enhanced Business Intelligence
By providing data from various sources, managers and executives will
no longer need to make business decisions based on limited data
• A Data Warehouse Saves Time
Since business users can quickly access critical data from a number of
sources—all in one place—they can rapidly make informed decisions
on key initiatives
Benefits of Data Warehouse
• A Data Warehouse Enhances Data Quality and Consistency
Individual business units and departments including sales,
marketing, finance, and operations, will start to utilize the same data
repository as the source system for their individual queries and
reports.
Thus each of these individual business units and departments will
produce results that are consistent with the other business units
within the organization.
Benefits of Data Warehousing
• A Data Warehouse Provides Historical Intelligence
 A data warehouse stores large amounts of historical data so we can
analyze different time periods and trends in order to make future
predictions
 can enable advanced business intelligence including time-period
analysis, trend analysis, and trend prediction.
Benefits of Data Warehousing
• A Data Warehouse Generates a High ROI
 Return on Investment(ROI)
 Past references shows that companies that have implemented data
warehouses have generated more revenue and saved more money
than companies that haven’t invested on data warehouses.
Challenges faced on Data Warehousing
• User expectation
 end-user demands and expects more accurate and refined results in
return of processing,
 however the performance decreases with exploding data and so the
efficiency of the system reduces.
• Systems optimization
 business intelligence tools require frequent maintenance and fine
tuning of whole system in order to meet users' expectations.
Challenges faced on Data Warehousing
• Data structuring
Proper processing of data requires structuring it in a desired format
so that further operations can be executed
As the volume of data increases the task of structuring the
unstructured data add-on, slowing down the processing capabilities of
system and eventually becomes hectic for the system manager to
qualify the data for analytic purpose.
Challenges faced on Data Warehousing
• Prefabricated vs. Custom warehouse
The varieties of warehouses available in market create ambiguity
about which type to choose or go for.
Custom warehouse saves the time of building the warehouses from
various operational
Prefabricated warehouses saves the time of initial configuration and
installation.
Challenges faced on Data Warehousing
• Resource Balancing
Many departments inside an organization tend to access the
processing capabilities of the warehouse which eventually reduces the
performance of the system and decreases the efficiency as the stress
on the system increases.
 Access control and security are some techniques which can be used
to maintain a balance between the utilization and performance of
warehouse systems.
Decision support systems (DSS)
Decision support systems (DSS) are a specific class of computerized
information system that supports business and organizational
decision-making activities.
DSS is a well integrated ,user friendly, computer based tools that
combine data with various decision making models to solve semi
structure and unstructured problems.
Characteristics
Provide decision support for several interdependent decision.
Assist the decision maker to make decision under dynamic business
conditions.
Supports a wide variety of decision making processes and style.
How a DSS works???
22
Database management system
Model management system
Support tools
Components of DSS
23
In database management ,the problem necessary to solve may come
from internal and external database.
Within the organization, internal data are generated by systems such
as TPS and MIS; external data come from variety of sources such as
periodicals, databases, newspapers and online data services.
Database Management
24
It stores and access models that managers use to make decisions.
Models are integral part of most decision making and are used for
many tasks, such as designing a manufacturing facility, analysing the
financial health of an organization, forecasting demand for a product
or service, and determining the quality of a particular batch of
products.
Model Management Component
25
It consist of tools such as pull down menus, on-line help, users
interface, graphical analysis and error-correction mechanisms all of
which facilitate users interactions with the system.
Interfaces are an important support tools. This is because middle and
top managers have neither the time nor the inclination to learn
difficult and complicated procedures in order to run a system. For
better the interface, the greater the chances that users will accept the
system.
Support Tools
26
Cost saving
Improve managerial effectiveness
Flexible and adaptive
Improve the effectiveness of the decision
Reduces the time and efforts in collecting and analysis of data for different
sources, a large no of alternatives can be evaluated.
Advantages of DSS
27
It is also termed as Executive Support System[ESS].
It is a specialized decision support system used to assist senior
executives in the decision-making process.
It includes various hardware, software, data, procedures and the
people.
It is very user friendly in the nature.
Executive Information
System[EIS]
28
29
1. Informational characteristics
2. User interface/orientation characteristics
3. Managerial / executive characteristics
Characteristics of Executive
Information System
30
i. Flexibility and ease of use.
i. Provides the timely information with the short response time and also
with the quick retrieval.
i. Produces the correct information.
i. Produces the relevant information.
ii. Produces the validated information.
1. Informational characteristics
31
• Consists of the sophisticated self help.
• Contains the user friendly interfaces consisting of the graphic user.
• Can be used from many places.
2. User interface/orientation
characteristics
32
• Offers secure reliable, confidential access along with the access
procedure.
• Is very much customized. Suites the management style of the
individual executives.
i. Supports the over all vision, mission and the strategy.
ii. Provides the support for the strategic management.
iii. Sometimes helps to deal with the situations that have a high degree of risk.
iv. Is linked to the value added business processes.
v. Supports the access to database.
vi. Is very much result oriented in the nature.
3. Managerial / executive
characteristics
34
• Achievement of the various organizational objectives.
Facilitates access to the information by integrating many sources of the
data.
• Facilitates broad, aggregated perspective and the context.
• Offers broad highly aggregated information.
• User’s productivity is also improved to a large extent.
• Communication capability and the quality are increased.
Advantages of EIS
35
• Internal factors- accurate & reliable information, improve
communications, use of historical data
• External factors- increasing global competition, changing the
business environment, government regulations.
Factors affecting EIS
DSS EIS
• Used by professionals
• Required for day to day operations
• Deals both with semi & unstructured
data
• Consists only of internal information
• Used by executives
• Required for strategic plans and procedures
• Deals only with unstructured data (which
cannot be described in detail)
• Consists of both internal & external
information
37
Differences between DSS and
EIS

Weitere ähnliche Inhalte

Was ist angesagt? (20)

Business intelligence and data warehouses
Business intelligence and data warehousesBusiness intelligence and data warehouses
Business intelligence and data warehouses
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Deductive databases
Deductive databasesDeductive databases
Deductive databases
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
 
Data mining slides
Data mining slidesData mining slides
Data mining slides
 
Data Mining
Data MiningData Mining
Data Mining
 
Data mining
Data mining Data mining
Data mining
 
multi dimensional data model
multi dimensional data modelmulti dimensional data model
multi dimensional data model
 
Data Mining: Association Rules Basics
Data Mining: Association Rules BasicsData Mining: Association Rules Basics
Data Mining: Association Rules Basics
 
OLAP operations
OLAP operationsOLAP operations
OLAP operations
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
 
Distributed database
Distributed databaseDistributed database
Distributed database
 
Relational database
Relational database Relational database
Relational database
 
Data warehouse
Data warehouse Data warehouse
Data warehouse
 
Data Warehousing
Data WarehousingData Warehousing
Data Warehousing
 
Data mining concepts and work
Data mining concepts and workData mining concepts and work
Data mining concepts and work
 
Distributed database management system
Distributed database management  systemDistributed database management  system
Distributed database management system
 
Data analytics
Data analyticsData analytics
Data analytics
 

Andere mochten auch

Introduction to the Computer-Based Information System
Introduction to the Computer-Based Information SystemIntroduction to the Computer-Based Information System
Introduction to the Computer-Based Information SystemFathur Rohman
 
E-Learning and Distance Education in Higher Education: Organizational Implica...
E-Learning and Distance Education in Higher Education: Organizational Implica...E-Learning and Distance Education in Higher Education: Organizational Implica...
E-Learning and Distance Education in Higher Education: Organizational Implica...Mark Bullen
 
Data, Information And Knowledge Management Framework And The Data Management ...
Data, Information And Knowledge Management Framework And The Data Management ...Data, Information And Knowledge Management Framework And The Data Management ...
Data, Information And Knowledge Management Framework And The Data Management ...Alan McSweeney
 
The Role of Management at Possible
The Role of Management at PossibleThe Role of Management at Possible
The Role of Management at PossiblePossible
 
computer based information system
computer based information systemcomputer based information system
computer based information systemjandian
 
Introduction to Brand Management
Introduction to Brand ManagementIntroduction to Brand Management
Introduction to Brand ManagementSunny Bose
 
08-Management Information System
08-Management Information System08-Management Information System
08-Management Information SystemWahyu Wijanarko
 
Security & ethical challenges
Security & ethical challengesSecurity & ethical challenges
Security & ethical challengesLouie Medinaceli
 
Computer based information system
Computer based information systemComputer based information system
Computer based information systemshoaibzaheer1
 
E-Learning and the Future of Distance Education
E-Learning and the Future of Distance EducationE-Learning and the Future of Distance Education
E-Learning and the Future of Distance EducationMark Bullen
 
Functions of management
Functions of managementFunctions of management
Functions of managementAltyeb Sayf
 
Functions Of Management
Functions Of ManagementFunctions Of Management
Functions Of ManagementZohra Rubab
 
Functions of management
Functions of managementFunctions of management
Functions of managementprathapsaran
 
Management Information System (Full Notes)
Management Information System (Full Notes)Management Information System (Full Notes)
Management Information System (Full Notes)Harish Chand
 
Management information system
Management information systemManagement information system
Management information systemSikander Saini
 

Andere mochten auch (20)

Introduction to the Computer-Based Information System
Introduction to the Computer-Based Information SystemIntroduction to the Computer-Based Information System
Introduction to the Computer-Based Information System
 
E-Learning and Distance Education in Higher Education: Organizational Implica...
E-Learning and Distance Education in Higher Education: Organizational Implica...E-Learning and Distance Education in Higher Education: Organizational Implica...
E-Learning and Distance Education in Higher Education: Organizational Implica...
 
Data, Information And Knowledge Management Framework And The Data Management ...
Data, Information And Knowledge Management Framework And The Data Management ...Data, Information And Knowledge Management Framework And The Data Management ...
Data, Information And Knowledge Management Framework And The Data Management ...
 
Chapter 02
Chapter 02Chapter 02
Chapter 02
 
Telecommuting
TelecommutingTelecommuting
Telecommuting
 
The Role of Management at Possible
The Role of Management at PossibleThe Role of Management at Possible
The Role of Management at Possible
 
computer based information system
computer based information systemcomputer based information system
computer based information system
 
Introduction to Brand Management
Introduction to Brand ManagementIntroduction to Brand Management
Introduction to Brand Management
 
08-Management Information System
08-Management Information System08-Management Information System
08-Management Information System
 
Security & ethical challenges
Security & ethical challengesSecurity & ethical challenges
Security & ethical challenges
 
Computer based information system
Computer based information systemComputer based information system
Computer based information system
 
E-Learning and the Future of Distance Education
E-Learning and the Future of Distance EducationE-Learning and the Future of Distance Education
E-Learning and the Future of Distance Education
 
Functions of management
Functions of managementFunctions of management
Functions of management
 
Types-of-computer-ppt
Types-of-computer-pptTypes-of-computer-ppt
Types-of-computer-ppt
 
Functions Of Management
Functions Of ManagementFunctions Of Management
Functions Of Management
 
Networking
NetworkingNetworking
Networking
 
Functions of management
Functions of managementFunctions of management
Functions of management
 
Chap 1 introduction to management
Chap 1 introduction to managementChap 1 introduction to management
Chap 1 introduction to management
 
Management Information System (Full Notes)
Management Information System (Full Notes)Management Information System (Full Notes)
Management Information System (Full Notes)
 
Management information system
Management information systemManagement information system
Management information system
 

Ähnlich wie Data mining & data warehousing (ppt)

ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxParnalSatle
 
Cognos datawarehouse
Cognos datawarehouseCognos datawarehouse
Cognos datawarehousessuser7fc7eb
 
Introduction to business intelligence
Introduction to business intelligenceIntroduction to business intelligence
Introduction to business intelligenceThilinaWanshathilaka
 
Data Mining & Data Warehousing
Data Mining & Data WarehousingData Mining & Data Warehousing
Data Mining & Data WarehousingAAKANKSHA JAIN
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biA P
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxSalehaMariyam
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data WarehouseSOMASUNDARAM T
 
Data warehouse
Data warehouseData warehouse
Data warehouseRajThakuri
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxvipush1
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemKiran kumar
 
data warehousing
data warehousingdata warehousing
data warehousing143sohil
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingsumit621
 
Data warehousing.pptx
Data warehousing.pptxData warehousing.pptx
Data warehousing.pptxAnusuya123
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.pptSumathiG8
 

Ähnlich wie Data mining & data warehousing (ppt) (20)

ETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptxETL processes , Datawarehouse and Datamarts.pptx
ETL processes , Datawarehouse and Datamarts.pptx
 
Data Mining
Data MiningData Mining
Data Mining
 
Cognos datawarehouse
Cognos datawarehouseCognos datawarehouse
Cognos datawarehouse
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Introduction to business intelligence
Introduction to business intelligenceIntroduction to business intelligence
Introduction to business intelligence
 
Data Mining & Data Warehousing
Data Mining & Data WarehousingData Mining & Data Warehousing
Data Mining & Data Warehousing
 
Manish tripathi-ea-dw-bi
Manish tripathi-ea-dw-biManish tripathi-ea-dw-bi
Manish tripathi-ea-dw-bi
 
Unit 1
Unit 1Unit 1
Unit 1
 
DATA WAREHOUSING.2.pptx
DATA WAREHOUSING.2.pptxDATA WAREHOUSING.2.pptx
DATA WAREHOUSING.2.pptx
 
DWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptxDWDM Unit 1 (1).pptx
DWDM Unit 1 (1).pptx
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
presentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptxpresentationofism-complete-1-100227093028-phpapp01.pptx
presentationofism-complete-1-100227093028-phpapp01.pptx
 
Bi
BiBi
Bi
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse System
 
MIS
MISMIS
MIS
 
data warehousing
data warehousingdata warehousing
data warehousing
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 
Data warehousing.pptx
Data warehousing.pptxData warehousing.pptx
Data warehousing.pptx
 
20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt20IT501_DWDM_PPT_Unit_I.ppt
20IT501_DWDM_PPT_Unit_I.ppt
 

Kürzlich hochgeladen

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Kürzlich hochgeladen (20)

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

Data mining & data warehousing (ppt)

  • 1. Presented by: Harish Chand Data Mining and Data Warehousing
  • 2. Data Mining • Data mining, the extraction of hidden predictive information from large databases, • The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use • Often Not to be confused with analytics, information extraction, or data analysis • But its goal is the extraction of patterns and knowledge from large amount of data, not the extraction of data itself
  • 3. DATA MINIING TECHIQUES • It extract interesting patterns such as groups of data records(cluster analysis ), unusual records(anamoly detection) • These patterns can then be seen as a kind of summary of the input data, and may be used in further analysis or, for example, in machine learning and predictive analytics • For example, the data mining step might identify multiple groups in the data, which can then be used to obtain more accurate prediction results by a decision support system
  • 4. Data Warehousing • also known as an enterprise data warehouse (EDW), is a system used for reporting and data analysis. DWs are central repositories of integrated data from one or more disparate sources • Storing company data in a secondary location which is typically away from production systems. • Location= where information can be proactively reported and queried against • Goal =creation of single logical view of data that may reside in many different physical database
  • 5. • it exists to help users understand and enhance their organization's performance • It is designed for query and analysis rather than for transaction processing, • and usually contains historical data derived from transaction data, but can include data from other sources
  • 6. TYPES OF DATA WAREHOUSING • Data Mart • Online Transaction Processing • Online Analytical Processing.
  • 7. Data Mart • A data mart is a simple form of a data warehouse that is focused on functional area such as sales, finance or marketing . • Data marts are often built and controlled by a single department within an organization. • Given their single-subject focus, data marts usually draw data from only a few sources. • The sources could be internal operational systems, a central data warehouse, or external data.
  • 8. Online Analytical Processing(OLAP) • Is characterized by a relatively low volume of transactions. Queries are often very complex and involve aggregations. • For OLAP systems, response time is an effectiveness measure. • OLAP databases store aggregated, historical data in multi-dimensional schemas (usually star schemas). • OLAP systems typically have data latency of a few hours, as opposed to data marts, where latency is expected to be closer to one day.
  • 9. Online Transaction Processing(OTLP) • Is characterized by a large number of short on-line transactions (INSERT, UPDATE, DELETE). • OLTP systems emphasize very fast query processing and maintaining data integrity in multi-access environments. • For OLTP systems, effectiveness is measured by the number of transactions per second.
  • 10. • OLTP databases contain detailed and current data. • The schema used to store transactional databases is the entity model (usually 3NF).
  • 11. Data mining VS Data warehousing Data warehouse Data mining Process of storing data in order in given dataset Process of finding pattern in given dataset. Data warehousing is the process of extracting and storing data to allow easier reporting. Data mining is the use of pattern recognition logic to identity trends within a sample data set and extrapolate this information against the larger data pool The tools in data warehousing are designed to extract data and store it in a method designed to provide enhanced system performance A typical use of data mining is to create targeted marketing programs, identify financial fraud, Helps in identifing the certain data in a collection of data Helps in figuring out a certain pattern of a data or a cluster of data
  • 12. Benefits of Data Warehouse • A Data Warehouse Delivers Enhanced Business Intelligence By providing data from various sources, managers and executives will no longer need to make business decisions based on limited data • A Data Warehouse Saves Time Since business users can quickly access critical data from a number of sources—all in one place—they can rapidly make informed decisions on key initiatives
  • 13. Benefits of Data Warehouse • A Data Warehouse Enhances Data Quality and Consistency Individual business units and departments including sales, marketing, finance, and operations, will start to utilize the same data repository as the source system for their individual queries and reports. Thus each of these individual business units and departments will produce results that are consistent with the other business units within the organization.
  • 14. Benefits of Data Warehousing • A Data Warehouse Provides Historical Intelligence  A data warehouse stores large amounts of historical data so we can analyze different time periods and trends in order to make future predictions  can enable advanced business intelligence including time-period analysis, trend analysis, and trend prediction.
  • 15. Benefits of Data Warehousing • A Data Warehouse Generates a High ROI  Return on Investment(ROI)  Past references shows that companies that have implemented data warehouses have generated more revenue and saved more money than companies that haven’t invested on data warehouses.
  • 16. Challenges faced on Data Warehousing • User expectation  end-user demands and expects more accurate and refined results in return of processing,  however the performance decreases with exploding data and so the efficiency of the system reduces. • Systems optimization  business intelligence tools require frequent maintenance and fine tuning of whole system in order to meet users' expectations.
  • 17. Challenges faced on Data Warehousing • Data structuring Proper processing of data requires structuring it in a desired format so that further operations can be executed As the volume of data increases the task of structuring the unstructured data add-on, slowing down the processing capabilities of system and eventually becomes hectic for the system manager to qualify the data for analytic purpose.
  • 18. Challenges faced on Data Warehousing • Prefabricated vs. Custom warehouse The varieties of warehouses available in market create ambiguity about which type to choose or go for. Custom warehouse saves the time of building the warehouses from various operational Prefabricated warehouses saves the time of initial configuration and installation.
  • 19. Challenges faced on Data Warehousing • Resource Balancing Many departments inside an organization tend to access the processing capabilities of the warehouse which eventually reduces the performance of the system and decreases the efficiency as the stress on the system increases.  Access control and security are some techniques which can be used to maintain a balance between the utilization and performance of warehouse systems.
  • 20. Decision support systems (DSS) Decision support systems (DSS) are a specific class of computerized information system that supports business and organizational decision-making activities. DSS is a well integrated ,user friendly, computer based tools that combine data with various decision making models to solve semi structure and unstructured problems.
  • 21. Characteristics Provide decision support for several interdependent decision. Assist the decision maker to make decision under dynamic business conditions. Supports a wide variety of decision making processes and style.
  • 22. How a DSS works??? 22
  • 23. Database management system Model management system Support tools Components of DSS 23
  • 24. In database management ,the problem necessary to solve may come from internal and external database. Within the organization, internal data are generated by systems such as TPS and MIS; external data come from variety of sources such as periodicals, databases, newspapers and online data services. Database Management 24
  • 25. It stores and access models that managers use to make decisions. Models are integral part of most decision making and are used for many tasks, such as designing a manufacturing facility, analysing the financial health of an organization, forecasting demand for a product or service, and determining the quality of a particular batch of products. Model Management Component 25
  • 26. It consist of tools such as pull down menus, on-line help, users interface, graphical analysis and error-correction mechanisms all of which facilitate users interactions with the system. Interfaces are an important support tools. This is because middle and top managers have neither the time nor the inclination to learn difficult and complicated procedures in order to run a system. For better the interface, the greater the chances that users will accept the system. Support Tools 26
  • 27. Cost saving Improve managerial effectiveness Flexible and adaptive Improve the effectiveness of the decision Reduces the time and efforts in collecting and analysis of data for different sources, a large no of alternatives can be evaluated. Advantages of DSS 27
  • 28. It is also termed as Executive Support System[ESS]. It is a specialized decision support system used to assist senior executives in the decision-making process. It includes various hardware, software, data, procedures and the people. It is very user friendly in the nature. Executive Information System[EIS] 28
  • 29. 29
  • 30. 1. Informational characteristics 2. User interface/orientation characteristics 3. Managerial / executive characteristics Characteristics of Executive Information System 30
  • 31. i. Flexibility and ease of use. i. Provides the timely information with the short response time and also with the quick retrieval. i. Produces the correct information. i. Produces the relevant information. ii. Produces the validated information. 1. Informational characteristics 31
  • 32. • Consists of the sophisticated self help. • Contains the user friendly interfaces consisting of the graphic user. • Can be used from many places. 2. User interface/orientation characteristics 32
  • 33. • Offers secure reliable, confidential access along with the access procedure. • Is very much customized. Suites the management style of the individual executives.
  • 34. i. Supports the over all vision, mission and the strategy. ii. Provides the support for the strategic management. iii. Sometimes helps to deal with the situations that have a high degree of risk. iv. Is linked to the value added business processes. v. Supports the access to database. vi. Is very much result oriented in the nature. 3. Managerial / executive characteristics 34
  • 35. • Achievement of the various organizational objectives. Facilitates access to the information by integrating many sources of the data. • Facilitates broad, aggregated perspective and the context. • Offers broad highly aggregated information. • User’s productivity is also improved to a large extent. • Communication capability and the quality are increased. Advantages of EIS 35
  • 36. • Internal factors- accurate & reliable information, improve communications, use of historical data • External factors- increasing global competition, changing the business environment, government regulations. Factors affecting EIS
  • 37. DSS EIS • Used by professionals • Required for day to day operations • Deals both with semi & unstructured data • Consists only of internal information • Used by executives • Required for strategic plans and procedures • Deals only with unstructured data (which cannot be described in detail) • Consists of both internal & external information 37 Differences between DSS and EIS