Microsoft Naive Bayes Algorithm
Overview
Naive Bayes Algorithm
DMX Queries
Exploring a Naive Bayes Model
Naive Bayes Principles
Naive Bayes Parameters

Naive Bayes Algorithm
The Microsoft Naive Bayes algorithm is a classification algorithm provided by Microsoft SQL Server Analysis Services for use in predictive modeling.
The name Naive Bayes derives from the fact that the algorithm uses Bayes' theorem but does not take into account dependencies that may exist between attributes; its assumptions are therefore said to be naive.

How to use the Naive Bayes algorithm in SQL Server?
This algorithm is less computationally intensive than other Microsoft algorithms. It is therefore useful for quickly generating mining models to discover relationships between input columns and predictable columns.
The algorithm considers each pair of input attribute values and output attribute values. Exploring a Naive Bayes model will tell you how your attributes are related to each other.

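The pairwise counting described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the Analysis Services implementation; the training cases, the attribute names (Region, Commute, BikeBuyer) and the helper `pair_counts` are all hypothetical.

```python
from collections import Counter

# Hypothetical training cases (not taken from the slides).
cases = [
    {"Region": "Europe", "Commute": "Short", "BikeBuyer": "Yes"},
    {"Region": "Europe", "Commute": "Long", "BikeBuyer": "No"},
    {"Region": "Pacific", "Commute": "Short", "BikeBuyer": "Yes"},
    {"Region": "Pacific", "Commute": "Short", "BikeBuyer": "Yes"},
    {"Region": "Europe", "Commute": "Short", "BikeBuyer": "No"},
]


def pair_counts(cases, inputs, output):
    """Count each (input attribute, input value, output value) combination."""
    counts = Counter()
    for case in cases:
        for attr in inputs:
            counts[(attr, case[attr], case[output])] += 1
    return counts


counts = pair_counts(cases, ["Region", "Commute"], "BikeBuyer")
for key, n in sorted(counts.items()):
    print(key, n)
```

One pass over the data is enough to collect these counts, which is consistent with the algorithm being comparatively cheap to train.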
DMX
When you create a query against a data mining model, you can create either a content query, which provides details about the patterns discovered in analysis, or a prediction query, which uses the patterns in the model to make predictions for new data.
You can also retrieve metadata about the model by using a query against the data mining schema rowset.

DMX Queries
Getting Model Metadata by Using DMX
You can find metadata for the model by querying the data mining schema rowset. This might include when the model was created, when the model was last processed, the name of the mining structure that the model is based on, and the name of the column used as the predictable attribute.
    SELECT MODEL_CATALOG, MODEL_NAME, DATE_CREATED, LAST_PROCESSED,
           SERVICE_NAME, PREDICTION_ENTITY, FILTER
    FROM $system.DMSCHEMA_MINING_MODELS
    WHERE MODEL_NAME = 'TM_NaiveBayes_Filtered'

DMX Queries
Retrieving a Summary of Training Data
The following query retrieves the data from the marginal statistics node (NODE_TYPE = 26). Because the statistics are stored in a nested table, the FLATTENED keyword is used to make the results easier to view.
    SELECT FLATTENED MODEL_NAME,
        (SELECT ATTRIBUTE_NAME, ATTRIBUTE_VALUE, [SUPPORT], [PROBABILITY], VALUETYPE
         FROM NODE_DISTRIBUTION) AS t
    FROM TM_NaiveBayes.CONTENT
    WHERE NODE_TYPE = 26

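To make the effect of flattening a nested table concrete, here is a rough Python sketch. It is not how Analysis Services implements FLATTENED; the helper `flatten`, the sample values, and the `t.` prefix on the flattened column names are assumptions chosen for illustration.

```python
def flatten(rows, nested_key, alias):
    """Expand each row into one flat row per entry of its nested table."""
    flat = []
    for row in rows:
        # Columns outside the nested table are repeated on every flat row.
        outer = {k: v for k, v in row.items() if k != nested_key}
        for entry in row[nested_key]:
            flat_row = dict(outer)
            flat_row.update({f"{alias}.{k}": v for k, v in entry.items()})
            flat.append(flat_row)
    return flat


# Hypothetical content row with a nested NODE_DISTRIBUTION table.
nested = [
    {
        "MODEL_NAME": "TM_NaiveBayes",
        "NODE_DISTRIBUTION": [
            {"ATTRIBUTE_NAME": "Bike Buyer", "ATTRIBUTE_VALUE": "Yes", "SUPPORT": 40},
            {"ATTRIBUTE_NAME": "Bike Buyer", "ATTRIBUTE_VALUE": "No", "SUPPORT": 60},
        ],
    }
]

flat = flatten(nested, "NODE_DISTRIBUTION", "t")
for row in flat:
    print(row)
```

Each nested entry becomes its own row, with the outer MODEL_NAME value repeated alongside it.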
DMX Queries
Finding More Information about Attributes
This example shows how to return information from the model about a particular attribute (here, "Region"). The result of this query is shown on the next slide.
    SELECT NODE_TYPE, NODE_CAPTION, MSOLAP_NODE_SCORE
    FROM TM_NaiveBayes.CONTENT
    WHERE ATTRIBUTE_NAME = 'Region'

DMX Queries
[Figure: sample result showing information from the model about the attribute "Region"]

DMX Queries
The following query returns the importance scores of all attributes in the model, highest score first. The result of this query is shown on the next slide.
    SELECT NODE_CAPTION, MSOLAP_NODE_SCORE
    FROM TM_NaiveBayes.CONTENT
    WHERE NODE_TYPE = 10
    ORDER BY MSOLAP_NODE_SCORE DESC

DMX Queries
[Figure: result of the query that returns the importance scores of all attributes in the model]

Exploring a Naive Bayes Model
A convenient way to start analyzing a new data set is to create a Naive Bayes model and mark all the non-key columns as both input and predictable.
The content of each model is presented as a series of nodes. A node is an object within a mining model that contains metadata and information about a portion of the model. Nodes are arranged in a hierarchy.

Naive Bayes Model Content
Exploring a Naive Bayes Model
The Naive Bayes viewer is accessed through either BI Development Studio or SQL Server Management Studio by right-clicking the model and selecting Browse. SQL Server Data Mining provides four different views on Naive Bayes models:
Dependency Network
Provides a quick display of how all of the attributes in your model are related. Each node in the graph represents an attribute, and each edge represents a relationship. A node with an outgoing edge is predictive of the attribute in the node at the end of the edge. A node with an incoming edge is predicted by the other node.

Exploring a Naive Bayes Model
Attribute Profiles
Provides an exhaustive report of how each input attribute corresponds to each output attribute, one attribute at a time. At the top of the Attribute Profiles view, you select which output you want to look at, and the rest of the view shows how all of the input attributes are correlated to the states of the selected output attribute.

Exploring a Naive Bayes Model
Attribute Characteristics
This tab allows you to select an output attribute and value, and shows you a description of the cases where that attribute and value occur.
Attribute Discrimination
Answers one of the most interesting questions: what is the difference between X and Y? With this viewer, you choose the attribute you are interested in and select the states you want to compare.

Naive Bayes Principles
Bayesian methods use a combination of conditional and unconditional probabilities.
The "naive" part of Naive Bayes tells you to treat all of your input attributes as independent of each other with respect to the target variable.
This may be a faulty assumption, but it allows you to multiply your probabilities to determine the likelihood of each state.

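The multiplication step can be sketched in Python as follows. This is a minimal illustration under the independence assumption, not the Microsoft implementation; the function `naive_bayes_posterior`, the prior values and the conditional probabilities are all hypothetical.

```python
def naive_bayes_posterior(priors, likelihoods, evidence):
    """Multiply the prior by one conditional probability per input, then normalize."""
    scores = {}
    for state, prior in priors.items():
        score = prior
        for attr, value in evidence.items():
            # Independence assumption: each input contributes its own factor.
            score *= likelihoods[state][attr][value]
        scores[state] = score
    total = sum(scores.values())
    return {state: score / total for state, score in scores.items()}


# Hypothetical probabilities for a two-state target ("Yes" / "No").
priors = {"Yes": 0.5, "No": 0.5}
likelihoods = {
    "Yes": {
        "Region": {"Europe": 0.25, "Pacific": 0.75},
        "Commute": {"Short": 0.75, "Long": 0.25},
    },
    "No": {
        "Region": {"Europe": 0.75, "Pacific": 0.25},
        "Commute": {"Short": 0.25, "Long": 0.75},
    },
}

posterior = naive_bayes_posterior(
    priors, likelihoods, {"Region": "Pacific", "Commute": "Short"}
)
print(posterior)
```

With these made-up numbers the unnormalized scores are 0.28125 for "Yes" and 0.03125 for "No", so after normalization "Yes" receives about 0.9 of the probability mass.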
Naive Bayes Principles
Bayes' rule states that if you have a hypothesis H and evidence E about that hypothesis, then the probability of H given E is calculated using the following formula:
$$P(H \mid E) = \frac{P(E \mid H)\, P(H)}{P(E)}$$
This simply states that the probability of your hypothesis given the evidence is equal to the probability of the evidence given the hypothesis multiplied by the probability of the hypothesis, and then normalized by the probability of the evidence.

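As a worked illustration of the rule, take hypothetical values that do not come from the slides: P(H) = 0.3, P(E | H) = 0.8 and P(E | not H) = 0.2. The evidence probability P(E) is obtained by summing over both cases, and the posterior follows directly:

```latex
\begin{aligned}
P(E) &= P(E \mid H)\,P(H) + P(E \mid \neg H)\,P(\neg H) \\
     &= 0.8 \times 0.3 + 0.2 \times 0.7 = 0.38, \\
P(H \mid E) &= \frac{P(E \mid H)\,P(H)}{P(E)} = \frac{0.24}{0.38} \approx 0.63 .
\end{aligned}
```

Observing the evidence raises the probability of the hypothesis from 0.3 to roughly 0.63 in this example.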
Naive Bayes Parameters
MAXIMUM_INPUT_ATTRIBUTES determines the number of attributes that will be considered as inputs for training. If there are more than this number of inputs, the algorithm selects the most important inputs and ignores the rest. Setting this parameter to 0 causes the algorithm to consider all attributes. The default value is 255.
MAXIMUM_OUTPUT_ATTRIBUTES determines the number of attributes that will be considered as outputs for training. If there are more than this number of outputs, the algorithm selects the most important outputs and ignores the rest. Setting this parameter to 0 causes the algorithm to consider all attributes. The default value is 255.

Naive Bayes Parameters
MAXIMUM_STATES controls how many states of an attribute are considered. If an attribute has more than this number of states, only the most popular states are used. States that are not selected are treated as missing data. This parameter is useful when an attribute has a high cardinality.

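The effect of a state limit can be sketched in Python. This only illustrates the behavior described for MAXIMUM_STATES and is not the Analysis Services implementation; the helper `limit_states`, the sample city values and the limit of 2 are hypothetical, and missing data is represented here by `None`.

```python
from collections import Counter


def limit_states(values, maximum_states):
    """Keep the most frequent states; treat every other state as missing (None)."""
    keep = {state for state, _ in Counter(values).most_common(maximum_states)}
    return [value if value in keep else None for value in values]


# Hypothetical high-cardinality attribute, limited to its 2 most popular states.
cities = ["Paris", "Paris", "Lyon", "Paris", "Lyon", "Nice", "Lille"]
limited = limit_states(cities, 2)
print(limited)
```

Here "Paris" and "Lyon" survive, while the rarer "Nice" and "Lille" are mapped to missing.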
Summary
Naive Bayes Algorithm
DMX Queries
Naive Bayes Model Content
Exploring a Naive Bayes Model
Naive Bayes Principles
Naive Bayes Parameters

Visit more self-help tutorials
Pick a tutorial of your choice and browse through it at your own pace. The tutorials section is free and self-guided, and does not include any additional support. Visit us at www.dataminingtools.net
