SlideShare ist ein Scribd-Unternehmen logo
1 von 18
Predictive Analysis with SQL Server 2008<br />White Paper<br />Published: November 2007<br />Updated: July 2008<br />Summary: Microsoft SQL Server 2008 offers predictive analysis through a complete and intuitive set of data mining tools. Seamless integration with the Microsoft Business Intelligence platform provides rich insight at every step of the data lifecycle. Furthermore, the flexible platform empowers you to extend prediction into any application.<br />For the latest information, see Microsoft SQL Server 2008.<br />Contents<br /> TOC  quot;
1-2quot;
 Introduction PAGEREF _Toc205277543  1<br />Predictive Analysis for All Users PAGEREF _Toc205277544  2<br />Pervasive Delivery through Microsoft Office PAGEREF _Toc205277545  2<br />Comprehensive Development Environment PAGEREF _Toc205277546  4<br />Insight at Every Step of the Data Lifecycle PAGEREF _Toc205277547  8<br />Native Reporting Integration PAGEREF _Toc205277548  8<br />In-Flight Data Mining During Data Integration PAGEREF _Toc205277549  10<br />Insightful Analysis PAGEREF _Toc205277550  12<br />Predictive KPIs PAGEREF _Toc205277551  13<br />Data Mining Awareness in Every Application PAGEREF _Toc205277552  14<br />Predictive Programming PAGEREF _Toc205277553  14<br />Plug-In Algorithms and Custom Visualizations PAGEREF _Toc205277554  14<br />Conclusion PAGEREF _Toc205277555  15<br />Introduction<br />One of the most valuable assets of any company is the large volume of business data in various applications and systems throughout the organization. This data has the potential to provide previously unimagined insights into the business and to form a reliable basis for effective decision-making and accurate forecasting that can drive a company forward to success. Unfortunately, all too often the data is collected by the various computer systems and left dormant in isolated data stores. Some organizations may generate historical reports from this data, and some may even measure the company’s performance against key performance indicators (KPIs); but surprisingly few organizations realize the benefits of mining their historical data to detect patterns and trends, and even fewer embed predictive analysis into their day-to-day business processes to make decisions and predictions and to improve the overall agility of the company.<br />Over the past few releases, Microsoft has refined the reporting and analytical capabilities in Microsoft® SQL Server® to create a comprehensive Business Intelligence (BI) platform that can be integrated into everyday business activity and used effectively by employees throughout the organization instead of only by a few specialized analysts. Many organizations that previously would have found BI solutions too expensive or complex to implement are now taking advantage of the comprehensive report authoring, rendering, and delivery capabilities of SQL Server Reporting Services and the powerful online analytical processing (OLAP) services provided by SQL Server Analysis Services. The close integration between these BI server products and the ubiquitous Microsoft Office system has brought business analysis to the masses and promoted the evolution of a new kind of information worker who can gain a deeper insight into the business and operate more effectively.<br />While this proliferation of reporting and multidimensional analytics has greatly benefited many organizations of all sizes, the next step in promoting business agility and operational efficiency is to make the leap from retrospective analysis of historical data to proactive actions based on predictive analysis of business data, and to embed intelligent, fact-based decision-making into business processes. The key to accomplishing this is to use powerful data mining algorithms to analyze data sets, compare new data to historical facts and behaviors, identify classifications and relationships between business entities and attributes, and to deliver accurate predictive insights to all of the systems and users who make business decisions. As with OLAP technologies, data mining was once considered a highly specialized field that required expensive software and rare expertise to implement. However, by including comprehensive data mining technologies in SQL Server Analysis Services, and through integration with the 2007 Microsoft Office system, Microsoft has delivered a cost-effective solution that can extend the power of data mining to everyone and provide the insights that are critical to success while taking advantage of the enterprise-scale capabilities of SQL Server Analysis Services.<br />Predictive Analysis for All Users<br />A predictive analysis solution is most effective when it is pervasive throughout the organization and helps to drive day-to-day decisions across the business with its scale and enterprise-level performance. Furthermore, providing a way to implement comprehensive predictive analysis intuitively enables self-service data mining for users, which in turn enables the business to gain actionable insight promptly. The data mining technology in SQL Server 2008 meets these requirements through close integration with the 2007 Office system, a comprehensive development environment, enterprise-grade capabilities, and an extensible set of rich and innovative data mining algorithms that are designed to meet common business problems. <br />Pervasive Delivery through Microsoft Office<br />Traditionally, predictive analysis was limited to only a fraction of employees who were statistically trained experts. Microsoft SQL Server 2008 Data Mining Add-Ins for the 2007 Office System, shown in Figure 1, extend insight and prediction to a wider audience by enabling information workers to harness the highly sophisticated data mining technology within a familiar spreadsheet environment. The array of tools empowers users to inform everyday decisions in a few simple steps by providing prompt and actionable recommendations. The Table Analysis Tools for Microsoft Office Excel® 2007 hide the complexity of data mining behind intuitive tasks, delivering a seamless experience that enables users to transition easily between exploration and discovery. The Data Mining Client for Excel 2007 offers a complete data mining development lifecycle, which empowers advanced users with more information, validation, and control. Furthermore, the Data Mining Templates for Visio enable users to render annotatable graphical visualizations of the data mining models. Altogether, the integration between SQL Server 2008 data mining and the 2007 Office System provides a comprehensive, intuitive, and collaborative business ecosystem that extends the insight of predictive analysis to inform business decisions throughout the organization.<br />Figure 1: Data Mining Add-Ins for Microsoft Office Excel 2007<br />The Data Mining Add-Ins for the 2007 Office system delivers the following benefits:<br />Comprehensive: Provide a wide range of tools to fit many needs.Data Mining Add-Ins for the 2007 Office System are designed to offer a remarkably broad and reliable set of data mining tools. The availability of these tools at the desktop enables all users to explore data and discover hidden trends and relationships between products, customers, markets, employees, and other factors; empowering them to anticipate needs, understand behaviors and discover hidden opportunities that can improve business processes and directly impact profitability. <br />Intuitive: Deliver actionable insight to every user.Access to predictive analysis within the familiar Microsoft Office environment helps users to easily incorporate prediction into everyday processes. The automated tasks provided in the Table Analysis Tools for Excel 2007 deliver clear and actionable insights promptly, in three simple steps:<br />Define your data. Identify the data that is necessary to inform the solution and create a table in an Excel 2007 spreadsheet that defines the data to be analyzed.<br />Identify the task. Select the appropriate data mining task to perform on the data from the Data Mining or Table Analysis ribbon.<br />Get results. Examine the output from the task delivered through clear and intuitive visualizations directly in the Excel 2007 environment.<br />The automated tasks provided in the Data Mining Add-Ins for Excel 2007 include:<br />Analyze Key Influencers - Detects the key characteristics that influence a certain outcome. A detailed report that ranks the key influencers based on importance is generated, enabling users to compare key factors for each set of distinct values.<br />Detect Categories - Helps users to identify and segment data based on common properties. A detailed report describing the discovered categories is generated, enabling re-labeling of categories with meaningful naming for further analysis.<br />Fill From Example - Helps users to complete a partially populated column automatically based on patterns in the table. A report explaining the detected patterns is generated, enabling users to re-analyze the data and refine patterns as more knowledge is acquired.<br />Forecast - Enables users to predict future values based on trends in the data set. The forecast values are added to the original table and charts displaying past and forecast evolution of the series are generated.<br />Highlight Exceptions - Enables users to detect cases in the data set that include values outside the expected range. The rows containing the exceptions are highlighted and the actual column likely to cause the exception is emphasized.<br />Scenario Analysis: What If - Enables users to gain insight into the impact of a potential change that is applied to one value on other values of the data set.<br />Scenario Analysis: Goal Seeking - Enables users to better understand the underlying factors that need to be changed to achieve a desired value in a certain target column (complementary to the What-If tool).<br />Prediction Calculator - Related to the Analyze Key Influencers task, the Prediction Calculator generates an interactive form for scoring new cases. The influence of each attribute is translated into a set of scores. A summary of a combination of attributes, which apply to a new case, predicts probable future behaviors.<br />Shopping Basket Analysis - Enables users to detect the relationship between items frequently purchased together. A report explaining the relationships can provide a better understanding of the financial significance, providing insight into bundling offerings or improved product placement.<br />The easy to understand, graphical output from these tools provides a seamless transition between exploration and discovery, and empowers users with rich prediction and insight that clearly translates into recommendations and actions.<br />Collaborative: Share insights throughout the organization - Having performed predictive analysis in Excel 2007, users can use the powerful publishing tools of the 2007 Office System to share findings and inform business decisions throughout the organization. For example, users can share analysis through interactive graphical visualizations in Office Visio® 2007 diagrams, or they can share tables, reports, and diagrams through Microsoft Office SharePoint® Server 2007.<br />Comprehensive Development Environment<br />The 2007 Office System is an ideal desktop tool for information workers, but for BI developers who deploy solutions throughout the enterprise, SQL Server Business Intelligence Development Studio is the environment of choice because it has a project-based environment, complete with debugging and source control integration that you can use to create end-to-end BI solutions.<br />Of course, pervasive delivery of data mining functionality is only useful if developers can build data mining solutions that meet the needs of the business quickly and easily. SQL Server Business Intelligence Development Studio provides a comprehensive development environment that is based on the Microsoft Visual Studio® development system. With Business Intelligence Development Studio, developers can create data mining structures, which identify the tables and columns to be included in the analysis, and add multiple data mining models that apply data mining algorithms to the data in those tables. The Analysis Services project template in Business Intelligence Development Studio, shown in Figure 2, includes an intuitive Data Mining Designer for creating and viewing data mining models, and provides cross-validation, lift charts, and profit charts to compare and contrast the quality of models visually and through statistical scores of error and accuracy before deploying them. <br />Figure 2: Data Mining Designer in Business Intelligence Development Studio<br />SQL Server 2008 introduces a number of enhancements to the already comprehensive development environment of SQL Server 2005, including the ability to:<br />Split data into training and testing partitions more effectively. Partitioning is available within the process of creating the data mining model. Developers can identify a portion of the training dataset to be randomly selected for testing.<br />Build models over filtered data. Data filtering enables the creation of mining models that use subsets of data in a mining structure. Filtering provides flexibility for designing mining structures and data sources, because developers can create a single mining structure, based on a comprehensive data source view, and then apply filters to use only a part of that data for training and testing a variety of models, instead of building a different structure and related model for each subset of data. For example, a developer could define the data source view on the Customers table and related tables, build a single mining structure that includes all of the required fields, and then create a model that is filtered on a particular customer attribute, such as Region. The developer can then easily make a copy of that model, and change the filter condition to generate a new model based on a different region. By applying filters to data models, you can:<br />Create separate models for discrete values. For example, a clothing store might use customer demographics to build separate models by gender, even though the sales data comes from a single data source for all customers.<br />Experiment with models by creating and then testing multiple groupings of the same data, such as ages 20-30 versus ages 20-40 versus ages 20-25.<br />Specify complex filters on nested table contents, such as requiring that a case be included in the model only if the customer has purchased at least two of a particular item.<br />Build incompatible models within the same structure. Models using continuous or discretized versions of the same column can co-exist in a single structure with the new aliasing ability in the Mining Model Editor in Business Intelligence Development Studio.<br />Test multiple models simultaneously with cross-validation. The models created by data mining algorithms have various applications that require different accuracy and stability measurements. Depending on the application, users demand these measurements. Additionally these measurements assist in ensuring that various settings result in the best model for a current data set and a given application. SQL Server 2008 offers a robust cross-validation feature that can test all of the models in a structure simultaneously by using a folding technique. This enables users to test a variety of settings on a subset of data before committing to an expensive processing step. Cross-validation results also tell users if the model results are stable or if the results would change given more or less data. Figure 3 shows a cross-validation report in the Data Mining Designer.<br />Figure 3: Cross-validation<br />Enterprise-Grade Capabilities<br />SQL Server Predictive Analysis is part of SQL Server Analysis Services, which provides enterprise-class server advantages: rapid development, high availability, superior performance and scalability, robust security, and enhanced manageability through SQL Server Management Studio. This enterprise-level capability means that the data mining technologies enabling predictive analysis can grow with the business and provide a high performance, scalable solution for any size of organization.<br />Rich and Innovative Algorithms<br />Different businesses have different goals and need to make different decisions. For this reason, any data mining technology must support a comprehensive set of capabilities and algorithms to meet a diverse range of business needs. SQL Server 2008 Analysis Services includes data mining technologies that support many rich and innovative algorithms, most of them designed by Microsoft Research to solve common business problems. Additionally, the data mining technologies of SQL Server Analysis Services are extensible, enabling you to add plug-in algorithms that meet uncommon analytical needs that are more specific to an individual business. The following table shows some of the tasks that SQL Server data mining can be used to perform.<br />Data Mining Tasks<br />TaskDescriptionAlgorithmsMarket Basket AnalysisDiscover items sold together to create recommendations on-the-fly and to determine how product placement can directly contribute to your bottom line.Association Decision Trees Churn AnalysisAnticipate customers who may be considering canceling their service and identify the benefits that will keep them from leaving.Decision TreesLinear RegressionLogistic RegressionMarket AnalysisDefine market segments by automatically grouping similar customers together. Use these segments to seek profitable customers.Clustering Sequence Clustering ForecastingPredict sales and inventory amounts and learn how they are interrelated to foresee bottlenecks and improve performance.Decision Trees Time Series Data ExplorationAnalyze profitability across customers, or compare customers that prefer different brands of the same product to discover new opportunities.Neural NetworkUnsupervised LearningIdentify previously unknown relationships between various elements of your business to inform your decisions.Neural NetworkWeb Site AnalysisUnderstand how people use your Web site and group similar usage patterns to offer a better experience.Sequence Clustering Campaign AnalysisSpend marketing funds more effectively by targeting the customers most likely to respond to a promotion.Decision Trees Naïve Bayes Clustering Information QualityIdentify and handle anomalies during data entry or data loading to improve the quality of information.Linear RegressionLogistic RegressionText AnalysisAnalyze feedback to find common themes and trends that concern your customers or employees, informing decisions with unstructured input.Text Mining<br />Insight at Every Step of the Data Lifecycle<br />Whether consuming, analyzing, monitoring, planning, exploring, or reporting on business data, predictive analysis can add rich insight to expose new avenues for growth. SQL Server 2008 is part of a family of business intelligence technologies, all working together to deliver a comprehensive platform that enables organizations to incorporate predictive analysis into every stage of the data life cycle.<br />Native Reporting Integration<br />Reporting is a fundamental activity in most businesses, and SQL Server 2008 Reporting Services provides a comprehensive solution for creating, rendering, and deploying reports throughout the enterprise. SQL Server Reporting Services can render reports directly from a data mining model by using a data mining extensions (DMX) query. This enables users to visualize the content of data mining models for optimized data representation. Furthermore, the ability to query directly against the data mining structure enables users to easily include attributes beyond the scope of the mining model requirements, presenting complete and meaningful information. Figure 4 shows the DMX query editor for Reporting Services.<br />Figure 4: The DMX query editor for SQL Server Reporting Services<br />SQL Server Reporting Services provides the ability to generate parameter-driven reports based on predictive probability. For example, the query shown in Figure 4 analyzes a list of prospective customers for the hypothetical Adventure Works cycle company and uses a data mining model to assess the probability of those customers buying a bicycle. The query is filtered to return only prospects that are more than 50% likely to make a purchase. Figure 5 shows the resulting report, which the company could use as the basis for a marketing campaign that targets only the customers most likely to make a purchase, significantly improving the effectiveness of the campaign and its return on investment. <br />Figure 5: A predictive analysis report<br />In-Flight Data Mining During Data Integration<br />As Business Intelligence becomes more pervasive, businesses are increasingly implementing extract, transform, and load (ETL) solutions to consolidate data from around the organization into a data warehouse for reporting and analysis. However, the source data for these operations can often be incomplete, or in some cases business entities, such as customers, might need to be classified into categories based on common profile characteristics. <br />Microsoft SQL Server 2008 Integration Services provides a powerful, extensible ETL platform that Business Intelligence solution developers can use to implement ETL operations that cleanse and transform data in-flight. SQL Server Integration Services includes a Data Mining Model Training destination for training data mining models, and a Data Mining Query transformation that can be used to perform predictive analysis on data as it is passed through the data flow. Integrating predictive analysis with SQL Server Integration Services enables organizations to flag unusual data, classify business entities, perform text mining, and fill-in missing values on the fly based on the power and insight of the data mining algorithms. For example, an ETL process might extract customer data from one or more source systems for inclusion in a data warehouse. Traditionally, data mining would be used after the data warehouse is loaded, to classify customers for predicted purchasing behavior or other campaign management tasks. However, with SQL Server Integration Services, the Data Mining Query Transformation can apply a data mining model during the ETL process, resulting in a data warehouse that is populated with classified data at load time. This reduces the work that must be done on the warehouse server, and ensures that the data available for analysis is always up-to-date and consistently classified. Moreover, classification during the ETL process may also be used to filter out customer records that do not fit any known classification. These records may be the result of poor data quality, or may represent a new classification not yet captured in the campaign management process. In either case, SQL Server Integration Services can detect these records by using data mining and redirect them for manual or automated review. <br />Figure 6 shows a SQL Server Integration Services data flow that includes a Data Mining Query transformation.<br />Figure 6: Data mining in SQL Server Integration Services<br />Insightful Analysis<br />SQL Server 2008 Analysis Services provides a highly scalable platform for multidimensional OLAP analysis. Many customers are already reaping the benefits of creating a unified dimensional model (UDM) in Analysis Services and using it to slice and dice business measures by multiple dimensions. Predictive analysis, being part of SQL Server 2008 Analysis Services provides a richer OLAP experience, featuring data mining dimensions that slice your data by the hidden patterns within. For example, a sales and marketing department can create a data mining structure that is based on an existing Customer OLAP dimension and use it to classify customers into clusters that exhibit similar characteristics. They can then use that data mining structure to generate a new data mining dimension and use it to analyze sales information based on the customer clusters that have been identified. Figure 7 shows a data mining dimension in an OLAP cube.<br />Figure 7: A data mining dimension in an OLAP cube<br />In addition to incorporating the results of data mining into OLAP dimensions, SQL Server 2008 enables you to incorporate predictive functions based on data mining models into calculations and KPIs.<br />Predictive KPIs<br />Many businesses use KPIs to evaluate critical business metrics against targets. SQL Server 2008 Analysis Services provides a centralized platform for KPIs across the organization, and integration with Microsoft Office PerformancePoint® Server 2007 enables decision makers to build business dashboards from which they can monitor the company’s performance. KPIs are traditionally retrospective, for example showing last month’s sales total compared to the sales target. However, with the insights made possible through data mining, organizations can build predictive KPIs that forecast future performance against targets, giving the business an opportunity to detect and resolve potential problems proactively. Figure 8 shows a KPI that displays the anticipated number of orders that are predicted to be placed.<br />Figure 8: Microsoft Office PerformancePoint Server 2007<br />Additionally, predictive analysis can detect attributes that influence KPIs. Together with Office PerformancePoint Server 2007, users can monitor trends in key influencers to recognize those attributes that have a sustained effect, for example identifying whether price discount on a competing product has a lasting impact on sales or only generates a short-term interference. Such insights enable businesses to inform and improve their response strategy. <br />Data Mining Awareness in Every Application<br />As you have seen in this whitepaper so far, SQL Server 2008 provides a comprehensive data mining solution, and the tight integration with the Microsoft Business Intelligence platform makes it easy to provide predictive analysis to users and automated processes across the enterprise. However, there may still be occasions where organizations need to embed data mining functionality into an application, to introduce intelligence into an existing business process, or to extend data mining technologies to meet a specific business problem. For this purpose, SQL Server offers a flexible and extensible programming platform for seamlessly incorporating prediction and insight into line-of-business applications.<br />Predictive Programming<br />SQL Server 2008 data mining supports a number of application programming interfaces (APIs) that developers can use to build custom solutions that take advantage of the predictive analysis capabilities in SQL Server. DMX, XMLA, OLEDB and ADOMD.NET, and Analysis Management Objects (AMO) offer a rich, fully documented development platform, empowering developers to build data mining aware applications and providing real-time discovery and recommendation through familiar tools. <br />This extensibility creates an opportunity for business organizations and independent software vendors (ISVs) to embed predictive analysis into line-of-business applications, introducing insight and forecasting that inform business decisions and processes. For example, the Analytics Foundation adds predictive scoring to Microsoft Dynamics® CRM, to enable information workers across sales, marketing, and service organizations to identify attainable opportunities that are more likely to lead to a sale, increasing efficiency and improving productivity (for more information, see the Microsoft Dynamics site).<br />Plug-In Algorithms and Custom Visualizations<br />The SQL Server data mining toolset is fully extensible through Microsoft .NET–stored procedures, plug-in algorithms, custom visualizations and PMML. This enables developers to extend the out-of-the-box data mining technologies of SQL Server 2008 to meet uncommon business needs that are specific to the organization by:<br />Creating custom data mining algorithms to solve business-specific analytical problems.<br />Using data mining algorithms from other software vendors.<br />Creating custom visualizations of data mining models through plug-in viewer APIs.<br />Conclusion <br />SQL Server 2008 Analysis Services provides a complete data mining platform that organizations can use to infuse insight and prediction into everyday business decisions. Pervasive delivery through the Data Mining Add-Ins for the 2007 Office system delivers predictive analysis capabilities with intuitive tools and clear results that are available throughout the enterprise at the desktop. The comprehensive development environment and extensible range of innovative data mining algorithms combined with the enterprise-level scalability and manageability of SQL Server Analysis Services makes SQL Server 2008 an ideal way to bring the benefits of predictive analysis to your business.<br />Because the predictive analysis capabilities of SQL Server 2008, as part of the Microsoft BI platform, are closely integrated into every stage of the data life cycle, they incorporate intelligence into reporting, data integration, OLAP analysis, and business performance monitoring. This helps organizations increase business agility and creates a tangible competitive advantage.<br />Although the data mining functionality provided with SQL Server 2008 is comprehensive enough to meet the needs of a wide range of business scenarios, its extensibility ensures that it can be used to solve virtually any predictive problem. The ability to extend the data mining technologies of SQL Server through custom algorithms and visualizations, together with the ability to embed predictive functionality into line-of-business applications makes SQL Server 2008 a powerful platform for introducing predictive analysis into existing business processes to add insight and recommendations into everyday operations.<br />For more information:<br />Microsoft SQL Server 2008http://www.microsoft.com/sqlserver/2008/en/us/default.aspx<br />SQL Server Developer Centerhttp://msdn2.microsoft.com/sqlserver<br />SQL Server TechCenterhttp://technet.microsoft.com/sqlserver<br />Please give us your feedback:<br />Did this paper help you? Tell us on a scale of 1 (poor) to 5 (excellent), how would you rate this paper and why have you given it this rating? For example:<br />Are you giving it a high rating because it has good examples, excellent screenshots, clear writing, or another reason? <br />Are you giving it a low rating because it has poor examples, fuzzy screenshots, unclear writing?<br />This feedback will help us improve the quality of white papers we release. Send feedback.<br />
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview
Sql server 2008 r2 data mining whitepaper overview

Weitere ähnliche Inhalte

Was ist angesagt?

Resume - Stuart Arnold
Resume - Stuart ArnoldResume - Stuart Arnold
Resume - Stuart ArnoldStuart Arnold
 
SSIS_SSAS_SSRS_SP_PPS_HongBingLi
SSIS_SSAS_SSRS_SP_PPS_HongBingLiSSIS_SSAS_SSRS_SP_PPS_HongBingLi
SSIS_SSAS_SSRS_SP_PPS_HongBingLiHong-Bing Li
 
Ssis sql ssrs_sp_ssas_mdx_hb_li
Ssis sql ssrs_sp_ssas_mdx_hb_liSsis sql ssrs_sp_ssas_mdx_hb_li
Ssis sql ssrs_sp_ssas_mdx_hb_liHong-Bing Li
 
Tableau interview questions www.bigclasses.com
Tableau interview questions www.bigclasses.comTableau interview questions www.bigclasses.com
Tableau interview questions www.bigclasses.combigclasses.com
 
Microstrategy for Data Engineers
Microstrategy for Data EngineersMicrostrategy for Data Engineers
Microstrategy for Data EngineersFrancesco Mucio
 
Business Intelligence for users - Sharperlight
Business Intelligence for users - SharperlightBusiness Intelligence for users - Sharperlight
Business Intelligence for users - SharperlightMichell8240
 
Enabling Governed Data Access with Tableau Data Server
Enabling Governed Data Access with Tableau Data Server Enabling Governed Data Access with Tableau Data Server
Enabling Governed Data Access with Tableau Data Server Tableau Software
 
Office 365 Saturday Europe - Self-Service Business Intelligence with Power BI
Office 365 Saturday Europe - Self-Service Business Intelligence with Power BIOffice 365 Saturday Europe - Self-Service Business Intelligence with Power BI
Office 365 Saturday Europe - Self-Service Business Intelligence with Power BIMarius Constantinescu [MVP]
 
Daniel Bowlin Portfolio Rev1
Daniel Bowlin Portfolio Rev1Daniel Bowlin Portfolio Rev1
Daniel Bowlin Portfolio Rev1DanielWBowlin
 
Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy snehal parikh
 
Tableau interview questions
Tableau interview questionsTableau interview questions
Tableau interview questionsbarbie0909
 
Kevin Fahy Bi Portfolio
Kevin Fahy   Bi PortfolioKevin Fahy   Bi Portfolio
Kevin Fahy Bi PortfolioKevinPFahy
 
Tony Von Gusmann & MS BI
Tony Von Gusmann & MS BITony Von Gusmann & MS BI
Tony Von Gusmann & MS BIvongusmann
 
Business Intelligence Project Portfolio
Business Intelligence Project PortfolioBusiness Intelligence Project Portfolio
Business Intelligence Project Portfoliodmrasek
 
Rahul_Resume
Rahul_ResumeRahul_Resume
Rahul_ResumeRahul R
 

Was ist angesagt? (20)

Resume - Stuart Arnold
Resume - Stuart ArnoldResume - Stuart Arnold
Resume - Stuart Arnold
 
SSIS_SSAS_SSRS_SP_PPS_HongBingLi
SSIS_SSAS_SSRS_SP_PPS_HongBingLiSSIS_SSAS_SSRS_SP_PPS_HongBingLi
SSIS_SSAS_SSRS_SP_PPS_HongBingLi
 
Microstrategy
MicrostrategyMicrostrategy
Microstrategy
 
Ssis sql ssrs_sp_ssas_mdx_hb_li
Ssis sql ssrs_sp_ssas_mdx_hb_liSsis sql ssrs_sp_ssas_mdx_hb_li
Ssis sql ssrs_sp_ssas_mdx_hb_li
 
Msbi Architecture
Msbi ArchitectureMsbi Architecture
Msbi Architecture
 
Tableau interview questions www.bigclasses.com
Tableau interview questions www.bigclasses.comTableau interview questions www.bigclasses.com
Tableau interview questions www.bigclasses.com
 
IntelligentEnterprise
IntelligentEnterpriseIntelligentEnterprise
IntelligentEnterprise
 
Microstrategy for Data Engineers
Microstrategy for Data EngineersMicrostrategy for Data Engineers
Microstrategy for Data Engineers
 
Business Intelligence for users - Sharperlight
Business Intelligence for users - SharperlightBusiness Intelligence for users - Sharperlight
Business Intelligence for users - Sharperlight
 
Enabling Governed Data Access with Tableau Data Server
Enabling Governed Data Access with Tableau Data Server Enabling Governed Data Access with Tableau Data Server
Enabling Governed Data Access with Tableau Data Server
 
Office 365 Saturday Europe - Self-Service Business Intelligence with Power BI
Office 365 Saturday Europe - Self-Service Business Intelligence with Power BIOffice 365 Saturday Europe - Self-Service Business Intelligence with Power BI
Office 365 Saturday Europe - Self-Service Business Intelligence with Power BI
 
Daniel Bowlin Portfolio Rev1
Daniel Bowlin Portfolio Rev1Daniel Bowlin Portfolio Rev1
Daniel Bowlin Portfolio Rev1
 
Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy Hadoop Integration with Microstrategy
Hadoop Integration with Microstrategy
 
Tableau interview questions
Tableau interview questionsTableau interview questions
Tableau interview questions
 
Kevin Fahy Bi Portfolio
Kevin Fahy   Bi PortfolioKevin Fahy   Bi Portfolio
Kevin Fahy Bi Portfolio
 
Tony Von Gusmann & MS BI
Tony Von Gusmann & MS BITony Von Gusmann & MS BI
Tony Von Gusmann & MS BI
 
Bo df b_layer
Bo df b_layerBo df b_layer
Bo df b_layer
 
Business Intelligence Project Portfolio
Business Intelligence Project PortfolioBusiness Intelligence Project Portfolio
Business Intelligence Project Portfolio
 
Power Bi Basics
Power Bi BasicsPower Bi Basics
Power Bi Basics
 
Rahul_Resume
Rahul_ResumeRahul_Resume
Rahul_Resume
 

Ähnlich wie Sql server 2008 r2 data mining whitepaper overview

Sql server 2008 r2 predictive analysis data sheet
Sql server 2008 r2 predictive analysis data sheetSql server 2008 r2 predictive analysis data sheet
Sql server 2008 r2 predictive analysis data sheetKlaudiia Jacome
 
SQL Server 2005 Everywhere Edition Value Proposition
SQL Server 2005 Everywhere Edition Value PropositionSQL Server 2005 Everywhere Edition Value Proposition
SQL Server 2005 Everywhere Edition Value Propositionbutest
 
Intro of Key Features of SoftCAAT BI Software
Intro of Key Features of SoftCAAT BI SoftwareIntro of Key Features of SoftCAAT BI Software
Intro of Key Features of SoftCAAT BI Softwarerafeq
 
Sql server 2008 r2 analysis services overview whitepaper
Sql server 2008 r2 analysis services overview whitepaperSql server 2008 r2 analysis services overview whitepaper
Sql server 2008 r2 analysis services overview whitepaperKlaudiia Jacome
 
Introducing microsoft bi tools
Introducing  microsoft bi  toolsIntroducing  microsoft bi  tools
Introducing microsoft bi toolsCMR WORLD TECH
 
How Can Business Analytics Dashboard Help Data Analysts.pdf
How Can Business Analytics Dashboard Help Data Analysts.pdfHow Can Business Analytics Dashboard Help Data Analysts.pdf
How Can Business Analytics Dashboard Help Data Analysts.pdfGrow
 
ow Do Data Analysis Tools Make Data Preparation Easier?
ow Do Data Analysis Tools Make Data Preparation Easier?ow Do Data Analysis Tools Make Data Preparation Easier?
ow Do Data Analysis Tools Make Data Preparation Easier?Grow
 
Business Intelligence for media datasheetfinal
Business Intelligence for media datasheetfinalBusiness Intelligence for media datasheetfinal
Business Intelligence for media datasheetfinalBinary Vintage
 
Bi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkBi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkSlava Kokaev
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & InsightsLoQutus
 
Ankit Patel - Tableau Developer
Ankit Patel - Tableau DeveloperAnkit Patel - Tableau Developer
Ankit Patel - Tableau DeveloperAnkit Patel
 
Numerify IT Service Analytics for ServiceNow
Numerify IT Service Analytics for ServiceNowNumerify IT Service Analytics for ServiceNow
Numerify IT Service Analytics for ServiceNowNumerify
 
It7113 research project - group 7
It7113   research project - group 7It7113   research project - group 7
It7113 research project - group 7Hiren Patel
 

Ähnlich wie Sql server 2008 r2 data mining whitepaper overview (20)

Sql server 2008 r2 predictive analysis data sheet
Sql server 2008 r2 predictive analysis data sheetSql server 2008 r2 predictive analysis data sheet
Sql server 2008 r2 predictive analysis data sheet
 
REPORT ON (1)
REPORT ON (1)REPORT ON (1)
REPORT ON (1)
 
MEC Data sheet
MEC Data sheetMEC Data sheet
MEC Data sheet
 
SQL Server 2005 Everywhere Edition Value Proposition
SQL Server 2005 Everywhere Edition Value PropositionSQL Server 2005 Everywhere Edition Value Proposition
SQL Server 2005 Everywhere Edition Value Proposition
 
Spreadsheet server
Spreadsheet serverSpreadsheet server
Spreadsheet server
 
Intro of Key Features of SoftCAAT BI Software
Intro of Key Features of SoftCAAT BI SoftwareIntro of Key Features of SoftCAAT BI Software
Intro of Key Features of SoftCAAT BI Software
 
Report PPT
Report PPTReport PPT
Report PPT
 
Sql server 2008 r2 analysis services overview whitepaper
Sql server 2008 r2 analysis services overview whitepaperSql server 2008 r2 analysis services overview whitepaper
Sql server 2008 r2 analysis services overview whitepaper
 
Introducing microsoft bi tools
Introducing  microsoft bi  toolsIntroducing  microsoft bi  tools
Introducing microsoft bi tools
 
IBM Planning Analytics
IBM Planning AnalyticsIBM Planning Analytics
IBM Planning Analytics
 
How Can Business Analytics Dashboard Help Data Analysts.pdf
How Can Business Analytics Dashboard Help Data Analysts.pdfHow Can Business Analytics Dashboard Help Data Analysts.pdf
How Can Business Analytics Dashboard Help Data Analysts.pdf
 
ow Do Data Analysis Tools Make Data Preparation Easier?
ow Do Data Analysis Tools Make Data Preparation Easier?ow Do Data Analysis Tools Make Data Preparation Easier?
ow Do Data Analysis Tools Make Data Preparation Easier?
 
Business Intelligence for media datasheetfinal
Business Intelligence for media datasheetfinalBusiness Intelligence for media datasheetfinal
Business Intelligence for media datasheetfinal
 
Bi Architecture And Conceptual Framework
Bi Architecture And Conceptual FrameworkBi Architecture And Conceptual Framework
Bi Architecture And Conceptual Framework
 
Offers bank dss
Offers bank dssOffers bank dss
Offers bank dss
 
Scaling up your Analytics & Insights
Scaling up your Analytics & InsightsScaling up your Analytics & Insights
Scaling up your Analytics & Insights
 
MuthulakshmiRajendran
MuthulakshmiRajendranMuthulakshmiRajendran
MuthulakshmiRajendran
 
Ankit Patel - Tableau Developer
Ankit Patel - Tableau DeveloperAnkit Patel - Tableau Developer
Ankit Patel - Tableau Developer
 
Numerify IT Service Analytics for ServiceNow
Numerify IT Service Analytics for ServiceNowNumerify IT Service Analytics for ServiceNow
Numerify IT Service Analytics for ServiceNow
 
It7113 research project - group 7
It7113   research project - group 7It7113   research project - group 7
It7113 research project - group 7
 

Mehr von Klaudiia Jacome

Aoutsourcing para capitulo 7
Aoutsourcing para capitulo 7Aoutsourcing para capitulo 7
Aoutsourcing para capitulo 7Klaudiia Jacome
 
Applicationandmulti instances
Applicationandmulti instancesApplicationandmulti instances
Applicationandmulti instancesKlaudiia Jacome
 
Sql server2008 r2_mds_datasheet
Sql server2008 r2_mds_datasheetSql server2008 r2_mds_datasheet
Sql server2008 r2_mds_datasheetKlaudiia Jacome
 
Microsoft sql server 2008 r2 business intelligence
Microsoft sql server 2008 r2 business intelligenceMicrosoft sql server 2008 r2 business intelligence
Microsoft sql server 2008 r2 business intelligenceKlaudiia Jacome
 
Introduction to master data services
Introduction to master data servicesIntroduction to master data services
Introduction to master data servicesKlaudiia Jacome
 
Sql server2008 r2_bi_datasheet_final
Sql server2008 r2_bi_datasheet_finalSql server2008 r2_bi_datasheet_final
Sql server2008 r2_bi_datasheet_finalKlaudiia Jacome
 
Sql server 2008 business intelligence tdm deck
Sql server 2008 business intelligence tdm deckSql server 2008 business intelligence tdm deck
Sql server 2008 business intelligence tdm deckKlaudiia Jacome
 
Microsoft sql server 2008 r2 business intelligence
Microsoft sql server 2008 r2 business intelligenceMicrosoft sql server 2008 r2 business intelligence
Microsoft sql server 2008 r2 business intelligenceKlaudiia Jacome
 

Mehr von Klaudiia Jacome (20)

Aoutsourcing para capitulo 7
Aoutsourcing para capitulo 7Aoutsourcing para capitulo 7
Aoutsourcing para capitulo 7
 
Si las cosas van mal
Si las cosas van malSi las cosas van mal
Si las cosas van mal
 
Analysis services
Analysis  servicesAnalysis  services
Analysis services
 
Enterprise security
Enterprise securityEnterprise security
Enterprise security
 
Performance
PerformancePerformance
Performance
 
Performance
PerformancePerformance
Performance
 
Enterprise security
Enterprise securityEnterprise security
Enterprise security
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Managemen tools
Managemen toolsManagemen tools
Managemen tools
 
Managemen tolos
Managemen tolosManagemen tolos
Managemen tolos
 
Datos espaciales
Datos espacialesDatos espaciales
Datos espaciales
 
Data warehouse
Data warehouseData warehouse
Data warehouse
 
Avances analticos
Avances analticosAvances analticos
Avances analticos
 
Applicationandmulti instances
Applicationandmulti instancesApplicationandmulti instances
Applicationandmulti instances
 
Sql server2008 r2_mds_datasheet
Sql server2008 r2_mds_datasheetSql server2008 r2_mds_datasheet
Sql server2008 r2_mds_datasheet
 
Microsoft sql server 2008 r2 business intelligence
Microsoft sql server 2008 r2 business intelligenceMicrosoft sql server 2008 r2 business intelligence
Microsoft sql server 2008 r2 business intelligence
 
Introduction to master data services
Introduction to master data servicesIntroduction to master data services
Introduction to master data services
 
Sql server2008 r2_bi_datasheet_final
Sql server2008 r2_bi_datasheet_finalSql server2008 r2_bi_datasheet_final
Sql server2008 r2_bi_datasheet_final
 
Sql server 2008 business intelligence tdm deck
Sql server 2008 business intelligence tdm deckSql server 2008 business intelligence tdm deck
Sql server 2008 business intelligence tdm deck
 
Microsoft sql server 2008 r2 business intelligence
Microsoft sql server 2008 r2 business intelligenceMicrosoft sql server 2008 r2 business intelligence
Microsoft sql server 2008 r2 business intelligence
 

Kürzlich hochgeladen

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 

Kürzlich hochgeladen (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 

Sql server 2008 r2 data mining whitepaper overview

  • 1. Predictive Analysis with SQL Server 2008<br />White Paper<br />Published: November 2007<br />Updated: July 2008<br />Summary: Microsoft SQL Server 2008 offers predictive analysis through a complete and intuitive set of data mining tools. Seamless integration with the Microsoft Business Intelligence platform provides rich insight at every step of the data lifecycle. Furthermore, the flexible platform empowers you to extend prediction into any application.<br />For the latest information, see Microsoft SQL Server 2008.<br />Contents<br /> TOC quot; 1-2quot; Introduction PAGEREF _Toc205277543 1<br />Predictive Analysis for All Users PAGEREF _Toc205277544 2<br />Pervasive Delivery through Microsoft Office PAGEREF _Toc205277545 2<br />Comprehensive Development Environment PAGEREF _Toc205277546 4<br />Insight at Every Step of the Data Lifecycle PAGEREF _Toc205277547 8<br />Native Reporting Integration PAGEREF _Toc205277548 8<br />In-Flight Data Mining During Data Integration PAGEREF _Toc205277549 10<br />Insightful Analysis PAGEREF _Toc205277550 12<br />Predictive KPIs PAGEREF _Toc205277551 13<br />Data Mining Awareness in Every Application PAGEREF _Toc205277552 14<br />Predictive Programming PAGEREF _Toc205277553 14<br />Plug-In Algorithms and Custom Visualizations PAGEREF _Toc205277554 14<br />Conclusion PAGEREF _Toc205277555 15<br />Introduction<br />One of the most valuable assets of any company is the large volume of business data in various applications and systems throughout the organization. This data has the potential to provide previously unimagined insights into the business and to form a reliable basis for effective decision-making and accurate forecasting that can drive a company forward to success. Unfortunately, all too often the data is collected by the various computer systems and left dormant in isolated data stores. Some organizations may generate historical reports from this data, and some may even measure the company’s performance against key performance indicators (KPIs); but surprisingly few organizations realize the benefits of mining their historical data to detect patterns and trends, and even fewer embed predictive analysis into their day-to-day business processes to make decisions and predictions and to improve the overall agility of the company.<br />Over the past few releases, Microsoft has refined the reporting and analytical capabilities in Microsoft® SQL Server® to create a comprehensive Business Intelligence (BI) platform that can be integrated into everyday business activity and used effectively by employees throughout the organization instead of only by a few specialized analysts. Many organizations that previously would have found BI solutions too expensive or complex to implement are now taking advantage of the comprehensive report authoring, rendering, and delivery capabilities of SQL Server Reporting Services and the powerful online analytical processing (OLAP) services provided by SQL Server Analysis Services. The close integration between these BI server products and the ubiquitous Microsoft Office system has brought business analysis to the masses and promoted the evolution of a new kind of information worker who can gain a deeper insight into the business and operate more effectively.<br />While this proliferation of reporting and multidimensional analytics has greatly benefited many organizations of all sizes, the next step in promoting business agility and operational efficiency is to make the leap from retrospective analysis of historical data to proactive actions based on predictive analysis of business data, and to embed intelligent, fact-based decision-making into business processes. The key to accomplishing this is to use powerful data mining algorithms to analyze data sets, compare new data to historical facts and behaviors, identify classifications and relationships between business entities and attributes, and to deliver accurate predictive insights to all of the systems and users who make business decisions. As with OLAP technologies, data mining was once considered a highly specialized field that required expensive software and rare expertise to implement. However, by including comprehensive data mining technologies in SQL Server Analysis Services, and through integration with the 2007 Microsoft Office system, Microsoft has delivered a cost-effective solution that can extend the power of data mining to everyone and provide the insights that are critical to success while taking advantage of the enterprise-scale capabilities of SQL Server Analysis Services.<br />Predictive Analysis for All Users<br />A predictive analysis solution is most effective when it is pervasive throughout the organization and helps to drive day-to-day decisions across the business with its scale and enterprise-level performance. Furthermore, providing a way to implement comprehensive predictive analysis intuitively enables self-service data mining for users, which in turn enables the business to gain actionable insight promptly. The data mining technology in SQL Server 2008 meets these requirements through close integration with the 2007 Office system, a comprehensive development environment, enterprise-grade capabilities, and an extensible set of rich and innovative data mining algorithms that are designed to meet common business problems. <br />Pervasive Delivery through Microsoft Office<br />Traditionally, predictive analysis was limited to only a fraction of employees who were statistically trained experts. Microsoft SQL Server 2008 Data Mining Add-Ins for the 2007 Office System, shown in Figure 1, extend insight and prediction to a wider audience by enabling information workers to harness the highly sophisticated data mining technology within a familiar spreadsheet environment. The array of tools empowers users to inform everyday decisions in a few simple steps by providing prompt and actionable recommendations. The Table Analysis Tools for Microsoft Office Excel® 2007 hide the complexity of data mining behind intuitive tasks, delivering a seamless experience that enables users to transition easily between exploration and discovery. The Data Mining Client for Excel 2007 offers a complete data mining development lifecycle, which empowers advanced users with more information, validation, and control. Furthermore, the Data Mining Templates for Visio enable users to render annotatable graphical visualizations of the data mining models. Altogether, the integration between SQL Server 2008 data mining and the 2007 Office System provides a comprehensive, intuitive, and collaborative business ecosystem that extends the insight of predictive analysis to inform business decisions throughout the organization.<br />Figure 1: Data Mining Add-Ins for Microsoft Office Excel 2007<br />The Data Mining Add-Ins for the 2007 Office system delivers the following benefits:<br />Comprehensive: Provide a wide range of tools to fit many needs.Data Mining Add-Ins for the 2007 Office System are designed to offer a remarkably broad and reliable set of data mining tools. The availability of these tools at the desktop enables all users to explore data and discover hidden trends and relationships between products, customers, markets, employees, and other factors; empowering them to anticipate needs, understand behaviors and discover hidden opportunities that can improve business processes and directly impact profitability. <br />Intuitive: Deliver actionable insight to every user.Access to predictive analysis within the familiar Microsoft Office environment helps users to easily incorporate prediction into everyday processes. The automated tasks provided in the Table Analysis Tools for Excel 2007 deliver clear and actionable insights promptly, in three simple steps:<br />Define your data. Identify the data that is necessary to inform the solution and create a table in an Excel 2007 spreadsheet that defines the data to be analyzed.<br />Identify the task. Select the appropriate data mining task to perform on the data from the Data Mining or Table Analysis ribbon.<br />Get results. Examine the output from the task delivered through clear and intuitive visualizations directly in the Excel 2007 environment.<br />The automated tasks provided in the Data Mining Add-Ins for Excel 2007 include:<br />Analyze Key Influencers - Detects the key characteristics that influence a certain outcome. A detailed report that ranks the key influencers based on importance is generated, enabling users to compare key factors for each set of distinct values.<br />Detect Categories - Helps users to identify and segment data based on common properties. A detailed report describing the discovered categories is generated, enabling re-labeling of categories with meaningful naming for further analysis.<br />Fill From Example - Helps users to complete a partially populated column automatically based on patterns in the table. A report explaining the detected patterns is generated, enabling users to re-analyze the data and refine patterns as more knowledge is acquired.<br />Forecast - Enables users to predict future values based on trends in the data set. The forecast values are added to the original table and charts displaying past and forecast evolution of the series are generated.<br />Highlight Exceptions - Enables users to detect cases in the data set that include values outside the expected range. The rows containing the exceptions are highlighted and the actual column likely to cause the exception is emphasized.<br />Scenario Analysis: What If - Enables users to gain insight into the impact of a potential change that is applied to one value on other values of the data set.<br />Scenario Analysis: Goal Seeking - Enables users to better understand the underlying factors that need to be changed to achieve a desired value in a certain target column (complementary to the What-If tool).<br />Prediction Calculator - Related to the Analyze Key Influencers task, the Prediction Calculator generates an interactive form for scoring new cases. The influence of each attribute is translated into a set of scores. A summary of a combination of attributes, which apply to a new case, predicts probable future behaviors.<br />Shopping Basket Analysis - Enables users to detect the relationship between items frequently purchased together. A report explaining the relationships can provide a better understanding of the financial significance, providing insight into bundling offerings or improved product placement.<br />The easy to understand, graphical output from these tools provides a seamless transition between exploration and discovery, and empowers users with rich prediction and insight that clearly translates into recommendations and actions.<br />Collaborative: Share insights throughout the organization - Having performed predictive analysis in Excel 2007, users can use the powerful publishing tools of the 2007 Office System to share findings and inform business decisions throughout the organization. For example, users can share analysis through interactive graphical visualizations in Office Visio® 2007 diagrams, or they can share tables, reports, and diagrams through Microsoft Office SharePoint® Server 2007.<br />Comprehensive Development Environment<br />The 2007 Office System is an ideal desktop tool for information workers, but for BI developers who deploy solutions throughout the enterprise, SQL Server Business Intelligence Development Studio is the environment of choice because it has a project-based environment, complete with debugging and source control integration that you can use to create end-to-end BI solutions.<br />Of course, pervasive delivery of data mining functionality is only useful if developers can build data mining solutions that meet the needs of the business quickly and easily. SQL Server Business Intelligence Development Studio provides a comprehensive development environment that is based on the Microsoft Visual Studio® development system. With Business Intelligence Development Studio, developers can create data mining structures, which identify the tables and columns to be included in the analysis, and add multiple data mining models that apply data mining algorithms to the data in those tables. The Analysis Services project template in Business Intelligence Development Studio, shown in Figure 2, includes an intuitive Data Mining Designer for creating and viewing data mining models, and provides cross-validation, lift charts, and profit charts to compare and contrast the quality of models visually and through statistical scores of error and accuracy before deploying them. <br />Figure 2: Data Mining Designer in Business Intelligence Development Studio<br />SQL Server 2008 introduces a number of enhancements to the already comprehensive development environment of SQL Server 2005, including the ability to:<br />Split data into training and testing partitions more effectively. Partitioning is available within the process of creating the data mining model. Developers can identify a portion of the training dataset to be randomly selected for testing.<br />Build models over filtered data. Data filtering enables the creation of mining models that use subsets of data in a mining structure. Filtering provides flexibility for designing mining structures and data sources, because developers can create a single mining structure, based on a comprehensive data source view, and then apply filters to use only a part of that data for training and testing a variety of models, instead of building a different structure and related model for each subset of data. For example, a developer could define the data source view on the Customers table and related tables, build a single mining structure that includes all of the required fields, and then create a model that is filtered on a particular customer attribute, such as Region. The developer can then easily make a copy of that model, and change the filter condition to generate a new model based on a different region. By applying filters to data models, you can:<br />Create separate models for discrete values. For example, a clothing store might use customer demographics to build separate models by gender, even though the sales data comes from a single data source for all customers.<br />Experiment with models by creating and then testing multiple groupings of the same data, such as ages 20-30 versus ages 20-40 versus ages 20-25.<br />Specify complex filters on nested table contents, such as requiring that a case be included in the model only if the customer has purchased at least two of a particular item.<br />Build incompatible models within the same structure. Models using continuous or discretized versions of the same column can co-exist in a single structure with the new aliasing ability in the Mining Model Editor in Business Intelligence Development Studio.<br />Test multiple models simultaneously with cross-validation. The models created by data mining algorithms have various applications that require different accuracy and stability measurements. Depending on the application, users demand these measurements. Additionally these measurements assist in ensuring that various settings result in the best model for a current data set and a given application. SQL Server 2008 offers a robust cross-validation feature that can test all of the models in a structure simultaneously by using a folding technique. This enables users to test a variety of settings on a subset of data before committing to an expensive processing step. Cross-validation results also tell users if the model results are stable or if the results would change given more or less data. Figure 3 shows a cross-validation report in the Data Mining Designer.<br />Figure 3: Cross-validation<br />Enterprise-Grade Capabilities<br />SQL Server Predictive Analysis is part of SQL Server Analysis Services, which provides enterprise-class server advantages: rapid development, high availability, superior performance and scalability, robust security, and enhanced manageability through SQL Server Management Studio. This enterprise-level capability means that the data mining technologies enabling predictive analysis can grow with the business and provide a high performance, scalable solution for any size of organization.<br />Rich and Innovative Algorithms<br />Different businesses have different goals and need to make different decisions. For this reason, any data mining technology must support a comprehensive set of capabilities and algorithms to meet a diverse range of business needs. SQL Server 2008 Analysis Services includes data mining technologies that support many rich and innovative algorithms, most of them designed by Microsoft Research to solve common business problems. Additionally, the data mining technologies of SQL Server Analysis Services are extensible, enabling you to add plug-in algorithms that meet uncommon analytical needs that are more specific to an individual business. The following table shows some of the tasks that SQL Server data mining can be used to perform.<br />Data Mining Tasks<br />TaskDescriptionAlgorithmsMarket Basket AnalysisDiscover items sold together to create recommendations on-the-fly and to determine how product placement can directly contribute to your bottom line.Association Decision Trees Churn AnalysisAnticipate customers who may be considering canceling their service and identify the benefits that will keep them from leaving.Decision TreesLinear RegressionLogistic RegressionMarket AnalysisDefine market segments by automatically grouping similar customers together. Use these segments to seek profitable customers.Clustering Sequence Clustering ForecastingPredict sales and inventory amounts and learn how they are interrelated to foresee bottlenecks and improve performance.Decision Trees Time Series Data ExplorationAnalyze profitability across customers, or compare customers that prefer different brands of the same product to discover new opportunities.Neural NetworkUnsupervised LearningIdentify previously unknown relationships between various elements of your business to inform your decisions.Neural NetworkWeb Site AnalysisUnderstand how people use your Web site and group similar usage patterns to offer a better experience.Sequence Clustering Campaign AnalysisSpend marketing funds more effectively by targeting the customers most likely to respond to a promotion.Decision Trees Naïve Bayes Clustering Information QualityIdentify and handle anomalies during data entry or data loading to improve the quality of information.Linear RegressionLogistic RegressionText AnalysisAnalyze feedback to find common themes and trends that concern your customers or employees, informing decisions with unstructured input.Text Mining<br />Insight at Every Step of the Data Lifecycle<br />Whether consuming, analyzing, monitoring, planning, exploring, or reporting on business data, predictive analysis can add rich insight to expose new avenues for growth. SQL Server 2008 is part of a family of business intelligence technologies, all working together to deliver a comprehensive platform that enables organizations to incorporate predictive analysis into every stage of the data life cycle.<br />Native Reporting Integration<br />Reporting is a fundamental activity in most businesses, and SQL Server 2008 Reporting Services provides a comprehensive solution for creating, rendering, and deploying reports throughout the enterprise. SQL Server Reporting Services can render reports directly from a data mining model by using a data mining extensions (DMX) query. This enables users to visualize the content of data mining models for optimized data representation. Furthermore, the ability to query directly against the data mining structure enables users to easily include attributes beyond the scope of the mining model requirements, presenting complete and meaningful information. Figure 4 shows the DMX query editor for Reporting Services.<br />Figure 4: The DMX query editor for SQL Server Reporting Services<br />SQL Server Reporting Services provides the ability to generate parameter-driven reports based on predictive probability. For example, the query shown in Figure 4 analyzes a list of prospective customers for the hypothetical Adventure Works cycle company and uses a data mining model to assess the probability of those customers buying a bicycle. The query is filtered to return only prospects that are more than 50% likely to make a purchase. Figure 5 shows the resulting report, which the company could use as the basis for a marketing campaign that targets only the customers most likely to make a purchase, significantly improving the effectiveness of the campaign and its return on investment. <br />Figure 5: A predictive analysis report<br />In-Flight Data Mining During Data Integration<br />As Business Intelligence becomes more pervasive, businesses are increasingly implementing extract, transform, and load (ETL) solutions to consolidate data from around the organization into a data warehouse for reporting and analysis. However, the source data for these operations can often be incomplete, or in some cases business entities, such as customers, might need to be classified into categories based on common profile characteristics. <br />Microsoft SQL Server 2008 Integration Services provides a powerful, extensible ETL platform that Business Intelligence solution developers can use to implement ETL operations that cleanse and transform data in-flight. SQL Server Integration Services includes a Data Mining Model Training destination for training data mining models, and a Data Mining Query transformation that can be used to perform predictive analysis on data as it is passed through the data flow. Integrating predictive analysis with SQL Server Integration Services enables organizations to flag unusual data, classify business entities, perform text mining, and fill-in missing values on the fly based on the power and insight of the data mining algorithms. For example, an ETL process might extract customer data from one or more source systems for inclusion in a data warehouse. Traditionally, data mining would be used after the data warehouse is loaded, to classify customers for predicted purchasing behavior or other campaign management tasks. However, with SQL Server Integration Services, the Data Mining Query Transformation can apply a data mining model during the ETL process, resulting in a data warehouse that is populated with classified data at load time. This reduces the work that must be done on the warehouse server, and ensures that the data available for analysis is always up-to-date and consistently classified. Moreover, classification during the ETL process may also be used to filter out customer records that do not fit any known classification. These records may be the result of poor data quality, or may represent a new classification not yet captured in the campaign management process. In either case, SQL Server Integration Services can detect these records by using data mining and redirect them for manual or automated review. <br />Figure 6 shows a SQL Server Integration Services data flow that includes a Data Mining Query transformation.<br />Figure 6: Data mining in SQL Server Integration Services<br />Insightful Analysis<br />SQL Server 2008 Analysis Services provides a highly scalable platform for multidimensional OLAP analysis. Many customers are already reaping the benefits of creating a unified dimensional model (UDM) in Analysis Services and using it to slice and dice business measures by multiple dimensions. Predictive analysis, being part of SQL Server 2008 Analysis Services provides a richer OLAP experience, featuring data mining dimensions that slice your data by the hidden patterns within. For example, a sales and marketing department can create a data mining structure that is based on an existing Customer OLAP dimension and use it to classify customers into clusters that exhibit similar characteristics. They can then use that data mining structure to generate a new data mining dimension and use it to analyze sales information based on the customer clusters that have been identified. Figure 7 shows a data mining dimension in an OLAP cube.<br />Figure 7: A data mining dimension in an OLAP cube<br />In addition to incorporating the results of data mining into OLAP dimensions, SQL Server 2008 enables you to incorporate predictive functions based on data mining models into calculations and KPIs.<br />Predictive KPIs<br />Many businesses use KPIs to evaluate critical business metrics against targets. SQL Server 2008 Analysis Services provides a centralized platform for KPIs across the organization, and integration with Microsoft Office PerformancePoint® Server 2007 enables decision makers to build business dashboards from which they can monitor the company’s performance. KPIs are traditionally retrospective, for example showing last month’s sales total compared to the sales target. However, with the insights made possible through data mining, organizations can build predictive KPIs that forecast future performance against targets, giving the business an opportunity to detect and resolve potential problems proactively. Figure 8 shows a KPI that displays the anticipated number of orders that are predicted to be placed.<br />Figure 8: Microsoft Office PerformancePoint Server 2007<br />Additionally, predictive analysis can detect attributes that influence KPIs. Together with Office PerformancePoint Server 2007, users can monitor trends in key influencers to recognize those attributes that have a sustained effect, for example identifying whether price discount on a competing product has a lasting impact on sales or only generates a short-term interference. Such insights enable businesses to inform and improve their response strategy. <br />Data Mining Awareness in Every Application<br />As you have seen in this whitepaper so far, SQL Server 2008 provides a comprehensive data mining solution, and the tight integration with the Microsoft Business Intelligence platform makes it easy to provide predictive analysis to users and automated processes across the enterprise. However, there may still be occasions where organizations need to embed data mining functionality into an application, to introduce intelligence into an existing business process, or to extend data mining technologies to meet a specific business problem. For this purpose, SQL Server offers a flexible and extensible programming platform for seamlessly incorporating prediction and insight into line-of-business applications.<br />Predictive Programming<br />SQL Server 2008 data mining supports a number of application programming interfaces (APIs) that developers can use to build custom solutions that take advantage of the predictive analysis capabilities in SQL Server. DMX, XMLA, OLEDB and ADOMD.NET, and Analysis Management Objects (AMO) offer a rich, fully documented development platform, empowering developers to build data mining aware applications and providing real-time discovery and recommendation through familiar tools. <br />This extensibility creates an opportunity for business organizations and independent software vendors (ISVs) to embed predictive analysis into line-of-business applications, introducing insight and forecasting that inform business decisions and processes. For example, the Analytics Foundation adds predictive scoring to Microsoft Dynamics® CRM, to enable information workers across sales, marketing, and service organizations to identify attainable opportunities that are more likely to lead to a sale, increasing efficiency and improving productivity (for more information, see the Microsoft Dynamics site).<br />Plug-In Algorithms and Custom Visualizations<br />The SQL Server data mining toolset is fully extensible through Microsoft .NET–stored procedures, plug-in algorithms, custom visualizations and PMML. This enables developers to extend the out-of-the-box data mining technologies of SQL Server 2008 to meet uncommon business needs that are specific to the organization by:<br />Creating custom data mining algorithms to solve business-specific analytical problems.<br />Using data mining algorithms from other software vendors.<br />Creating custom visualizations of data mining models through plug-in viewer APIs.<br />Conclusion <br />SQL Server 2008 Analysis Services provides a complete data mining platform that organizations can use to infuse insight and prediction into everyday business decisions. Pervasive delivery through the Data Mining Add-Ins for the 2007 Office system delivers predictive analysis capabilities with intuitive tools and clear results that are available throughout the enterprise at the desktop. The comprehensive development environment and extensible range of innovative data mining algorithms combined with the enterprise-level scalability and manageability of SQL Server Analysis Services makes SQL Server 2008 an ideal way to bring the benefits of predictive analysis to your business.<br />Because the predictive analysis capabilities of SQL Server 2008, as part of the Microsoft BI platform, are closely integrated into every stage of the data life cycle, they incorporate intelligence into reporting, data integration, OLAP analysis, and business performance monitoring. This helps organizations increase business agility and creates a tangible competitive advantage.<br />Although the data mining functionality provided with SQL Server 2008 is comprehensive enough to meet the needs of a wide range of business scenarios, its extensibility ensures that it can be used to solve virtually any predictive problem. The ability to extend the data mining technologies of SQL Server through custom algorithms and visualizations, together with the ability to embed predictive functionality into line-of-business applications makes SQL Server 2008 a powerful platform for introducing predictive analysis into existing business processes to add insight and recommendations into everyday operations.<br />For more information:<br />Microsoft SQL Server 2008http://www.microsoft.com/sqlserver/2008/en/us/default.aspx<br />SQL Server Developer Centerhttp://msdn2.microsoft.com/sqlserver<br />SQL Server TechCenterhttp://technet.microsoft.com/sqlserver<br />Please give us your feedback:<br />Did this paper help you? Tell us on a scale of 1 (poor) to 5 (excellent), how would you rate this paper and why have you given it this rating? For example:<br />Are you giving it a high rating because it has good examples, excellent screenshots, clear writing, or another reason? <br />Are you giving it a low rating because it has poor examples, fuzzy screenshots, unclear writing?<br />This feedback will help us improve the quality of white papers we release. Send feedback.<br />