Data Mining Concepts and Tools

27 May 2012 · 10656 views

Fundamentals of SQL Server Data Mining

This 50-minute video introduces the fundamental concepts of Data Mining, a powerful analytical technology. We start with the process of data mining and the SQL Server Analysis Services (SSAS) Data Mining architecture, using the Multidimensional and Data Mining mode of SSAS. We then introduce the data mining tools, starting with Excel and the free Data Mining Add-Ins for Office, and focusing on SQL Server Data Tools (SSDT), which are well suited to longer-duration analytical mining projects.

The most fundamental concept in data mining is that of Cases, which represent the entities you wish to analyse, such as customers, products, or events. The simplest form of a case is just a flat, denormalized row of data. We also briefly explain two other formats of cases: Customer Signatures, which contain as-of validity dates, and Nested Cases, which we focus on towards the end of this tutorial, where you can also see a demo comparing the use of Decision Trees with, and without, nesting.
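
To make the difference between a flat case and a nested case concrete, here is a minimal Python sketch. It is an illustration only and not how SSAS stores cases: the class names, the column names, and the `to_flat_rows` helper are all hypothetical. It shows that a nested case keeps one entity (a customer) as one case with a child table, whereas flattening the same data produces several rows per customer.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Union


@dataclass
class Purchase:
    """One row of the nested (child) table. Hypothetical columns."""
    product: str
    quantity: int


@dataclass
class NestedCase:
    """One case per customer, with purchases kept as a nested table."""
    customer_id: int
    age: int
    region: str
    purchases: List[Purchase] = field(default_factory=list)


Row = Dict[str, Optional[Union[int, str]]]


def to_flat_rows(case: NestedCase) -> List[Row]:
    """Denormalize a nested case: one row per purchase, repeating the
    case-level columns. A customer without purchases yields one row."""
    parent: Row = {
        "customer_id": case.customer_id,
        "age": case.age,
        "region": case.region,
    }
    if not case.purchases:
        return [{**parent, "product": None, "quantity": None}]
    return [
        {**parent, "product": p.product, "quantity": p.quantity}
        for p in case.purchases
    ]


example = NestedCase(
    customer_id=42,
    age=37,
    region="North",
    purchases=[Purchase("Sedan", 1), Purchase("Roof rack", 2)],
)

for row in to_flat_rows(example):
    print(row)
```

After flattening, customer 42 appears in two rows, so a row no longer corresponds to exactly one entity. Keeping the purchases nested preserves the one-case-per-customer view.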

You will also learn about the concepts of Mining Structures, used to describe Cases, Mining Models, and Mining Algorithms. We briefly introduce 9 of the Microsoft data mining algorithms: Naïve Bayes, Clustering, Decision Trees, Association Rules, Sequence Clustering, Neural Networks, Logistic Regression, Linear Regression, and Time Series. They will be explained in more detail in other modules of this series.
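
The relationship between these three concepts can be sketched in a few lines of Python. This is a conceptual model only, under the assumption that one structure describes the case and several models, each tied to one algorithm, share it. The class names, fields, and content-type labels below are hypothetical and are not the SSAS object model or DMX syntax.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class MiningStructure:
    """Describes the shape of a case: column names and content types."""
    name: str
    columns: Dict[str, str]  # column name -> content type (illustrative labels)


@dataclass
class MiningModel:
    """One algorithm applied to a chosen subset of a structure's columns."""
    name: str
    algorithm: str
    structure: MiningStructure
    inputs: List[str]
    predictable: str

    def __post_init__(self) -> None:
        # A model may only use columns that its structure defines.
        unknown = [
            c for c in self.inputs + [self.predictable]
            if c not in self.structure.columns
        ]
        if unknown:
            raise ValueError(
                f"columns not in structure {self.structure.name!r}: {unknown}"
            )


customers = MiningStructure(
    name="Customers",
    columns={
        "CustomerId": "Key",
        "Age": "Discretized",
        "Region": "Discrete",
        "BoughtCar": "Discrete",
    },
)

# Two models built on the same structure, each with a different algorithm.
tree_model = MiningModel(
    "CustomersDT", "Decision Trees", customers, ["Age", "Region"], "BoughtCar"
)
bayes_model = MiningModel(
    "CustomersNB", "Naive Bayes", customers, ["Region"], "BoughtCar"
)

print(tree_model.algorithm, "and", bayes_model.algorithm,
      "share structure", customers.name)
```

Sharing one structure between models makes it straightforward to compare algorithms on the same cases.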

The remainder of this module discusses Column Data Types (especially Text, Long, and Double), and Column Content Types, focussing on the differences between Continuous, Discrete, and Discretized data. You will hear about different approaches to automatic Discretization, including the Equal Areas, Clusters, and Thresholds techniques, and about assisting the algorithms by hinting the statistical distribution of data in a column, such as Normal, LogNormal, or Uniform.
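
As a rough illustration of what discretization does to a continuous column, the Python sketch below buckets a skewed set of values in two ways. The first follows the general idea behind Equal Areas, where each bucket receives about the same number of values. The second uses plain equal-width ranges, shown only as a contrast and not as one of the techniques named above. The function names and the sample data are assumptions, and this is not the SSAS implementation.

```python
import bisect
from collections import Counter
from typing import List, Sequence


def equal_areas_edges(values: Sequence[float], buckets: int) -> List[float]:
    """Cut points chosen so each bucket holds roughly the same number of values."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(i * n) // buckets] for i in range(1, buckets)]


def equal_width_edges(values: Sequence[float], buckets: int) -> List[float]:
    """Cut points that split the value range into ranges of equal width."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / buckets
    return [lo + i * width for i in range(1, buckets)]


def bucket_counts(values: Sequence[float], edges: List[float]) -> List[int]:
    """Count how many values land in each bucket defined by the cut points.
    A value equal to a cut point goes into the bucket above it."""
    counts = Counter(bisect.bisect_right(edges, v) for v in values)
    return [counts.get(i, 0) for i in range(len(edges) + 1)]


# Hypothetical skewed column: most values are small, two are large.
values = [1, 2, 2, 3, 3, 4, 5, 40, 90]

areas = equal_areas_edges(values, 3)
widths = equal_width_edges(values, 3)

print("equal areas:", areas, bucket_counts(values, areas))
print("equal width:", widths, bucket_counts(values, widths))
```

On this skewed sample, the equal-areas cut points give three buckets of three values each, while the equal-width ranges place seven of the nine values in the first bucket. This is one reason the choice of discretization method matters for skewed data.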

To help you learn, there are 5 demos in this module, which you can follow using your own datasets, Adventure Works from GitHub, or by downloading our educational dataset, HappyCars, available when you purchase access to this course.


  • Introduction to Data Mining with Microsoft SQL Server 24-min Watch with Free Subscription

  • Data Mining Concepts and Tools 50-min

  • Data Mining Model Building, Testing and Predicting with Microsoft SQL Server and Excel 1-hour 20-min

  • What Are Decision Trees? 10-min Free—Watch Now

  • Decision Trees in Depth 1-hour 54-min

  • Why Cluster and Segment Data? 9-min Watch with Free Subscription

  • Clustering in Depth 1-hour 50-min

  • What is Market Basket Analysis? 10-min Watch with Free Subscription

  • Association Rules in Depth 1-hour 35-min

  • HappyCars Sample Data Set for Learning Data Mining

  • Additional Code and Data Samples (R, ML Services, SSAS) Get with Free Subscription
